Maximum Likelihood Estimation of the Negative Binomial Dispersion Parameter for Highly Overdispersed Data, with Applications to Infectious Diseases

Open Access

14 February 2007

journal article
research article
Published by Public Library of Science (PLoS) in PLOS ONE

Vol. 2 (2), e180
https://doi.org/10.1371/journal.pone.0000180

Abstract

The negative binomial distribution is used commonly throughout biology as a model for overdispersed count data, with attention focused on the negative binomial dispersion parameter, k. A substantial literature exists on the estimation of k, but most attention has focused on datasets that are not highly overdispersed (i.e., those with k≥1), and the accuracy of confidence intervals estimated for k is typically not explored. This article presents a simulation study exploring the bias, precision, and confidence interval coverage of maximum-likelihood estimates of k from highly overdispersed distributions. In addition to exploring small-sample bias on negative binomial estimates, the study addresses estimation from datasets influenced by two types of event under-counting, and from disease transmission data subject to selection bias for successful outbreaks. Results show that maximum likelihood estimates of k can be biased upward by small sample size or under-reporting of zero-class events, but are not biased downward by any of the factors considered. Confidence intervals estimated from the asymptotic sampling variance tend to exhibit coverage below the nominal level, with overestimates of k comprising the great majority of coverage errors. Estimation from outbreak datasets does not increase the bias of k estimates, but can add significant upward bias to estimates of the mean. Because k varies inversely with the degree of overdispersion, these findings show that overestimation of the degree of overdispersion is very rare for these datasets.
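As an illustration of the estimation problem the abstract describes, the following is a minimal sketch of maximum-likelihood estimation of k from simulated, highly overdispersed counts. It is not the authors' code. It assumes the mean/dispersion parameterization of the negative binomial (mean m, variance m + m²/k), uses the fact that the maximum-likelihood estimate of the mean is the sample mean, and searches over k by golden-section search on log k. The function names (`nb_logpmf`, `fit_k`, `sample_nb`), the search bounds, and the simulation settings are all hypothetical choices made for this example.

```python
import math
import random


def nb_logpmf(x, m, k):
    """Log-probability of count x under a negative binomial with mean m and dispersion k."""
    return (math.lgamma(x + k) - math.lgamma(k) - math.lgamma(x + 1)
            + k * math.log(k / (k + m)) + x * math.log(m / (k + m)))


def nb_loglik(counts, m, k):
    """Log-likelihood of a sample of counts for given m and k."""
    return sum(nb_logpmf(x, m, k) for x in counts)


def fit_k(counts, lo=1e-3, hi=1e3, tol=1e-8):
    """Return (m_hat, k_hat). m_hat is the sample mean; k_hat maximizes the
    profile log-likelihood within [lo, hi] (assumed bounds). The search
    assumes a single interior maximum, which requires sample variance > mean;
    otherwise the result drifts toward the upper bound."""
    m = sum(counts) / len(counts)
    if m <= 0:
        raise ValueError("all counts are zero; k cannot be estimated")

    def neg_ll(u):  # u = log k
        return -nb_loglik(counts, m, math.exp(u))

    a, b = math.log(lo), math.log(hi)
    g = (math.sqrt(5) - 1) / 2  # golden-ratio step
    c, d = b - g * (b - a), a + g * (b - a)
    fc, fd = neg_ll(c), neg_ll(d)
    while b - a > tol:
        if fc < fd:  # minimum lies in [a, d]
            b, d, fd = d, c, fc
            c = b - g * (b - a)
            fc = neg_ll(c)
        else:        # minimum lies in [c, b]
            a, c, fc = c, d, fd
            d = a + g * (b - a)
            fd = neg_ll(d)
    return m, math.exp((a + b) / 2)


def sample_nb(m, k, n, rng):
    """Draw n negative binomial counts as a gamma-Poisson mixture."""
    out = []
    for _ in range(n):
        lam = rng.gammavariate(k, m / k)  # gamma rate: shape k, scale m/k
        # Knuth's multiplication method for a Poisson(lam) draw
        limit, p, x = math.exp(-lam), 1.0, 0
        while True:
            p *= rng.random()
            if p <= limit:
                break
            x += 1
        out.append(x)
    return out


if __name__ == "__main__":
    rng = random.Random(1)
    data = sample_nb(2.0, 0.5, 500, rng)  # true k = 0.5, i.e. k < 1
    m_hat, k_hat = fit_k(data)
    print("estimated mean:", m_hat, "estimated k:", k_hat)
```

Repeating the simulation over many seeds and smaller sample sizes, and comparing the distribution of the k estimates with the true value, gives a rough picture of the small-sample bias the study examines.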
