×

An empirical approach to determine a threshold for assessing overdispersion in Poisson and negative binomial models for count data. (English) Zbl 07550064

Summary: Overdispersion is a problem encountered in the analysis of count data that can lead to invalid inference if unaddressed. Decision about whether data are overdispersed is often reached by checking whether the ratio of the Pearson chi-square statistic to its degrees of freedom is greater than one; however, there is currently no fixed threshold for declaring the need for statistical intervention. We consider simulated cross-sectional and longitudinal datasets containing varying magnitudes of overdispersion caused by outliers or zero inflation, as well as real datasets, to determine an appropriate threshold value of this statistic which indicates when overdispersion should be addressed.

MSC:

62J12 Generalized linear models (logistic models)
62Fxx Parametric inference

Software:

COUNT; glm
PDFBibTeX XMLCite
Full Text: DOI Link

References:

[1] Aberle, D. R.; Adams, A. M., Reduced lung-cancer mortality with low-dose computed tomographic screening, 2011. New England Journal of Medicine, 365, 5, 395-409
[2] Breslow, N., Tests of hypotheses in overdispersed poisson regression and other quasi-likelihood models, 1990. Journal of the American Statistical Association, 85, 410, 565-571
[3] Cameron, A. C.; Trivedi, P. K., Econometric models based on count data: comparison and application of some estimators and tests, 1986. Econometrics, 1, 1, 29-53
[4] Cameron, A. C.; Trivedi, P. K., Regression-based tests for overdispersion in the poisson model, 1990. Journal of Econometrics, 46, 347-364
[5] Cameron, A. C.; Trivedi, P. K., 1998. Regression analysis of count data, New York: Cambridge University Press, New York · Zbl 0924.62004
[6] Collings, B. J.; Margolin, B. H., Testing goodness-of-fit for the Poisson assumption when observations are not identically distributed, 1985. Journal of the American Statistical Association, 80, 390, 411-418 · Zbl 0578.62024
[7] Cox, D. R., Some remarks on overdispersion, 1983. Biometrika, 70, 1, 269-274 · Zbl 0511.62007
[8] Dean, C. B., Testing for overdispersion in poisson and binomial regression models, 1992. American Statistical Association, 87, 418, 451-457
[9] Dean, C.; Lawless, J. F., Tests for detecting overdispersion in Poisson regression models, 1989. Journal of the American Statistical Association, 84, 467-472
[10] Faddy, M. J.; Smith, D. M., Analysis of count data with covariate dependence in both mean and variance, 2011. Journal of Applied Statistics, 38, 12, 2683-2694 · Zbl 1511.62332
[11] Gurmu, S., Tests for detecting overdispersion in the positive poisson regression model, 1991. American Statistical Association, 9, 2, 215-222
[12] Hardin, J.; Hilbe, J. M., 2001, 2007. Generalized linear models and extensions, College Station, Texas: Stata Press, College Station, Texas
[13] Hilbe, J. M., 2007, 2011. Negative binomial regression, Cambridge: Cambridge University Press, Cambridge · Zbl 1131.62068
[14] Lambert, D.; Roeder, K., Overdispersion diagnostics for generalized linear models, 1995. Journal of the American Statistical Association, 90, 432, 1225-1236 · Zbl 0864.62048
[15] Lawless, J. F., Regression methods for poisson process data, 1987. American Statistical Association, 82, 399, 808-815 · Zbl 0657.62103
[16] Lee, S.; Park, C.; Bynng, S. K., Tests for detecting overdispersion in poisson models, 2007. Communications in Statistics - Theory and Methods, 24, 9, 2405-2420 · Zbl 0937.62559
[17] McCullagh, P.; Nelder, J. A., 1983, 1989. Generalized linear models, London: London, New York: Chapman and Hall, London: London, New York · Zbl 0588.62104
[18] Molla, D. T.; Muniswamy, B., Power of tests for overdispersion parameter in negative binomial regression model, 2012. IOSR Journal of Mathematics, 1, 4, 29-36
[19] Morel, J. G.; Neerchal, N. K., 2012. Overdispersion Models in SAS, Cary, NC: SAS Institute, Inc., Cary, NC
[20] Mortelmans, K.; Zeiger, E., The Ames Salmonella/microsome mutagenicity assay, 2000. Mutation Research/Fundamental and Molecular Mechanisms of Mutagenesis, 455, 1-2, 29-60
[21] O’Hara Hines, R. J., A comparison of score tests for overdispersion in generalized linear models, 1997. Journal of Statistical Computation and Simulation, 58, 1, 323-342 · Zbl 1065.62535
[22] Payne, E. H.; Hardin, J. W.; Egede, L. E.; Ramakrishnan, V.; Selassie, A.; Gebregziabher, M., Approaches for dealing with various sources of overdispersion in modeling count data: scale adjustment versus modeling, 2015. Statistical Methods in Medical Research
[23] Rigby, R. A.; Stasinopoulos, D. M., A framework for modelling overdispersed count data, including the Poisson-shifted generalized inverse Gaussian distribution, 2008. Computational Statistics & Data Analysis, 53, 2, 381-393 · Zbl 1231.62019
[24] Rodriguez, G., 2015. Generalized Linear Models, Models for over-dispersed count data (2015), Princeton University
[25] Ruoyan, M., 2004. Estimation of dispersion parameters in GLMs with and without random effects, Stockholm University: Mathematical Statistics, Stockholm University
[26] Yang, Z.; Hardin, J. W.; Addy, C. L., A score test for overdispersion in poisson regression based on the generalized poisson-2 model, 2009. Journal of Statistical Planning and Inference, 139, 4, 1514-21 · Zbl 1153.62013
[27] Yang, Z.; Hardin, J. W.; Addy, C. L.; Vuong, Q. H., Testing approaches for overdispersion in Poisson regression versus the generalized Poisson model, 2007. Biometrical Journal, 49, 4, 565-84 · Zbl 1442.62709
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.