Analysis of Overdispersed Count Data: Application to the Human Papillomavirus Infection in Men (HIM) Study

Overview
Date 2011 Aug 31
PMID 21875452
Citations 29
Abstract

The Poisson model can be applied to the count of events occurring within a specific time period. Its defining feature is the assumption that the mean and variance of the count data are equal. However, this equal mean-variance relationship rarely holds in observational data. In most cases, the observed variance is larger than the variance the model assumes, a condition called overdispersion. Further, when the observed data contain excessive zero counts, overdispersion leads to underestimating the variance of the estimated parameters, and thus to misleading conclusions. We illustrated the use of four models for overdispersed count data in which the overdispersion may be attributed to excessive zeros: the Poisson, negative binomial, zero-inflated Poisson and zero-inflated negative binomial models. The example data in this article deal with the number of incidents involving human papillomavirus infection. The four models resulted in differing statistical inferences. The Poisson model, which is widely used in epidemiology research, underestimated the standard errors and overstated the significance of some covariates.
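
The mean-variance argument in the abstract can be checked numerically. The following is a minimal sketch on simulated data, not the HIM study data or the authors' analysis: it draws counts from a plain Poisson distribution and from a zero-inflated Poisson distribution, then compares the variance-to-mean ratio (the dispersion index) and the share of zeros. The rate `lam = 2.0`, the extra-zero probability `pi_zero = 0.4`, the sample size, and all function names are assumptions chosen only for illustration.

```python
import math
import random
from statistics import mean, variance


def poisson_draw(lam, rng):
    """Draw one Poisson(lam) variate using Knuth's multiplication method."""
    threshold = math.exp(-lam)
    k, p = 0, 1.0
    while True:
        p *= rng.random()
        if p <= threshold:
            return k
        k += 1


def zip_draw(lam, pi_zero, rng):
    """Draw from a zero-inflated Poisson: a structural zero with
    probability pi_zero, otherwise an ordinary Poisson(lam) count."""
    if rng.random() < pi_zero:
        return 0
    return poisson_draw(lam, rng)


def dispersion_index(counts):
    """Sample variance divided by sample mean; close to 1 under a Poisson model."""
    return variance(counts) / mean(counts)


# Hypothetical settings, chosen for illustration only.
rng = random.Random(0)
n, lam, pi_zero = 20000, 2.0, 0.4

poisson_counts = [poisson_draw(lam, rng) for _ in range(n)]
zip_counts = [zip_draw(lam, pi_zero, rng) for _ in range(n)]

poisson_dispersion = dispersion_index(poisson_counts)
zip_dispersion = dispersion_index(zip_counts)

# Share of zeros actually observed versus the share a Poisson model
# with the same mean would imply, exp(-mean).
zip_zero_fraction = zip_counts.count(0) / n
poisson_implied_zero_fraction = math.exp(-mean(zip_counts))

print(f"Poisson dispersion index: {poisson_dispersion:.2f}")
print(f"ZIP dispersion index:     {zip_dispersion:.2f}")
print(f"ZIP observed zeros:       {zip_zero_fraction:.3f}")
print(f"Poisson-implied zeros:    {poisson_implied_zero_fraction:.3f}")
```

For a zero-inflated Poisson distribution the theoretical dispersion index is 1 + pi_zero × lam, which is 1.8 under these assumed settings, so the simulated ratio should land well above the value of 1 that a Poisson model takes for granted. A Poisson fit to such data would treat the variance as equal to the mean and would therefore report standard errors that are too small, which is the pattern the abstract describes.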

Citing Articles

Use of oral moist tobacco (snus) in puberty and its association with asthma in the population-based RHINESSA study.

Lopez-Cervantes J, Schlunssen V, Senaratna C, Accordini S, Callejas F, Franklin K BMJ Open Respir Res. 2024; 11(1).

PMID: 39038916 PMC: 11268032. DOI: 10.1136/bmjresp-2024-002401.


Modeling County-Level Rare Disease Prevalence Using Bayesian Hierarchical Sampling Weighted Zero-Inflated Regression.

Xie H, Rolka D, Barker L J Data Sci. 2024; 21(1):145-157.

PMID: 38799122 PMC: 11119276. DOI: 10.6339/22-JDS1049.


Health Facilities Readiness and Determinants to Manage Cardiovascular Disease in Afghanistan, Bangladesh, and Nepal: Evidence from the National Service Provision Assessment Survey.

Huda M, Rahman M, Mostofa M, Sarkar P, Islam M, Adam I Glob Heart. 2024; 19(1):31.

PMID: 38524910 PMC: 10959132. DOI: 10.5334/gh.1311.


A Novel Phylogenetic Negative Binomial Regression Model for Count-Dependent Variables.

Jhwueng D, Wu C Biology (Basel). 2023; 12(8).

PMID: 37627032 PMC: 10452298. DOI: 10.3390/biology12081148.


Small-scale field evaluation of transfluthrin-treated eave ribbons and sandals for the control of malaria vectors in rural Tanzania.

Mmbando A, Mponzi W, Ngowo H, Kifungo K, Kasubiri R, Njalambaha R Malar J. 2023; 22(1):43.

PMID: 36739391 PMC: 9898903. DOI: 10.1186/s12936-023-04476-8.

