» Articles » PMID: 25620827

GEE Type Inference for Clustered Zero-inflated Negative Binomial Regression with Application to Dental Caries

Overview
Date 2015 Jan 27
PMID 25620827
Citations 12
Authors
Affiliations
Soon will be listed here.
Abstract

Use of zero-inflated count data models is common in applications where the number of zero counts exceeds that predicted from a traditional count data model such as Poisson or negative binomial. When count data exhibiting inflated zero counts are correlated among subjects, a natural approach will be to fit a marginal model with the help of generalized estimating equations (GEE) that can incorporate subject-to-subject correlations. A GEE based zero-inflated negative binomial (ZINB) model is proposed to fit clustered counts with excessive zeros. However, the corresponding sandwich variance estimator appears to underestimate the true variance. The theoretical reasons for its failure are explained and a correction under additional modeling assumptions is offered. In addition, a clustered resampling (bootstrap) procedure is proposed to estimate the variance and it is shown that the bootstrap procedure captures the correct variance under no additional model assumptions. Utility of this marginal GEE based ZINB model over two other competing models has been assessed using a thorough simulation study. The resulting inference procedure is applied to study the association between the dental caries and fluoride exposures using a dataset extracted from the Iowa Fluoride Study. A number of risk factors of clinical significance are reliably identified using the proposed model.

Citing Articles

A longitudinal study of alcohol consumption among adults in Victoria, Australia during the COVID-19 pandemic.

Ke T, Livingston M, Zhang Y, Pavlyshyn D, Altermatt A, Thomas A PLoS One. 2024; 19(12):e0313599.

PMID: 39652557 PMC: 11627359. DOI: 10.1371/journal.pone.0313599.


Simulation and nurse-mentoring in a statewide nurse mentoring program in Bihar, India: diagnosis of postpartum hemorrhage and intrapartum asphyxia.

Ghosh R, Cohen S, Spindler H, Vincent D, Sterling M, Das A Gates Open Res. 2023; 6:70.

PMID: 37915730 PMC: 10616110. DOI: 10.12688/gatesopenres.13490.1.


A Novel Phylogenetic Negative Binomial Regression Model for Count-Dependent Variables.

Jhwueng D, Wu C Biology (Basel). 2023; 12(8).

PMID: 37627032 PMC: 10452298. DOI: 10.3390/biology12081148.


Health Impacts of the COVID-19 Lockdown Measure in a Low Socio-Economic Setting: A Cross-Sectional Study on Reunion Island.

Fianu A, Aissaoui H, Naty N, Lenclume V, Casimir A, Chirpaz E Int J Environ Res Public Health. 2022; 19(21).

PMID: 36360811 PMC: 9657094. DOI: 10.3390/ijerph192113932.


Analyzing longitudinal clustered count data with zero inflation: Marginal modeling using the Conway-Maxwell-Poisson distribution.

Kang T, Levy S, Datta S Biom J. 2021; 63(4):761-786.

PMID: 33393147 PMC: 9161738. DOI: 10.1002/bimj.202000061.


References
1.
Levy S, Warren J, Davis C, Kirchner H, Kanellis M, Wefel J . Patterns of fluoride intake from birth to 36 months. J Public Health Dent. 2001; 61(2):70-7. DOI: 10.1111/j.1752-7325.2001.tb03369.x. View

2.
Mwalili S, Lesaffre E, Declerck D . The zero-inflated negative binomial regression model with correction for misclassification: an example in caries research. Stat Methods Med Res. 2007; 17(2):123-39. DOI: 10.1177/0962280206071840. View

3.
Zeger S, Liang K, Albert P . Models for longitudinal data: a generalized estimating equation approach. Biometrics. 1988; 44(4):1049-60. View