» Articles » PMID: 34779034

Precisely Modeling Zero-inflated Count Phenotype for Rare Variants

Overview
Journal Genet Epidemiol
Specialties Genetics
Public Health
Date 2021 Nov 15
PMID 34779034
Authors
Affiliations
Soon will be listed here.
Abstract

Count data with excessive zeros are increasingly ubiquitous in genetic association studies, such as neuritic plaques in brain pathology for Alzheimer's disease. Here, we developed gene-based association tests to model such data by a mixture of two distributions, one for the structural zeros contributed by the Binomial distribution, and the other for the counts from the Poisson distribution. We derived the score statistics of the corresponding parameter of the rare variants in the zero-inflated Poisson regression model, and then constructed burden (ZIP-b) and kernel (ZIP-k) tests for the association tests. We evaluated omnibus tests that combined both ZIP-b and ZIP-k tests. Through simulated sequence data, we illustrated the potential power gain of our proposed method over a two-stage method that analyzes binary and non-zero continuous data separately for both burden and kernel tests. The ZIP burden test outperformed the kernel test as expected in all scenarios except for the scenario of variants with a mixture of directions in the genetic effects. We further demonstrated its applications to analyses of the neuritic plaque data in the ROSMAP cohort. We expect our proposed test to be useful in practice as more powerful than or complementary to the two-stage method.

Citing Articles

Zim4rv: an R package to modeling zero-inflated count phenotype on regional-based rare variants.

Liu X, Li Y, Fan Q BMC Bioinformatics. 2025; 26(1):18.

PMID: 39819419 PMC: 11740424. DOI: 10.1186/s12859-024-06029-5.

References
1.
Li B, Leal S . Methods for detecting associations with rare variants for common diseases: application to analysis of sequence data. Am J Hum Genet. 2008; 83(3):311-21. PMC: 2842185. DOI: 10.1016/j.ajhg.2008.06.024. View

2.
Yu D, Huber W, Vitek O . Shrinkage estimation of dispersion in Negative Binomial models for RNA-seq experiments with small sample size. Bioinformatics. 2013; 29(10):1275-82. PMC: 3654711. DOI: 10.1093/bioinformatics/btt143. View

3.
Kaul A, Mandal S, Davidov O, Peddada S . Analysis of Microbiome Data in the Presence of Excess Zeros. Front Microbiol. 2017; 8:2114. PMC: 5682008. DOI: 10.3389/fmicb.2017.02114. View

4.
Liu Y, Xie J . Cauchy combination test: a powerful test with analytic -value calculation under arbitrary dependency structures. J Am Stat Assoc. 2020; 115(529):393-402. PMC: 7531765. DOI: 10.1080/01621459.2018.1554485. View

5.
Jung J, Dantzer J, Liu Y . Identification of multiple rare variants associated with a disease. BMC Proc. 2012; 5 Suppl 9:S103. PMC: 3287826. DOI: 10.1186/1753-6561-5-S9-S103. View