» Articles » PMID: 30197785

A Zero-inflated Beta-binomial Model for Microbiome Data Analysis

Overview
Date 2018 Sep 11
PMID 30197785
Citations 16
Authors
Affiliations
Soon will be listed here.
Abstract

The microbiome is increasingly recognized as an important aspect of the health of host species, involved in many biological pathways and processes and potentially useful as health biomarkers. Taking advantage of high-throughput sequencing technologies, modern bacterial microbiome studies are metagenomic, interrogating thousands of taxa simultaneously. Several data analysis frameworks have been proposed for microbiome sequence read count data and determining the most significant features. However, there is still room for improvement. We introduce a zero-inflated beta-binomial (ZIBB) to model the distribution of microbiome count data and to determine association with a continuous or categorical phenotype of interest. The approach can exploit mean-variance relationships to improve power and adjust for covariates. The proposed method is a mixture model with two components: (i) a zero model accounting for excess zeros and (ii) a count model to capture the remaining component by beta-binomial regression, allowing for overdispersion effects. Simulation studies show that our proposed method effectively controls type I error and has higher power than competing methods to detect taxa associated with phenotype. An R package ZIBBSeqDiscovery is available on R CRAN.

Citing Articles

Wise Roles and Future Visionary Endeavors of Current Emperor: Advancing Dynamic Methods for Longitudinal Microbiome Meta-Omics Data in Personalized and Precision Medicine.

Oh V, Li R Adv Sci (Weinh). 2024; 11(47):e2400458.

PMID: 39535493 PMC: 11653615. DOI: 10.1002/advs.202400458.


A comparative analysis of mutual information methods for pairwise relationship detection in metagenomic data.

Francis D, Sun F BMC Bioinformatics. 2024; 25(1):266.

PMID: 39143554 PMC: 11323399. DOI: 10.1186/s12859-024-05883-7.


Modeling County-Level Rare Disease Prevalence Using Bayesian Hierarchical Sampling Weighted Zero-Inflated Regression.

Xie H, Rolka D, Barker L J Data Sci. 2024; 21(1):145-157.

PMID: 38799122 PMC: 11119276. DOI: 10.6339/22-JDS1049.


Machine learning and deep learning applications in microbiome research.

Hernandez Medina R, Kutuzova S, Nielsen K, Johansen J, Hansen L, Nielsen M ISME Commun. 2023; 2(1):98.

PMID: 37938690 PMC: 9723725. DOI: 10.1038/s43705-022-00182-9.


Statistical Analysis of Multiplex Immunofluorescence and Immunohistochemistry Imaging Data.

Wrobel J, Harris C, Vandekar S Methods Mol Biol. 2023; 2629:141-168.

PMID: 36929077 DOI: 10.1007/978-1-0716-2986-4_8.


References
1.
Zhou Y, Xia K, Wright F . A powerful and flexible approach to the analysis of RNA sequence count data. Bioinformatics. 2011; 27(19):2672-8. PMC: 3179656. DOI: 10.1093/bioinformatics/btr449. View

2.
Zhao N, Chen J, Carroll I, Ringel-Kulka T, Epstein M, Zhou H . Testing in Microbiome-Profiling Studies with MiRKAT, the Microbiome Regression-Based Kernel Association Test. Am J Hum Genet. 2015; 96(5):797-807. PMC: 4570290. DOI: 10.1016/j.ajhg.2015.04.003. View

3.
Qin J, Li Y, Cai Z, Li S, Zhu J, Zhang F . A metagenome-wide association study of gut microbiota in type 2 diabetes. Nature. 2012; 490(7418):55-60. DOI: 10.1038/nature11450. View

4.
Robinson M, McCarthy D, Smyth G . edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2009; 26(1):139-40. PMC: 2796818. DOI: 10.1093/bioinformatics/btp616. View

5.
Macklaim J, Fernandes A, Di Bella J, Hammond J, Reid G, Gloor G . Comparative meta-RNA-seq of the vaginal microbiota and differential expression by Lactobacillus iners in health and dysbiosis. Microbiome. 2014; 1(1):12. PMC: 3971606. DOI: 10.1186/2049-2618-1-12. View