» Articles » PMID: 28178947

An Integrative Bayesian Dirichlet-multinomial Regression Model for the Analysis of Taxonomic Abundances in Microbiome Data

Overview
Publisher Biomed Central
Specialty Biology
Date 2017 Feb 10
PMID 28178947
Citations 27
Authors
Affiliations
Soon will be listed here.
Abstract

Background: The Human Microbiome has been variously associated with the immune-regulatory mechanisms involved in the prevention or development of many non-infectious human diseases such as autoimmunity, allergy and cancer. Integrative approaches which aim at associating the composition of the human microbiome with other available information, such as clinical covariates and environmental predictors, are paramount to develop a more complete understanding of the role of microbiome in disease development.

Results: In this manuscript, we propose a Bayesian Dirichlet-Multinomial regression model which uses spike-and-slab priors for the selection of significant associations between a set of available covariates and taxa from a microbiome abundance table. The approach allows straightforward incorporation of the covariates through a log-linear regression parametrization of the parameters of the Dirichlet-Multinomial likelihood. Inference is conducted through a Markov Chain Monte Carlo algorithm, and selection of the significant covariates is based upon the assessment of posterior probabilities of inclusions and the thresholding of the Bayesian false discovery rate. We design a simulation study to evaluate the performance of the proposed method, and then apply our model on a publicly available dataset obtained from the Human Microbiome Project which associates taxa abundances with KEGG orthology pathways. The method is implemented in specifically developed R code, which has been made publicly available.

Conclusions: Our method compares favorably in simulations to several recently proposed approaches for similarly structured data, in terms of increased accuracy and reduced false positive as well as false negative rates. In the application to the data from the Human Microbiome Project, a close evaluation of the biological significance of our findings confirms existing associations in the literature.

Citing Articles

Analyzing microbiome data with taxonomic misclassification using a zero-inflated Dirichlet-multinomial model.

Koslovsky M BMC Bioinformatics. 2025; 26(1):69.

PMID: 40016656 PMC: 11869466. DOI: 10.1186/s12859-025-06078-4.


A regularized Bayesian Dirichlet-multinomial regression model for integrating single-cell-level omics and patient-level clinical study data.

Guo Y, Yu L, Guo L, Xu L, Li Q Biometrics. 2025; 81(1).

PMID: 39887052 PMC: 11783250. DOI: 10.1093/biomtc/ujaf005.


Score matching for differential abundance testing of compositional high-throughput sequencing data.

Ostner J, Li H, Muller C bioRxiv. 2024; .

PMID: 39713439 PMC: 11661129. DOI: 10.1101/2024.12.05.627006.


A Survey of Statistical Methods for Microbiome Data Analysis.

Lutz K, Jiang S, Neugent M, De Nisco N, Zhan X, Li Q Front Appl Math Stat. 2024; 8.

PMID: 39575140 PMC: 11581570. DOI: 10.3389/fams.2022.884810.


Association of body index with fecal microbiome in children cohorts with ethnic-geographic factor interaction: accurately using a Bayesian zero-inflated negative binomial regression model.

Huang J, Lu Y, Tian F, Ni Y mSystems. 2024; 9(12):e0134524.

PMID: 39570024 PMC: 11651110. DOI: 10.1128/msystems.01345-24.


References
1.
Wu G, Chen J, Hoffmann C, Bittinger K, Chen Y, Keilbaugh S . Linking long-term dietary patterns with gut microbial enterotypes. Science. 2011; 334(6052):105-8. PMC: 3368382. DOI: 10.1126/science.1208344. View

2.
Chen J, Bushman F, Lewis J, Wu G, Li H . Structure-constrained sparse canonical correlation analysis with an application to microbiome data analysis. Biostatistics. 2012; 14(2):244-58. PMC: 3590923. DOI: 10.1093/biostatistics/kxs038. View

3.
La Rosa P, Brooks J, Deych E, Boone E, Edwards D, Wang Q . Hypothesis testing and power calculations for taxonomic-based human microbiome data. PLoS One. 2013; 7(12):e52078. PMC: 3527355. DOI: 10.1371/journal.pone.0052078. View

4.
Lange A, Jost S, Heider D, Bock C, Budeus B, Schilling E . AmpliconDuo: A Split-Sample Filtering Protocol for High-Throughput Amplicon Sequencing of Microbial Communities. PLoS One. 2015; 10(11):e0141590. PMC: 4629888. DOI: 10.1371/journal.pone.0141590. View

5.
Cryan J, OMahony S . The microbiome-gut-brain axis: from bowel to behavior. Neurogastroenterol Motil. 2011; 23(3):187-92. DOI: 10.1111/j.1365-2982.2010.01664.x. View