» Articles » PMID: 29795540

Methods for Phylogenetic Analysis of Microbiome Data

Overview
Journal Nat Microbiol
Date 2018 May 26
PMID 29795540
Citations 37
Authors
Affiliations
Soon will be listed here.
Abstract

How does knowing the evolutionary history of microorganisms affect our analysis of microbiological datasets? Depending on the research question, the common ancestry of microorganisms can be a source of confounding variation, or a scaffolding used for inference. For example, when performing regression on traits, common ancestry is a source of dependence among observations, whereas when searching for clades with correlated abundances, common ancestry is the scaffolding for inference. The common ancestry of microorganisms and their genes are organized in trees-phylogenies-which can and should be incorporated into analyses of microbial datasets. While there has been a recent expansion of phylogenetically informed analytical tools, little guidance exists for which method best answers which biological questions. Here, we review methods for phylogeny-aware analyses of microbiome datasets, considerations for choosing the appropriate method and challenges inherent in these methods. We introduce a conceptual organization of these tools, breaking them down into phylogenetic comparative methods, ancestral state reconstruction and analysis of phylogenetic variables and distances, and provide examples in Supplementary Online Tutorials. Careful consideration of the research question and ecological and evolutionary assumptions will help researchers choose a phylogeny and appropriate methods to produce accurate, biologically informative and previously unreported insights.

Citing Articles

PhyloMix: enhancing microbiome-trait association prediction through phylogeny-mixing augmentation.

Jiang Y, Liao D, Zhu Q, Lu Y Bioinformatics. 2025; 41(2).

PMID: 39799515 PMC: 11849959. DOI: 10.1093/bioinformatics/btaf014.


Applying rearrangement distances to enable plasmid epidemiology with pling.

Frolova D, Lima L, Roberts L, Bohnenkamper L, Wittler R, Stoye J Microb Genom. 2024; 10(10).

PMID: 39401066 PMC: 11472880. DOI: 10.1099/mgen.0.001300.


Phylogenetic association analysis with conditional rank correlation.

Wang S, Yuan B, Cai T, Li H Biometrika. 2024; 111(3):881-902.

PMID: 39239268 PMC: 11373757. DOI: 10.1093/biomet/asad075.


Compositional features analysis by machine learning in genome represents linear adaptation of monkeypox virus.

Zhang S, Li Y, Cai Y, Kang X, Feng Y, Li Y Front Genet. 2024; 15:1361952.

PMID: 38495668 PMC: 10940399. DOI: 10.3389/fgene.2024.1361952.


GENERALIZED MATRIX DECOMPOSITION REGRESSION: ESTIMATION AND INFERENCE FOR TWO-WAY STRUCTURED DATA.

Wang Y, Shojaie A, Randolph T, Knight P, Ma J Ann Appl Stat. 2023; 17(4):2944-2969.

PMID: 38149262 PMC: 10751029. DOI: 10.1214/23-aoas1746.


References
1.
Martiny J, Jones S, Lennon J, Martiny A . Microbiomes in light of traits: A phylogenetic perspective. Science. 2015; 350(6261):aac9323. DOI: 10.1126/science.aac9323. View

2.
Hug L, Baker B, Anantharaman K, Brown C, Probst A, Castelle C . A new view of the tree of life. Nat Microbiol. 2016; 1:16048. DOI: 10.1038/nmicrobiol.2016.48. View

3.
Falkowski P, Fenchel T, DeLong E . The microbial engines that drive Earth's biogeochemical cycles. Science. 2008; 320(5879):1034-9. DOI: 10.1126/science.1153213. View

4.
Bardgett R, Freeman C, Ostle N . Microbial contributions to climate change through carbon cycle feedbacks. ISME J. 2008; 2(8):805-14. DOI: 10.1038/ismej.2008.58. View

5.
Yang Z, Rannala B . Molecular phylogenetics: principles and practice. Nat Rev Genet. 2012; 13(5):303-14. DOI: 10.1038/nrg3186. View