
Supervised Classification of Human Microbiota

Overview
Specialty Microbiology
Date 2010 Nov 3
PMID 21039646
Citations 209
Abstract

Recent advances in DNA sequencing technology have allowed the collection of high-dimensional data from human-associated microbial communities on an unprecedented scale. A major goal of these studies is the identification of important groups of microorganisms that vary according to physiological or disease states in the host, but the incidence of rare taxa and the large numbers of taxa observed make that goal difficult to achieve using traditional approaches. Fortunately, similar problems have been addressed by the machine learning community in other fields of study such as microarray analysis and text classification. In this review, we demonstrate that several existing supervised classifiers can be applied effectively to microbiota classification, both for selecting subsets of taxa that are highly discriminative of the type of community, and for building models that can accurately classify unlabeled data. To encourage the development of new approaches to supervised classification of microbiota, we discuss several structures inherent in microbial community data that may be available for exploitation in novel approaches, and we include as supplemental information several benchmark classification tasks for use by the community.
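
The two uses of supervised classification named in the abstract, selecting discriminative taxa and classifying unlabeled samples, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the data are synthetic, the labels "gut" and "skin" and all function names are hypothetical, and the nearest-centroid rule with a mean-difference ranking is a deliberately simple stand-in, not necessarily one of the classifiers evaluated in the review.

```python
import math
import random
from statistics import mean


def make_toy_table(n_per_class=20, n_taxa=50, n_informative=3, seed=0):
    """Build synthetic relative abundances (hypothetical data, not from the paper)."""
    rng = random.Random(seed)
    samples, labels = [], []
    for label in ("gut", "skin"):
        for _ in range(n_per_class):
            counts = [rng.random() for _ in range(n_taxa)]
            if label == "gut":
                # Assumption: the first few taxa are enriched in one class.
                for j in range(n_informative):
                    counts[j] += 3.0
            total = sum(counts)
            samples.append([c / total for c in counts])  # normalize to proportions
            labels.append(label)
    return samples, labels


def rank_taxa(samples, labels, k):
    """Return indices of the k taxa with the largest between-class mean difference."""
    classes = sorted(set(labels))
    a = [s for s, y in zip(samples, labels) if y == classes[0]]
    b = [s for s, y in zip(samples, labels) if y == classes[1]]
    n_taxa = len(samples[0])
    diff = [abs(mean(s[j] for s in a) - mean(s[j] for s in b)) for j in range(n_taxa)]
    return sorted(range(n_taxa), key=lambda j: diff[j], reverse=True)[:k]


def fit_centroids(samples, labels, taxa):
    """Compute one centroid per class, restricted to the selected taxa."""
    centroids = {}
    for c in set(labels):
        rows = [s for s, y in zip(samples, labels) if y == c]
        centroids[c] = [mean(r[j] for r in rows) for j in taxa]
    return centroids


def predict(sample, centroids, taxa):
    """Assign the class whose centroid is nearest in Euclidean distance."""
    point = [sample[j] for j in taxa]
    return min(centroids, key=lambda c: math.dist(point, centroids[c]))


def evaluate(seed=0, k=3):
    """Select taxa and fit on even-indexed samples, then score on odd-indexed ones."""
    samples, labels = make_toy_table(seed=seed)
    train = list(range(0, len(samples), 2))
    test = list(range(1, len(samples), 2))
    x_train = [samples[i] for i in train]
    y_train = [labels[i] for i in train]
    taxa = rank_taxa(x_train, y_train, k)
    centroids = fit_centroids(x_train, y_train, taxa)
    hits = sum(predict(samples[i], centroids, taxa) == labels[i] for i in test)
    return taxa, hits / len(test)


if __name__ == "__main__":
    selected, accuracy = evaluate()
    print("selected taxa:", sorted(selected))
    print("held-out accuracy:", accuracy)
```

Taxa are ranked using the training samples only, so the held-out accuracy is not inflated by feature selection having seen the test labels. On real microbiota data, where taxa are sparse and far more numerous, a regularized or ensemble classifier with cross-validation would be the more usual choice.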

Citing Articles

Predicting nasal diseases based on microbiota relationship network.

Liang Y, Mao J, Qiu T, Li B, Zhang C, Zhang K Sci Prog. 2025; 108(1):368504251320832.

PMID: 39962881 PMC: 11833901. DOI: 10.1177/00368504251320832.


Longitudinal Microbiome-based Interpretable Machine Learning for Identification of Time-Varying Biomarkers in Early Prediction of Disease Outcomes.

Dai Y, Qian Y, Qu Y, Guan W, Xie J, Wang D bioRxiv. 2024.

PMID: 39605360 PMC: 11601495. DOI: 10.1101/2024.10.18.619118.


Wise Roles and Future Visionary Endeavors of Current Emperor: Advancing Dynamic Methods for Longitudinal Microbiome Meta-Omics Data in Personalized and Precision Medicine.

Oh V, Li R Adv Sci (Weinh). 2024; 11(47):e2400458.

PMID: 39535493 PMC: 11653615. DOI: 10.1002/advs.202400458.


Application of machine learning based genome sequence analysis in pathogen identification.

Gao Y, Liu M Front Microbiol. 2024; 15:1474078.

PMID: 39417073 PMC: 11480060. DOI: 10.3389/fmicb.2024.1474078.


DeepPhylo: Phylogeny-Aware Microbial Embeddings Enhanced Predictive Accuracy in Human Microbiome Data Analysis.

Wang B, Shen Y, Fang J, Su X, Xu Z Adv Sci (Weinh). 2024; 11(45):e2404277.

PMID: 39403892 PMC: 11615782. DOI: 10.1002/advs.202404277.