
MetaPheno: A Critical Evaluation of Deep Learning and Machine Learning in Metagenome-based Disease Prediction

Overview
Journal Methods
Specialty Biochemistry
Date 2019 Mar 20
PMID 30885720
Citations 38
Abstract

The human microbiome plays a number of critical roles, impacting almost every aspect of human health and well-being. Conditions in the microbiome have been linked to a number of significant diseases. Additionally, revolutions in sequencing technology have led to a rapid increase in publicly available sequencing data. Consequently, there have been growing efforts to predict disease status from metagenomic sequencing data, with a proliferation of new approaches in the last few years. Some of these efforts have explored deep learning, a powerful form of machine learning that has been applied successfully in several biological domains. Here, we review some of these methods and the algorithms that they are based on, with a particular focus on deep learning methods. We also perform a deeper analysis of Type 2 Diabetes and obesity datasets that have eluded improved results, using a variety of machine learning and feature extraction methods. We conclude by offering perspectives on study design considerations that may impact results and on future directions the field can take to improve results and offer more valuable conclusions. The scripts and extracted features for the analyses conducted in this paper are available via GitHub: https://github.com/nlapier2/metapheno.
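As a rough illustration of what "feature extraction followed by machine learning" can look like for sequencing data, the sketch below turns each sample's reads into a normalized k-mer frequency profile and classifies a new sample by its nearest class centroid. This is an assumption-laden toy, not the method used in the paper: the abstract does not name its feature extractors or models, and the k-mer length, the function names (`kmer_profile`, `centroid`, `predict`), and the reads themselves are all hypothetical and made up for the example.

```python
from collections import Counter
from itertools import product
import math

K = 2  # hypothetical k-mer length, kept tiny so the feature vector stays readable
KMERS = ["".join(p) for p in product("ACGT", repeat=K)]  # fixed feature order


def kmer_profile(reads, k=K):
    """Return one sample's normalized k-mer frequencies, ordered as in KMERS."""
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            if set(kmer) <= set("ACGT"):  # skip k-mers with ambiguous bases such as N
                counts[kmer] += 1
    total = sum(counts.values()) or 1  # avoid division by zero for empty samples
    return [counts[km] / total for km in KMERS]


def centroid(profiles):
    """Average a list of equal-length feature vectors column by column."""
    n = len(profiles)
    return [sum(col) / n for col in zip(*profiles)]


def predict(profile, centroids):
    """Return the label whose centroid is closest in Euclidean distance."""
    return min(centroids, key=lambda label: math.dist(profile, centroids[label]))


# Made-up reads for illustration only; each inner list is one sample.
train = {
    "case": [["ACGTACGT", "ACGTAC"], ["CGTACGTA"]],
    "control": [["GGGGCCCC", "GGCCGG"], ["CCCCGGGG"]],
}
centroids = {
    label: centroid([kmer_profile(sample) for sample in samples])
    for label, samples in train.items()
}

prediction = predict(kmer_profile(["TACGTACG"]), centroids)
print(prediction)
```

A nearest-centroid rule is used here only because it fits in a few lines of standard-library Python. Real studies in this area typically work with far larger feature spaces and evaluate models with cross-validation on held-out samples.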

Citing Articles

Human Papillomavirus, Human Immunodeficiency Virus, and Oral Microbiota Interplay in Nigerian Youth (HOMINY): A Prospective Cohort Study Protocol.

Osagie E, Akhigbe P, Idemudia N, Obuekwe O, Adebiyi R, Schlecht N BMJ Open. 2025; 15(2):e091017.

PMID: 39922591 PMC: 11808902. DOI: 10.1136/bmjopen-2024-091017.


Deep learning in microbiome analysis: a comprehensive review of neural network models.

Przymus P, Rykaczewski K, Martin-Segura A, Truu J, Carrillo De Santa Pau E, Kolev M Front Microbiol. 2025; 15:1516667.

PMID: 39911715 PMC: 11794229. DOI: 10.3389/fmicb.2024.1516667.


A survey of k-mer methods and applications in bioinformatics.

Moeckel C, Mareboina M, Konnaris M, Chan C, Mouratidis I, Montgomery A Comput Struct Biotechnol J. 2024; 23:2289-2303.

PMID: 38840832 PMC: 11152613. DOI: 10.1016/j.csbj.2024.05.025.


Deep learning methods in metagenomics: a review.

Roy G, Prifti E, Belda E, Zucker J Microb Genom. 2024; 10(4).

PMID: 38630611 PMC: 11092122. DOI: 10.1099/mgen.0.001231.


phylaGAN: data augmentation through conditional GANs and autoencoders for improving disease prediction accuracy using microbiome data.

Sharma D, Lou W, Xu W Bioinformatics. 2024; 40(4).

PMID: 38569898 PMC: 11256914. DOI: 10.1093/bioinformatics/btae161.

