Reliable Biomarker Discovery from Metagenomic Data Via RegLRSD Algorithm

Overview
Publisher Biomed Central
Specialty Biology
Date 2017 Jul 12
PMID 28693478
Citations 6
Abstract

Background: Biomarker detection is a major means of translating biological data into clinical applications. Owing to recent advances in high-throughput sequencing technologies, a growing number of metagenomics studies have suggested dysbiosis in microbial communities as a potential biomarker for certain diseases. Reproducibility of the results drawn from metagenomic data is crucial for clinical applications and for preventing incorrect biological conclusions. Variability in the sample size and in the subjects participating in the experiments induces diversity, which may drastically change the outcome of biomarker detection algorithms. Therefore, a robust biomarker detection algorithm is needed that yields consistent results irrespective of the natural diversity present in the samples.

Results: To this end, this paper proposes a novel Regularized Low Rank-Sparse Decomposition (RegLRSD) algorithm. RegLRSD models the bacterial abundance data as the superposition of a sparse matrix and a low-rank matrix, which account for the differentially and non-differentially abundant microbes, respectively. Hence, the biomarker detection problem is cast as a matrix decomposition problem. To yield more consistent and reliable biological conclusions, RegLRSD incorporates the prior knowledge that irrelevant microbes do not exhibit significant variation between samples belonging to different phenotypes. Moreover, an efficient algorithm to extract the sparse matrix is proposed. Comprehensive comparisons of RegLRSD with state-of-the-art algorithms on three realistic datasets are presented. The results demonstrate that RegLRSD consistently outperforms the other algorithms in terms of reproducibility and provides a marker list with high classification accuracy.
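
The decomposition idea can be illustrated with a minimal sketch. The code below is not the RegLRSD algorithm: it leaves out the phenotype-based regularizer described in the abstract and uses a generic alternating scheme (truncated SVD for the low-rank part, entrywise soft-thresholding for the sparse part). The function names, the parameters `rank`, `lam`, and `n_iter`, and the synthetic data are all assumptions introduced here for illustration only.

```python
import numpy as np


def soft_threshold(M, lam):
    """Entrywise shrinkage: pull every entry toward zero by lam."""
    return np.sign(M) * np.maximum(np.abs(M) - lam, 0.0)


def low_rank_sparse_split(X, rank=1, lam=0.5, n_iter=50):
    """Alternate a rank-`rank` SVD fit and soft-thresholding so that X ~ L + S.

    Generic illustration only; it omits the phenotype-based regularizer that
    RegLRSD adds. `rank`, `lam`, and `n_iter` are hypothetical tuning knobs.
    """
    S = np.zeros_like(X, dtype=float)
    for _ in range(n_iter):
        # Low-rank step: best rank-`rank` approximation of X - S.
        U, s, Vt = np.linalg.svd(X - S, full_matrices=False)
        L = (U[:, :rank] * s[:rank]) @ Vt[:rank, :]
        # Sparse step: keep only the part of each residual entry beyond lam.
        S = soft_threshold(X - L, lam)
    return L, S


def make_toy_abundance(seed=0, n_microbes=50, n_samples=40, planted=(3, 17, 42)):
    """Synthetic microbes-by-samples matrix with three planted markers."""
    rng = np.random.default_rng(seed)
    u = rng.uniform(1.0, 2.0, size=n_microbes)
    v = rng.uniform(1.0, 2.0, size=n_samples)
    # Rank-one background shared by all microbes, plus small noise.
    X = np.outer(u, v) + 0.05 * rng.standard_normal((n_microbes, n_samples))
    # Second half of the samples plays the role of the "case" phenotype.
    is_case = np.arange(n_samples) >= n_samples // 2
    # Planted microbes are more abundant in case samples only.
    X[np.ix_(list(planted), np.flatnonzero(is_case))] += 8.0
    return X, is_case


def rank_microbes(S):
    """Order microbes by the Euclidean norm of their row in S, largest first."""
    return np.argsort(-np.linalg.norm(S, axis=1))


X, is_case = make_toy_abundance()
L, S = low_rank_sparse_split(X)
print("top-ranked microbes:", rank_microbes(S)[:3])
```

In this toy setting, rows of `S` with large norm are treated as candidate markers, while `L` absorbs the background shared across samples. Ranking by row norm is a simple stand-in chosen for this sketch; the paper's own extraction and scoring procedure may differ.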

Conclusions: The proposed RegLRSD algorithm for biomarker detection provides high reproducibility and classification accuracy regardless of the dataset complexity and the number of selected biomarkers. This makes RegLRSD a reliable and powerful tool for identifying potential metagenomic biomarkers.

Citing Articles

Wise Roles and Future Visionary Endeavors of Current Emperor: Advancing Dynamic Methods for Longitudinal Microbiome Meta-Omics Data in Personalized and Precision Medicine.

Oh V, Li R Adv Sci (Weinh). 2024; 11(47):e2400458.

PMID: 39535493 PMC: 11653615. DOI: 10.1002/advs.202400458.


MetaAnalyst: a user-friendly tool for metagenomic biomarker detection and phenotype classification.

Alshawaqfeh M, Rababah S, Hayajneh A, Gharaibeh A, Serpedin E BMC Med Res Methodol. 2022; 22(1):336.

PMID: 36577938 PMC: 9795700. DOI: 10.1186/s12874-022-01812-5.


Impact of Refined and Unrefined Sugar and Starch on the Microbiota in Dental Biofilm.

Chhaliyil P, Fischer K, Schoel B, Chhalliyil P J Int Soc Prev Community Dent. 2022; 12(5):554-563.

PMID: 36532326 PMC: 9753916. DOI: 10.4103/jispcd.JISPCD_104_22.


Music of metagenomics-a review of its applications, analysis pipeline, and associated tools.

Wajid B, Anwar F, Wajid I, Nisar H, Meraj S, Zafar A Funct Integr Genomics. 2021; 22(1):3-26.

PMID: 34657989 DOI: 10.1007/s10142-021-00810-y.


Jatrorrhizine Balances the Gut Microbiota and Reverses Learning and Memory Deficits in APP/PS1 transgenic mice.

Wang S, Jiang W, Ouyang T, Shen X, Wang F, Qu Y Sci Rep. 2019; 9(1):19575.

PMID: 31862965 PMC: 6925119. DOI: 10.1038/s41598-019-56149-9.

