» Articles » PMID: 31756219

Biomarker Discovery in Inflammatory Bowel Diseases Using Network-based Feature Selection

Overview
Journal PLoS One
Date 2019 Nov 23
PMID 31756219
Citations 6
Authors
Affiliations
Soon will be listed here.
Abstract

Reliable identification of Inflammatory biomarkers from metagenomics data is a promising direction for developing non-invasive, cost-effective, and rapid clinical tests for early diagnosis of IBD. We present an integrative approach to Network-Based Biomarker Discovery (NBBD) which integrates network analyses methods for prioritizing potential biomarkers and machine learning techniques for assessing the discriminative power of the prioritized biomarkers. Using a large dataset of new-onset pediatric IBD metagenomics biopsy samples, we compare the performance of Random Forest (RF) classifiers trained on features selected using a representative set of traditional feature selection methods against NBBD framework, configured using five different tools for inferring networks from metagenomics data, and nine different methods for prioritizing biomarkers as well as a hybrid approach combining best traditional and NBBD based feature selection. We also examine how the performance of the predictive models for IBD diagnosis varies as a function of the size of the data used for biomarker identification. Our results show that (i) NBBD is competitive with some of the state-of-the-art feature selection methods including Random Forest Feature Importance (RFFI) scores; and (ii) NBBD is especially effective in reliably identifying IBD biomarkers when the number of data samples available for biomarker discovery is small.

Citing Articles

Advances in Inflammatory Bowel Disease Diagnostics: Machine Learning and Genomic Profiling Reveal Key Biomarkers for Early Detection.

Syed A, Abujabal H, Ahmad S, Malebary S, Alromema N Diagnostics (Basel). 2024; 14(11).

PMID: 38893707 PMC: 11172026. DOI: 10.3390/diagnostics14111182.


Deep learning conventional learning algorithms for clinical prediction in Crohn's disease: A proof-of-concept study.

Con D, van Langenberg D, Vasudevan A World J Gastroenterol. 2021; 27(38):6476-6488.

PMID: 34720536 PMC: 8517788. DOI: 10.3748/wjg.v27.i38.6476.


Artificial intelligence applications in inflammatory bowel disease: Emerging technologies and future directions.

Gubatan J, Levitte S, Patel A, Balabanis T, Wei M, Sinha S World J Gastroenterol. 2021; 27(17):1920-1935.

PMID: 34007130 PMC: 8108036. DOI: 10.3748/wjg.v27.i17.1920.


Incorporating Machine Learning into Established Bioinformatics Frameworks.

Auslander N, Gussow A, Koonin E Int J Mol Sci. 2021; 22(6).

PMID: 33809353 PMC: 8000113. DOI: 10.3390/ijms22062903.


Machine learning based refined differential gene expression analysis of pediatric sepsis.

Abbas M, El-Manzalawy Y BMC Med Genomics. 2020; 13(1):122.

PMID: 32859206 PMC: 7453705. DOI: 10.1186/s12920-020-00771-4.


References
1.
Schmidt C, Stallmach A . Etiology and pathogenesis of inflammatory bowel disease. Minerva Gastroenterol Dietol. 2005; 51(2):127-45. View

2.
Kyrpides N, Eloe-Fadrosh E, Ivanova N . Microbiome Data Science: Understanding Our Microbial Planet. Trends Microbiol. 2016; 24(6):425-427. DOI: 10.1016/j.tim.2016.02.011. View

3.
Avella-Medina M, Battey H, Fan J, Li Q . Robust estimation of high-dimensional covariance and precision matrices. Biometrika. 2018; 105(2):271-284. PMC: 6188670. DOI: 10.1093/biomet/asy011. View

4.
van Dam S, Vosa U, van der Graaf A, Franke L, de Magalhaes J . Gene co-expression analysis for functional classification and gene-disease predictions. Brief Bioinform. 2017; 19(4):575-592. PMC: 6054162. DOI: 10.1093/bib/bbw139. View

5.
Anders S, Huber W . Differential expression analysis for sequence count data. Genome Biol. 2010; 11(10):R106. PMC: 3218662. DOI: 10.1186/gb-2010-11-10-r106. View