» Articles » PMID: 39753565

PreLect: Prevalence Leveraged Consistent Feature Selection Decodes Microbial Signatures Across Cohorts

Overview
Date 2025 Jan 3
PMID 39753565
Authors
Affiliations
Soon will be listed here.
Abstract

The intricate nature of microbiota sequencing data-high dimensionality and sparsity-presents a challenge in identifying informative and reproducible microbial features for both research and clinical applications. Addressing this, we introduce PreLect, an innovative feature selection framework that harnesses microbes' prevalence to facilitate consistent selection in sparse microbiota data. Upon rigorous benchmarking against established feature selection methodologies across 42 microbiome datasets, PreLect demonstrated superior classification capabilities compared to statistical methods and outperformed machine learning-based methods by selecting features with greater prevalence and abundance. A significant strength of PreLect lies in its ability to reliably identify reproducible microbial features across varied cohorts. Applied to colorectal cancer, PreLect identifies key microbes and highlights crucial pathways, such as lipopolysaccharide and glycerophospholipid biosynthesis, in cancer progression. This case study exemplifies PreLect's utility in discerning clinically relevant microbial signatures. In summary, PreLect's accuracy and robustness make it a significant advancement in the analysis of complex microbiota data.

References
1.
Fernandes A, Reid J, Macklaim J, McMurrough T, Edgell D, Gloor G . Unifying the analysis of high-throughput sequencing datasets: characterizing RNA-seq, 16S rRNA gene sequencing and selective growth experiments by compositional data analysis. Microbiome. 2014; 2:15. PMC: 4030730. DOI: 10.1186/2049-2618-2-15. View

2.
Callahan B, McMurdie P, Rosen M, Han A, Johnson A, Holmes S . DADA2: High-resolution sample inference from Illumina amplicon data. Nat Methods. 2016; 13(7):581-3. PMC: 4927377. DOI: 10.1038/nmeth.3869. View

3.
Goodwin A, DeStefano Shields C, Wu S, Huso D, Wu X, Murray-Stewart T . Polyamine catabolism contributes to enterotoxigenic Bacteroides fragilis-induced colon tumorigenesis. Proc Natl Acad Sci U S A. 2011; 108(37):15354-9. PMC: 3174648. DOI: 10.1073/pnas.1010203108. View

4.
Calgaro M, Romualdi C, Waldron L, Risso D, Vitulo N . Assessment of statistical methods from single cell, bulk RNA-seq, and metagenomics applied to microbiome data. Genome Biol. 2020; 21(1):191. PMC: 7398076. DOI: 10.1186/s13059-020-02104-1. View

5.
McCulloch J, Davar D, Rodrigues R, Badger J, Fang J, Cole A . Intestinal microbiota signatures of clinical response and immune-related adverse events in melanoma patients treated with anti-PD-1. Nat Med. 2022; 28(3):545-556. PMC: 10246505. DOI: 10.1038/s41591-022-01698-2. View