» Articles » PMID: 39605612

Feature Selection with Vector-symbolic Architectures: a Case Study on Microbial Profiles of Shotgun Metagenomic Samples of Colorectal Cancer

Overview
Journal bioRxiv
Date 2024 Nov 28
PMID 39605612
Authors
Affiliations
Soon will be listed here.
Abstract

The continuingly decreasing cost of next-generation sequencing has recently led to a significant increase in the number of microbiome-related studies, providing invaluable information for understanding host-microbiome interactions and their relation to diseases. A common approach in metagenomics consists of determining the composition of samples in terms of the amount and types of microbial species that populate them, with the goal to identify microbes whose profiles are able to differentiate samples under different conditions with advanced feature selection techniques. Here we propose a novel backward variable selection method based on the hyperdimensional computing paradigm, which takes inspiration from how the human brain works in the classification of concepts by encoding features into vectors in a high-dimensional space. We validated our method on public metagenomic samples collected from patients affected by colorectal cancer in a case/control scenario, by performing a comparative analysis with other state-of-the-art feature selection methods, obtaining promising results.

References
1.
Wirbel J, Pyl P, Kartal E, Zych K, Kashani A, Milanese A . Meta-analysis of fecal metagenomes reveals global microbial signatures that are specific for colorectal cancer. Nat Med. 2019; 25(4):679-689. PMC: 7984229. DOI: 10.1038/s41591-019-0406-6. View

2.
Plate T . Holographic reduced representations. IEEE Trans Neural Netw. 1995; 6(3):623-41. DOI: 10.1109/72.377968. View

3.
Giardine B, Riemer C, Hardison R, Burhans R, Elnitski L, Shah P . Galaxy: a platform for interactive large-scale genome analysis. Genome Res. 2005; 15(10):1451-5. PMC: 1240089. DOI: 10.1101/gr.4086505. View

4.
Yu J, Feng Q, Wong S, Zhang D, Liang Q, Qin Y . Metagenomic analysis of faecal microbiome as a tool towards targeted non-invasive biomarkers for colorectal cancer. Gut. 2015; 66(1):70-78. DOI: 10.1136/gutjnl-2015-309800. View

5.
Hiltemann S, Rasche H, Gladman S, Hotz H, Lariviere D, Blankenberg D . Galaxy Training: A powerful framework for teaching!. PLoS Comput Biol. 2023; 19(1):e1010752. PMC: 9829167. DOI: 10.1371/journal.pcbi.1010752. View