» Articles » PMID: 39927291

Simplicity Within Biological Complexity

Overview
Journal Bioinform Adv
Date 2025 Feb 10
PMID 39927291
Authors
Affiliations
Soon will be listed here.
Abstract

Motivation: Heterogeneous, interconnected, systems-level, molecular (multi-omic) data have become increasingly available and key in precision medicine. We need to utilize them to better stratify patients into risk groups, discover new biomarkers and targets, repurpose known and discover new drugs to personalize medical treatment. Existing methodologies are limited and a paradigm shift is needed to achieve quantitative and qualitative breakthroughs.

Results: In this perspective paper, we survey the literature and argue for the development of a comprehensive, general framework for embedding of multi-scale molecular network data that would enable their explainable exploitation in precision medicine in linear time. Network embedding methods (also called graph representation learning) map nodes to points in low-dimensional space, so that proximity in the learned space reflects the network's topology-function relationships. They have recently achieved unprecedented performance on hard problems of utilizing few omic data in various biomedical applications. However, research thus far has been limited to special variants of the problems and data, with the performance depending on the underlying topology-function network biology hypotheses, the biomedical applications, and evaluation metrics. The availability of multi-omic data, modern graph embedding paradigms and compute power call for a creation and training of efficient, explainable and controllable models, having no potentially dangerous, unexpected behaviour, that make a qualitative breakthrough. We propose to develop a general, comprehensive embedding framework for multi-omic network data, from models to efficient and scalable software implementation, and to apply it to biomedical informatics, focusing on precision medicine and personalized drug discovery. It will lead to a paradigm shift in the computational and biomedical understanding of data and diseases that will open up ways to solve some of the major bottlenecks in precision medicine and other domains.

References
1.
Cui H, Wang C, Maan H, Pang K, Luo F, Duan N . scGPT: toward building a foundation model for single-cell multi-omics using generative AI. Nat Methods. 2024; 21(8):1470-1480. DOI: 10.1038/s41592-024-02201-0. View

2.
Kearnes S, Maser M, Wleklinski M, Kast A, Doyle A, Dreher S . The Open Reaction Database. J Am Chem Soc. 2021; 143(45):18820-18826. DOI: 10.1021/jacs.1c09820. View

3.
Yang C, Xiao Y, Zhang Y, Sun Y, Han J . Heterogeneous Network Representation Learning: A Unified Framework with Survey and Benchmark. IEEE Trans Knowl Data Eng. 2023; 34(10):4854-4873. PMC: 10619966. DOI: 10.1109/tkde.2020.3045924. View

4.
Adames N, Gallegos J, Peccoud J . Yeast genetic interaction screens in the age of CRISPR/Cas. Curr Genet. 2018; 65(2):307-327. PMC: 6420903. DOI: 10.1007/s00294-018-0887-8. View

5.
Duran C, Ciucci S, Palladini A, Ijaz U, Zippo A, Paroni Sterbini F . Nonlinear machine learning pattern recognition and bacteria-metabolite multilayer network analysis of perturbed gastric microbiome. Nat Commun. 2021; 12(1):1926. PMC: 7997970. DOI: 10.1038/s41467-021-22135-x. View