
Enhancing Molecular Network-Based Cancer Driver Gene Prediction Using Machine Learning Approaches: Current Challenges and Opportunities

Overview
Journal J Cell Mol Med
Date 2025 Jan 13
PMID 39804102
Abstract

Cancer is a complex disease driven by mutations in genes that play critical roles in cellular processes. Identifying cancer driver genes is crucial for understanding tumorigenesis, developing targeted therapies and identifying rational drug targets, but experimental identification and validation of these genes are time-consuming and costly. Studies have demonstrated that genes that interact with one another tend to be associated with similar phenotypes, which motivates the use of molecular network-based approaches to identify cancer driver genes. Random walk-based approaches, which integrate mutation data with protein-protein interaction networks, have been widely employed to predict cancer driver genes and have demonstrated robust predictive potential. Recent advances in deep learning, particularly graph-based models, provide novel opportunities to enhance this prediction. This review aimed to comprehensively explore how machine learning methodologies, particularly network propagation, graph neural networks, autoencoders, graph embeddings and attention mechanisms, improve the scalability and interpretability of molecular network-based cancer gene prediction.
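
To make the random walk-based idea concrete, the following is a minimal sketch of random walk with restart on a toy interaction network. It is an illustration only, not the method of the review or of any specific tool: the gene labels (G1 to G5), the edges, the seed weights, the restart probability of 0.5 and the function name are all assumptions chosen for the example. Mutated genes act as seeds, and their score is propagated along network edges so that genes close to the seeds rank higher.

```python
def random_walk_with_restart(adjacency, seeds, restart=0.5, tol=1e-10, max_iter=1000):
    """Propagate seed scores over an undirected network.

    adjacency: dict mapping each gene to the set of its neighbours
    seeds:     dict mapping seed (e.g. mutated) genes to non-negative weights
    restart:   probability of jumping back to the seeds at each step
    Returns a dict of steady-state visiting probabilities per gene.
    """
    genes = sorted(adjacency)
    total = sum(seeds.get(g, 0.0) for g in genes)
    if total <= 0:
        raise ValueError("at least one seed gene must be present in the network")

    # Initial distribution: all probability mass sits on the seed genes.
    p0 = {g: seeds.get(g, 0.0) / total for g in genes}
    p = dict(p0)

    for _ in range(max_iter):
        # Restart term: a fixed fraction of the mass returns to the seeds.
        nxt = {g: restart * p0[g] for g in genes}
        for g in genes:
            walk_mass = (1.0 - restart) * p[g]
            neighbours = adjacency[g]
            if neighbours:
                # Walk term: split the remaining mass evenly among neighbours.
                share = walk_mass / len(neighbours)
                for h in neighbours:
                    nxt[h] += share
            else:
                # Assumption: an isolated gene sends its mass back to the seeds.
                for h in genes:
                    nxt[h] += walk_mass * p0[h]
        delta = sum(abs(nxt[g] - p[g]) for g in genes)
        p = nxt
        if delta < tol:
            break
    return p


# Hypothetical toy network; labels and edges are placeholders, not real data.
edges = [("G1", "G2"), ("G1", "G3"), ("G2", "G3"), ("G3", "G4"), ("G4", "G5")]
adjacency = {}
for a, b in edges:
    adjacency.setdefault(a, set()).add(b)
    adjacency.setdefault(b, set()).add(a)

# Assume G1 is the only gene carrying a mutation.
scores = random_walk_with_restart(adjacency, seeds={"G1": 1.0})
ranking = sorted(scores, key=scores.get, reverse=True)
print(ranking)
```

In this sketch a higher restart probability keeps the scores concentrated near the seed genes, while a lower one lets them diffuse further across the network. Real applications typically use sparse matrix operations and a curated protein-protein interaction network rather than Python dictionaries.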
