
Product Manifold Representations for Learning on Biological Pathways

Overview
Journal ArXiv
Date 2025 Feb 20
PMID 39975438
Abstract

Machine learning models that embed graphs in non-Euclidean spaces have shown substantial benefits in a variety of contexts, but their application has not been studied extensively in the biological domain, particularly with respect to biological pathway graphs. Such graphs exhibit a variety of complex network structures, presenting challenges to existing embedding approaches. Learning high-quality embeddings for biological pathway graphs is important for researchers looking to understand the underpinnings of disease and to train high-quality predictive models on these networks. In this work, we investigate the effects of embedding pathway graphs in non-Euclidean mixed-curvature spaces and compare the results against traditional Euclidean graph representation learning models. We then train a supervised model using the learned node embeddings to predict missing protein-protein interactions in pathway graphs. We find large reductions in distortion and gains in in-distribution edge prediction performance as a result of using mixed-curvature embeddings and their corresponding graph neural network models. However, we find that mixed-curvature representations underperform existing baselines on out-of-distribution edge prediction, suggesting that these representations may overfit to the training graph topology. We provide our Mixed-Curvature Product Graph Convolutional Network code at https://github.com/mcneela/Mixed-Curvature-GCN and our pathway analysis code at https://github.com/mcneela/Mixed-Curvature-Pathways.
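
To make the notions of a mixed-curvature product space and embedding distortion concrete, the following is a minimal sketch and is not taken from the authors' repositories. It assumes unit-curvature components (a Poincaré ball for the hyperbolic factor, a unit sphere for the spherical factor) and one common definition of average distortion, the mean of |d_embedding / d_graph - 1| over node pairs. All function names here are hypothetical and chosen for illustration only.

```python
import numpy as np

def euclidean_dist(x, y):
    # Straight-line distance in the flat (zero-curvature) component.
    return float(np.linalg.norm(x - y))

def spherical_dist(x, y):
    # Geodesic distance on the unit sphere; inputs are assumed to be unit-norm.
    return float(np.arccos(np.clip(np.dot(x, y), -1.0, 1.0)))

def poincare_dist(x, y):
    # Geodesic distance in the Poincare ball of curvature -1; inputs need norm < 1.
    sq = np.sum((x - y) ** 2)
    denom = (1.0 - np.sum(x ** 2)) * (1.0 - np.sum(y ** 2))
    return float(np.arccosh(1.0 + 2.0 * sq / denom))

def product_dist(x_parts, y_parts, dist_fns):
    # Product-manifold distance: square root of the sum of squared
    # per-component geodesic distances.
    total = sum(d(a, b) ** 2 for d, a, b in zip(dist_fns, x_parts, y_parts))
    return float(np.sqrt(total))

def average_distortion(emb_dist, graph_dist):
    # Assumed definition: mean relative error |d_emb / d_graph - 1|
    # over all distinct node pairs (upper triangle of the distance matrices).
    iu = np.triu_indices_from(graph_dist, k=1)
    return float(np.mean(np.abs(emb_dist[iu] / graph_dist[iu] - 1.0)))

# Hypothetical usage: two points in a (hyperbolic x spherical x Euclidean) space.
fns = [poincare_dist, spherical_dist, euclidean_dist]
u = [np.array([0.0, 0.0]), np.array([1.0, 0.0]), np.array([0.0, 0.0])]
v = [np.array([0.5, 0.0]), np.array([0.0, 1.0]), np.array([3.0, 0.0])]
d_uv = product_dist(u, v, fns)
```

A lower average distortion means that pairwise embedding distances track shortest-path distances in the graph more faithfully. The choice of component manifolds and their curvatures is what a mixed-curvature model tunes or learns.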
