» Articles » PMID: 36790067

Using Graph Neural Networks for Site-of-metabolism Prediction and Its Applications to Ranking Promiscuous Enzymatic Products

Overview
Journal Bioinformatics
Specialty Biology
Date 2023 Feb 15
PMID 36790067
Authors
Affiliations
Soon will be listed here.
Abstract

Motivation: While traditionally utilized for identifying site-specific metabolic activity within a compound to alter its interaction with a metabolizing enzyme, predicting the site-of-metabolism (SOM) is essential in analyzing the promiscuity of enzymes on substrates. The successful prediction of SOMs and the relevant promiscuous products has a wide range of applications that include creating extended metabolic models (EMMs) that account for enzyme promiscuity and the construction of novel heterologous synthesis pathways. There is therefore a need to develop generalized methods that can predict molecular SOMs for a wide range of metabolizing enzymes.

Results: This article develops a Graph Neural Network (GNN) model for the classification of an atom (or a bond) being an SOM. Our model, GNN-SOM, is trained on enzymatic interactions, available in the KEGG database, that span all enzyme commission numbers. We demonstrate that GNN-SOM consistently outperforms baseline machine learning models, when trained on all enzymes, on Cytochrome P450 (CYP) enzymes, or on non-CYP enzymes. We showcase the utility of GNN-SOM in prioritizing predicted enzymatic products due to enzyme promiscuity for two biological applications: the construction of EMMs and the construction of synthesis pathways.

Availability And Implementation: A python implementation of the trained SOM predictor model can be found at https://github.com/HassounLab/GNN-SOM.

Supplementary Information: Supplementary data are available at Bioinformatics online.

Citing Articles

Decoding allosteric landscapes: computational methodologies for enzyme modulation and drug discovery.

Zhu R, Wu C, Zha J, Lu S, Zhang J RSC Chem Biol. 2025; .

PMID: 39981029 PMC: 11836628. DOI: 10.1039/d4cb00282b.


Molecular Structure Discovery for Untargeted Metabolomics Using Biotransformation Rules and Global Molecular Networking.

Martin M, Bittremieux W, Hassoun S Anal Chem. 2025; 97(6):3213-3219.

PMID: 39903752 PMC: 11841678. DOI: 10.1021/acs.analchem.4c01565.


XenoMet: A Corpus of Texts to Extract Data on Metabolites of Xenobiotics.

Biziukova N, Rudik A, Dmitriev A, Tarasova O, Filimonov D, Poroikov V ACS Omega. 2025; 10(3):2459-2471.

PMID: 39895765 PMC: 11780559. DOI: 10.1021/acsomega.4c05723.


D-CyPre: a machine learning-based tool for accurate prediction of human CYP450 enzyme metabolic sites.

Yang H, Liu J, Chen K, Cong S, Cai S, Li Y PeerJ Comput Sci. 2024; 10:e2040.

PMID: 38855237 PMC: 11157575. DOI: 10.7717/peerj-cs.2040.


SelenzymeRF: updated enzyme suggestion software for unbalanced biochemical reactions.

Stoney R, Hanko E, Carbonell P, Breitling R Comput Struct Biotechnol J. 2023; 21:5868-5876.

PMID: 38074466 PMC: 10697999. DOI: 10.1016/j.csbj.2023.11.039.

References
1.
Amin S, Chavez E, Porokhin V, Nair N, Hassoun S . Towards creating an extended metabolic model (EMM) for E. coli using enzyme promiscuity prediction and metabolomics data. Microb Cell Fact. 2019; 18(1):109. PMC: 6567437. DOI: 10.1186/s12934-019-1156-3. View

2.
Strutz J, Shebek K, Broadbelt L, Tyo K . MINE 2.0: enhanced biochemical coverage for peak identification in untargeted metabolomics. Bioinformatics. 2022; 38(13):3484-3487. PMC: 9237697. DOI: 10.1093/bioinformatics/btac331. View

3.
Dang N, Matlock M, Hughes T, Swamidass S . The Metabolic Rainbow: Deep Learning Phase I Metabolism in Five Colors. J Chem Inf Model. 2020; 60(3):1146-1164. PMC: 8716320. DOI: 10.1021/acs.jcim.9b00836. View

4.
Otero-Muras I, Carbonell P . Automated engineering of synthetic metabolic pathways for efficient biomanufacturing. Metab Eng. 2020; 63:61-80. DOI: 10.1016/j.ymben.2020.11.012. View

5.
Duigou T, du Lac M, Carbonell P, Faulon J . RetroRules: a database of reaction rules for engineering biology. Nucleic Acids Res. 2018; 47(D1):D1229-D1235. PMC: 6323975. DOI: 10.1093/nar/gky940. View