» Articles » PMID: 39082648

MetaPredictor: in Silico Prediction of Drug Metabolites Based on Deep Language Models with Prompt Engineering

Overview
Journal Brief Bioinform
Specialty Biology
Date 2024 Jul 31
PMID 39082648
Authors
Affiliations
Soon will be listed here.
Abstract

Metabolic processes can transform a drug into metabolites with different properties that may affect its efficacy and safety. Therefore, investigation of the metabolic fate of a drug candidate is of great significance for drug discovery. Computational methods have been developed to predict drug metabolites, but most of them suffer from two main obstacles: the lack of model generalization due to restrictions on metabolic transformation rules or specific enzyme families, and high rate of false-positive predictions. Here, we presented MetaPredictor, a rule-free, end-to-end and prompt-based method to predict possible human metabolites of small molecules including drugs as a sequence translation problem. We innovatively introduced prompt engineering into deep language models to enrich domain knowledge and guide decision-making. The results showed that using prompts that specify the sites of metabolism (SoMs) can steer the model to propose more accurate metabolite predictions, achieving a 30.4% increase in recall and a 16.8% reduction in false positives over the baseline model. The transfer learning strategy was also utilized to tackle the limited availability of metabolic data. For the adaptation to automatic or non-expert prediction, MetaPredictor was designed as a two-stage schema consisting of automatic identification of SoMs followed by metabolite prediction. Compared to four available drug metabolite prediction tools, our method showed comparable performance on the major enzyme families and better generalization that could additionally identify metabolites catalyzed by less common enzymes. The results indicated that MetaPredictor could provide a more comprehensive and accurate prediction of drug metabolism through the effective combination of transfer learning and prompt-based learning strategies.

Citing Articles

Overcoming Challenges in Small-Molecule Drug Bioavailability: A Review of Key Factors and Approaches.

Wu K, Kwon S, Zhou X, Fuller C, Wang X, Vadgama J Int J Mol Sci. 2024; 25(23.

PMID: 39684832 PMC: 11642056. DOI: 10.3390/ijms252313121.

References
1.
Zaretzki J, Matlock M, Swamidass S . XenoSite: accurately predicting CYP-mediated sites of metabolism with neural networks. J Chem Inf Model. 2013; 53(12):3373-83. DOI: 10.1021/ci400518g. View

2.
Farid N, Kurihara A, Wrighton S . Metabolism and disposition of the thienopyridine antiplatelet drugs ticlopidine, clopidogrel, and prasugrel in humans. J Clin Pharmacol. 2009; 50(2):126-42. DOI: 10.1177/0091270009343005. View

3.
Tang W, Lu A . Metabolic bioactivation and drug-related adverse effects: current status and future directions from a pharmaceutical research perspective. Drug Metab Rev. 2009; 42(2):225-49. DOI: 10.3109/03602530903401658. View

4.
Nair V, Schwaller P, Laino T . Data-driven Chemical Reaction Prediction and Retrosynthesis. Chimia (Aarau). 2019; 73(12):997-1000. DOI: 10.2533/chimia.2019.997. View

5.
Caspi R, Billington R, Fulcher C, Keseler I, Kothari A, Krummenacker M . The MetaCyc database of metabolic pathways and enzymes. Nucleic Acids Res. 2017; 46(D1):D633-D639. PMC: 5753197. DOI: 10.1093/nar/gkx935. View