» Articles » PMID: 28573205

Prediction of Organic Reaction Outcomes Using Machine Learning

Overview
Journal ACS Cent Sci
Specialty Chemistry
Date 2017 Jun 3
PMID 28573205
Citations 131
Authors
Affiliations
Soon will be listed here.
Abstract

Computer assistance in synthesis design has existed for over 40 years, yet retrosynthesis planning software has struggled to achieve widespread adoption. One critical challenge in developing high-quality pathway suggestions is that proposed reaction steps often fail when attempted in the laboratory, despite initially seeming viable. The true measure of success for any synthesis program is whether the predicted outcome matches what is observed experimentally. We report a model framework for anticipating reaction outcomes that combines the traditional use of reaction templates with the flexibility in pattern recognition afforded by neural networks. Using 15 000 experimental reaction records from granted United States patents, a model is trained to select the major (recorded) product by ranking a self-generated list of candidates where one candidate is known to be the major product. Candidate reactions are represented using a unique edit-based representation that emphasizes the fundamental transformation from reactants to products, rather than the constituent molecules' overall structures. In a 5-fold cross-validation, the trained model assigns the major product rank 1 in 71.8% of cases, rank ≤3 in 86.7% of cases, and rank ≤5 in 90.8% of cases.

Citing Articles

Computational tools for the prediction of site- and regioselectivity of organic reactions.

Sigmund L, Assante M, Johansson M, Norrby P, Jorner K, Kabeshov M Chem Sci. 2025; .

PMID: 40070469 PMC: 11891785. DOI: 10.1039/d5sc00541h.


Chemically Informed Deep Learning for Interpretable Radical Reaction Prediction.

Tavakoli M, Chiu Y, Carlton A, Van Vranken D, Baldi P J Chem Inf Model. 2025; 65(3):1228-1242.

PMID: 39871741 PMC: 11815866. DOI: 10.1021/acs.jcim.4c01901.


Integrating Machine Learning and Large Language Models to Advance Exploration of Electrochemical Reactions.

Zheng Z, Florit F, Jin B, Wu H, Li S, Nandiwale K Angew Chem Int Ed Engl. 2024; 64(6):e202418074.

PMID: 39625837 PMC: 11795713. DOI: 10.1002/anie.202418074.


Authors' reply to: Concerns Regarding "Development of a machine learning-based model to predict hepatic inflammation in chronic hepatitis B patients with concurrent hepatic steatosis: a cohort study".

Rui F, Yeo Y, Tian X, Chen Y, Li J EClinicalMedicine. 2024; 78:102908.

PMID: 39619238 PMC: 11605130. DOI: 10.1016/j.eclinm.2024.102908.


Reactivities of -Nitrosamines against Common Reagents and Reaction Conditions.

Hodgin G, Burns M, Deadman B, Roberts C, Mimi Hii K, Nguyen B Org Process Res Dev. 2024; 28(10):3837-3846.

PMID: 39444428 PMC: 11494645. DOI: 10.1021/acs.oprd.4c00217.


References
1.
Labute P . A widely applicable set of descriptors. J Mol Graph Model. 2001; 18(4-5):464-77. DOI: 10.1016/s1093-3263(00)00068-1. View

2.
Segler M, Waller M . Neural-Symbolic Machine Learning for Retrosynthesis and Reaction Prediction. Chemistry. 2017; 23(25):5966-5971. DOI: 10.1002/chem.201605499. View

3.
Segler M, Waller M . Modelling Chemical Reasoning to Predict and Invent Reactions. Chemistry. 2016; 23(25):6118-6128. DOI: 10.1002/chem.201604556. View

4.
Corey E, Wipke W . Computer-assisted design of complex organic syntheses. Science. 1969; 166(3902):178-92. DOI: 10.1126/science.166.3902.178. View

5.
Wei J, Duvenaud D, Aspuru-Guzik A . Neural Networks for the Prediction of Organic Chemistry Reactions. ACS Cent Sci. 2016; 2(10):725-732. PMC: 5084081. DOI: 10.1021/acscentsci.6b00219. View