A Graph-convolutional Neural Network Model for the Prediction of Chemical Reactivity
Overview
We present a supervised learning approach to predict the products of organic reactions given their reactants, reagents, and solvent(s). The prediction task is factored into two stages comparable to manual expert approaches: considering possible sites of reactivity and evaluating their relative likelihoods. By training on hundreds of thousands of reaction precedents covering a broad range of reaction types from the patent literature, the neural model makes informed predictions of chemical reactivity. The model predicts the major product correctly over 85% of the time, requiring around 100 ms per example; this accuracy is significantly higher than that achieved by previous machine learning approaches, and the model performs on par with expert chemists with years of formal training. We gain additional insight into predictions via the design of the neural model, revealing an understanding of chemistry qualitatively consistent with manual approaches.
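The two-stage factoring described above can be illustrated with a toy sketch. This is not the authors' implementation: the function names, the hand-set scores, and the use of a simple sum as the candidate score are all assumptions made for illustration. In the actual model, both the per-site scores and the candidate ranking would come from trained graph-convolutional networks.

```python
import itertools
import math

# Hypothetical stage-1 output: a likelihood score for each possible bond
# change, keyed by a pair of atom indices. A trained network would produce
# these; here they are hand-set numbers for illustration only.
pair_scores = {(1, 5): 2.0, (5, 6): 1.5, (2, 3): -1.0, (3, 4): -3.0}


def top_k_bond_changes(scores, k):
    """Stage 1: keep the k most likely sites of reactivity."""
    return sorted(scores, key=scores.get, reverse=True)[:k]


def enumerate_candidates(changes, max_changes):
    """Combine 1..max_changes bond changes into candidate outcomes."""
    candidates = []
    for n in range(1, max_changes + 1):
        candidates.extend(itertools.combinations(changes, n))
    return candidates


def rank_candidates(candidates, scores):
    """Stage 2: turn candidate scores into relative likelihoods (softmax).

    Assumption: a candidate's raw score is the sum of its bond-change
    scores, standing in for a learned candidate-ranking network.
    """
    raw = [sum(scores[c] for c in cand) for cand in candidates]
    shift = max(raw)  # subtract the max for numerical stability
    exps = [math.exp(r - shift) for r in raw]
    total = sum(exps)
    probs = [e / total for e in exps]
    return sorted(zip(candidates, probs), key=lambda t: t[1], reverse=True)


sites = top_k_bond_changes(pair_scores, k=3)
candidates = enumerate_candidates(sites, max_changes=2)
ranked = rank_candidates(candidates, pair_scores)
best_candidate, best_prob = ranked[0]
```

Restricting the second stage to combinations of the top-scoring sites keeps the candidate set small, which is one plausible reason a factored approach of this kind can return a prediction quickly.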