» Articles » PMID: 34729406

The Evolution of Data-Driven Modeling in Organic Chemistry

Overview
Journal ACS Cent Sci
Specialty Chemistry
Date 2021 Nov 3
PMID 34729406
Citations 37
Authors
Affiliations
Soon will be listed here.
Abstract

Organic chemistry is replete with complex relationships: for example, how a reactant's structure relates to the resulting product formed; how reaction conditions relate to yield; how a catalyst's structure relates to enantioselectivity. Questions like these are at the foundation of understanding reactivity and developing novel and improved reactions. An approach to probing these questions that is both longstanding and contemporary is data-driven modeling. Here, we provide a synopsis of the history of data-driven modeling in organic chemistry and the terms used to describe these endeavors. We include a timeline of the steps that led to its current state. The case studies included highlight how, as a community, we have advanced physical organic chemistry tools with the aid of computers and data to augment the intuition of expert chemists and to facilitate the prediction of structure-activity and structure-property relationships.

Citing Articles

Connecting the complexity of stereoselective synthesis to the evolution of predictive tools.

Li J, Reid J Chem Sci. 2025; 16(9):3832-3851.

PMID: 39911341 PMC: 11791519. DOI: 10.1039/d4sc07461k.


Cross-Coupling Reactions with Nickel, Visible Light, and -Butylamine as a Bifunctional Additive.

Duker J, Philipp M, Lentner T, Cadge J, Lavarda J, Gschwind R ACS Catal. 2025; 15(2):817-827.

PMID: 39839851 PMC: 11744660. DOI: 10.1021/acscatal.4c07185.


Applying statistical modeling strategies to sparse datasets in synthetic chemistry.

Haas B, Kalyani D, Sigman M Sci Adv. 2025; 11(1):eadt3013.

PMID: 39742471 PMC: 11691635. DOI: 10.1126/sciadv.adt3013.


Compartmentalizing Donor-Acceptor Stenhouse Adducts for Structure-Property Relationship Analysis.

Reyes C, Karr A, Ramsperger C, K A, Lee H, Picazo E J Am Chem Soc. 2024; 147(1):10-26.

PMID: 39729546 PMC: 11726581. DOI: 10.1021/jacs.4c14198.


Data-Driven Discovery of a New Fluorescent BASHY Dye for Bioimaging.

Ravasco J, Felicidade J, Pinto M, Santos F, Campos-Gonzalez R, Arteaga J JACS Au. 2024; 4(11):4212-4222.

PMID: 39610736 PMC: 11600176. DOI: 10.1021/jacsau.4c00473.


References
1.
Cai Y, Liu X, Xu X, Chou K . Support Vector Machines for predicting HIV protease cleavage sites in protein. J Comput Chem. 2002; 23(2):267-74. DOI: 10.1002/jcc.10017. View

2.
Coley C, Jin W, Rogers L, Jamison T, Jaakkola T, Green W . A graph-convolutional neural network model for the prediction of chemical reactivity. Chem Sci. 2019; 10(2):370-377. PMC: 6335848. DOI: 10.1039/c8sc04228d. View

3.
Holmes E, Antti H . Chemometric contributions to the evolution of metabonomics: mathematical solutions to characterising and interpreting complex biological NMR spectra. Analyst. 2003; 127(12):1549-57. DOI: 10.1039/b208254n. View

4.
Vanhaelen Q, Lin Y, Zhavoronkov A . The Advent of Generative Chemistry. ACS Med Chem Lett. 2020; 11(8):1496-1505. PMC: 7429972. DOI: 10.1021/acsmedchemlett.0c00088. View

5.
Mater A, Coote M . Deep Learning in Chemistry. J Chem Inf Model. 2019; 59(6):2545-2559. DOI: 10.1021/acs.jcim.9b00266. View