» Articles » PMID: 38680564

Standardizing Substrate Selection: A Strategy Toward Unbiased Evaluation of Reaction Generality

Overview
Journal ACS Cent Sci
Specialty Chemistry
Date 2024 Apr 29
PMID 38680564
Authors
Affiliations
Soon will be listed here.
Abstract

With over 10,000 new reaction protocols arising every year, only a handful of these procedures transition from academia to application. A major reason for this gap stems from the lack of comprehensive knowledge about a reaction's scope, i.e., to which substrates the protocol can or cannot be applied. Even though chemists invest substantial effort to assess the scope of new protocols, the resulting scope tables involve significant biases, reducing their expressiveness. Herein we report a standardized substrate selection strategy designed to mitigate these biases and evaluate the applicability, as well as the limits, of any chemical reaction. Unsupervised learning is utilized to map the chemical space of industrially relevant molecules. Subsequently, potential substrate candidates are projected onto this universal map, enabling the selection of a structurally diverse set of substrates with optimal relevance and coverage. By testing our methodology on different chemical reactions, we were able to demonstrate its effectiveness in finding general reactivity trends by using a few highly representative examples. The developed methodology empowers chemists to showcase the unbiased applicability of novel methodologies, facilitating their practical applications. We hope that this work will trigger interdisciplinary discussions about biases in synthetic chemistry, leading to improved data quality.

Citing Articles

Designing Target-specific Data Sets for Regioselectivity Predictions on Complex Substrates.

Schleinitz J, Carretero-Cerdan A, Gurajapu A, Harnik Y, Lee G, Pandey A J Am Chem Soc. 2025; 147(9):7476-7484.

PMID: 39982221 PMC: 11887056. DOI: 10.1021/jacs.4c15902.


Catalysing (organo-)catalysis: Trends in the application of machine learning to enantioselective organocatalysis.

Schmid S, Schlosser L, Glorius F, Jorner K Beilstein J Org Chem. 2024; 20:2280-2304.

PMID: 39290209 PMC: 11406055. DOI: 10.3762/bjoc.20.196.


Data-Driven Insights into the Transition-Metal-Catalyzed Asymmetric Hydrogenation of Olefins.

Singh S, Hernandez-Lobato J J Org Chem. 2024; 89(17):12467-12478.

PMID: 39149801 PMC: 11382158. DOI: 10.1021/acs.joc.4c01396.

References
1.
Wagen C, McMinn S, Kwan E, Jacobsen E . Screening for generality in asymmetric catalysis. Nature. 2022; 610(7933):680-686. PMC: 9645431. DOI: 10.1038/s41586-022-05263-2. View

2.
Probst D, Reymond J . Visualization of very large high-dimensional data sets as minimum spanning trees. J Cheminform. 2021; 12(1):12. PMC: 7015965. DOI: 10.1186/s13321-020-0416-x. View

3.
Reymond J . The chemical space project. Acc Chem Res. 2015; 48(3):722-30. DOI: 10.1021/ar500432k. View

4.
Miyakawa T . No raw data, no science: another possible source of the reproducibility crisis. Mol Brain. 2020; 13(1):24. PMC: 7033918. DOI: 10.1186/s13041-020-0552-2. View

5.
Holler Y, Uhl A, Bathke A, Thomschewski A, Butz K, Nardone R . Reliability of EEG Measures of Interaction: A Paradigm Shift Is Needed to Fight the Reproducibility Crisis. Front Hum Neurosci. 2017; 11:441. PMC: 5582168. DOI: 10.3389/fnhum.2017.00441. View