» Articles » PMID: 39804946

Automated and Efficient Sampling of Chemical Reaction Space

Overview
Journal Adv Sci (Weinh)
Date 2025 Jan 13
PMID 39804946
Authors
Affiliations
Soon will be listed here.
Abstract

Machine learning interatomic potentials (MLIPs) promise quantum-level accuracy at classical force field speeds, but their performance hinges on the quality and diversity of training data. An efficient and fully automated approach to sample chemical reaction space without relying on human intuition, addressing a critical gap in MLIP development is presented. The method combines the speed of tight-binding calculations with selective high-level refinement, generating diverse datasets that capture both equilibrium and reactive regions of potential energy surfaces. By employing single-ended growing string and nudged elastic band methods, reaction pathways previously underrepresented in MLIP training sets, particularly near transition states are systematically explored. This approach yields datasets with rich structural and chemical diversity, essential for robust MLIP development. Open-source code is provided for the entire workflow, facilitating the integration of the approach into existing MLIP development pipelines.

Citing Articles

Automated and Efficient Sampling of Chemical Reaction Space.

Lee M, Ucak U, Jeong J, Ashyrmamatov I, Lee J, Sim E Adv Sci (Weinh). 2025; 12(9):e2409009.

PMID: 39804946 PMC: 11884589. DOI: 10.1002/advs.202409009.

References
1.
Chen R, Shao K, Fu B, Zhang D . Fitting potential energy surfaces with fundamental invariant neural network. II. Generating fundamental invariants for molecular systems with up to ten atoms. J Chem Phys. 2020; 152(20):204307. DOI: 10.1063/5.0010104. View

2.
Chai J, Head-Gordon M . Long-range corrected hybrid density functionals with damped atom-atom dispersion corrections. Phys Chem Chem Phys. 2008; 10(44):6615-20. DOI: 10.1039/b810189b. View

3.
Bannwarth C, Ehlert S, Grimme S . GFN2-xTB-An Accurate and Broadly Parametrized Self-Consistent Tight-Binding Quantum Chemical Method with Multipole Electrostatics and Density-Dependent Dispersion Contributions. J Chem Theory Comput. 2019; 15(3):1652-1671. DOI: 10.1021/acs.jctc.8b01176. View

4.
Qu C, Houston P, Conte R, Nandi A, Bowman J . MULTIMODE Calculations of Vibrational Spectroscopy and 1d Interconformer Tunneling Dynamics in Glycine Using a Full-Dimensional Potential Energy Surface. J Phys Chem A. 2021; 125(24):5346-5354. DOI: 10.1021/acs.jpca.1c03738. View

5.
Devereux C, Smith J, Huddleston K, Barros K, Zubatyuk R, Isayev O . Extending the Applicability of the ANI Deep Learning Molecular Potential to Sulfur and Halogens. J Chem Theory Comput. 2020; 16(7):4192-4202. DOI: 10.1021/acs.jctc.0c00121. View