» Articles » PMID: 33568655

Quantum Chemical Benchmark Databases of Gold-standard Dimer Interaction Energies

Overview
Journal Sci Data
Specialty Science
Date 2021 Feb 11
PMID 33568655
Citations 24
Authors
Affiliations
Soon will be listed here.
Abstract

Advances in computational chemistry create an ongoing need for larger and higher-quality datasets that characterize noncovalent molecular interactions. We present three benchmark collections of quantum mechanical data, covering approximately 3,700 distinct types of interacting molecule pairs. The first collection, which we refer to as DES370K, contains interaction energies for more than 370,000 dimer geometries. These were computed using the coupled-cluster method with single, double, and perturbative triple excitations [CCSD(T)], which is widely regarded as the gold-standard method in electronic structure theory. Our second benchmark collection, a core representative subset of DES370K called DES15K, is intended for more computationally demanding applications of the data. Finally, DES5M, our third collection, comprises interaction energies for nearly 5,000,000 dimer geometries; these were calculated using SNS-MP2, a machine learning approach that provides results with accuracy comparable to that of our coupled-cluster training data. These datasets may prove useful in the development of density functionals, empirically corrected wavefunction-based approaches, semi-empirical methods, force fields, and models trained using machine learning methods.

Citing Articles

MORE-Q, a dataset for molecular olfactorial receptor engineering by quantum mechanics.

Chen L, Medrano Sandonas L, Traber P, Dianat A, Tverdokhleb N, Hurevich M Sci Data. 2025; 12(1):324.

PMID: 39987132 PMC: 11846975. DOI: 10.1038/s41597-025-04616-6.


Refinement of Atomic Polarizabilities for a Polarizable Gaussian Multipole Force Field with Simultaneous Considerations of Both Molecular Polarizability Tensors and In-Solution Electrostatic Potentials.

Duan Y, Wang J, Cieplak P, Luo R J Chem Inf Model. 2025; 65(3):1428-1440.

PMID: 39865620 PMC: 11815842. DOI: 10.1021/acs.jcim.4c02175.


Beyond chemical structures: lessons and guiding principles for the next generation of molecular databases.

Sommer T, Clarke C, Garcia-Melchor M Chem Sci. 2024; 16(3):1002-1016.

PMID: 39660292 PMC: 11626465. DOI: 10.1039/d4sc04064c.


Data Generation for Machine Learning Interatomic Potentials and Beyond.

Kulichenko M, Nebgen B, Lubbers N, Smith J, Barros K, Allen A Chem Rev. 2024; 124(24):13681-13714.

PMID: 39572011 PMC: 11672690. DOI: 10.1021/acs.chemrev.4c00572.


Origin of the intermolecular forces that produce donor-acceptor stacks in π-conjugated charge-transfer complexes.

Tsuzuki S, Ono R, Inoue S, Matsuoka S, Hasegawa T Commun Chem. 2024; 7(1):253.

PMID: 39506085 PMC: 11542100. DOI: 10.1038/s42004-024-01329-6.


References
1.
Hegde G, Bowen R . Machine-learned approximations to Density Functional Theory Hamiltonians. Sci Rep. 2017; 7:42669. PMC: 5309850. DOI: 10.1038/srep42669. View

2.
Schutz M, Werner H, Lindh R, Manby F . Analytical energy gradients for local second-order Møller-Plesset perturbation theory using density fitting approximations. J Chem Phys. 2004; 121(2):737-50. DOI: 10.1063/1.1760747. View

3.
DeYonker N, Peterson K, Wilson A . Systematically convergent correlation consistent basis sets for molecular core-valence correlation effects: the third-row atoms gallium through krypton. J Phys Chem A. 2007; 111(44):11383-93. DOI: 10.1021/jp0747757. View

4.
Hohenstein E, Parrish R, Sherrill C, Turney J, Schaefer 3rd H . Large-scale symmetry-adapted perturbation theory computations via density fitting and Laplace transformation techniques: investigating the fundamental forces of DNA-intercalator interactions. J Chem Phys. 2011; 135(17):174107. DOI: 10.1063/1.3656681. View

5.
Laury M, Wang L, Pande V, Head-Gordon T, Ponder J . Revised Parameters for the AMOEBA Polarizable Atomic Multipole Water Model. J Phys Chem B. 2015; 119(29):9423-9437. PMC: 4772747. DOI: 10.1021/jp510896n. View