A Quantum Chemical Interaction Energy Dataset for Accurately Modeling Protein-ligand Interactions
Affiliations
Fast and accurate calculation of intermolecular interaction energies is desirable for understanding many chemical and biological processes, including the binding of small molecules to proteins. The Splinter ["Symmetry-adapted perturbation theory (SAPT0) protein-ligand interaction"] dataset has been created to facilitate the development and improvement of methods for performing such calculations. Molecular fragments representing commonly found substructures in proteins and small-molecule ligands were paired into >9000 unique dimers, assembled into numerous configurations using an approach designed to adequately cover the breadth of the dimers' potential energy surfaces while enhancing sampling in favorable regions. ~1.5 million configurations of these dimers were randomly generated, and a structurally diverse subset of these were minimized to obtain an additional ~80 thousand local and global minima. For all >1.6 million configurations, SAPT0 calculations were performed with two basis sets to complete the dataset. It is expected that Splinter will be a useful benchmark dataset for training and testing various methods for the calculation of intermolecular interaction energies.
MORE-Q, a dataset for molecular olfactorial receptor engineering by quantum mechanics.
Chen L, Medrano Sandonas L, Traber P, Dianat A, Tverdokhleb N, Hurevich M Sci Data. 2025; 12(1):324.
PMID: 39987132 PMC: 11846975. DOI: 10.1038/s41597-025-04616-6.
Kriz K, van der Spoel D J Phys Chem Lett. 2024; 15(39):9974-9978.
PMID: 39314113 PMC: 11457221. DOI: 10.1021/acs.jpclett.4c02034.
A physics-aware neural network for protein-ligand interactions with quantum chemical accuracy.
Glick Z, Metcalf D, Glick C, Spronk S, Koutsoukas A, Cheney D Chem Sci. 2024; 15(33):13313-13324.
PMID: 39183910 PMC: 11339967. DOI: 10.1039/d4sc01029a.
Alibakhshi A, Schafer L Nat Commun. 2024; 15(1):6086.
PMID: 39030194 PMC: 11271626. DOI: 10.1038/s41467-024-50408-8.
Agbaglo D, Summers T, Cheng Q, DeYonker N Phys Chem Chem Phys. 2024; 26(16):12467-12482.
PMID: 38618904 PMC: 11090134. DOI: 10.1039/d3cp06100k.