» Articles » PMID: 30136001

Machine Learning for the Prediction of Molecular Dipole Moments Obtained by Density Functional Theory

Overview
Journal J Cheminform
Publisher Biomed Central
Specialty Chemistry
Date 2018 Aug 24
PMID 30136001
Citations 5
Authors
Affiliations
Soon will be listed here.
Abstract

Machine learning (ML) algorithms were explored for the fast estimation of molecular dipole moments calculated by density functional theory (DFT) by B3LYP/6-31G(d,p) on the basis of molecular descriptors generated from DFT-optimized geometries and partial atomic charges obtained by empirical or ML schemes. A database was used with 10,071 structures, new molecular descriptors were designed and the models were validated with external test sets. Several ML algorithms were screened. Random forest regression models predicted an external test set of 3368 compounds achieving mean absolute error up to 0.44 D. The results represent a significant improvement of the dipole moments calculated using empirical point charges located at the nucleus, even assuming the DFT-optimized geometry (root mean square error, RMSE, of 0.68 D vs. 1.53 D and R = 0.87 vs. 0.66).

Citing Articles

-Schistosomal activity and ADMET properties of 1,2,5-oxadiazinane-containing compound synthesized by visible-light photoredox catalysis.

Itoh K, Nakahara H, Takashino A, Hara A, Katsuno A, Abe Y RSC Med Chem. 2024; .

PMID: 39399310 PMC: 11467761. DOI: 10.1039/d4md00599f.


In-silico study unveils potential phytocompounds in Andrographis paniculata against E6 protein of the high-risk HPV-16 subtype for cervical cancer therapy.

Islam M, Hossain M, Hasnat S, Shuvo M, Akter S, Maria M Sci Rep. 2024; 14(1):17182.

PMID: 39060289 PMC: 11282209. DOI: 10.1038/s41598-024-65112-2.


approach: biological prediction of nordentatin derivatives as anticancer agent inhibitors in the cAMP pathway.

Abdjan M, Aminah N, Siswanto I, Thant T, Kristanti A, Takaya Y RSC Adv. 2022; 10(70):42733-42743.

PMID: 35514899 PMC: 9058016. DOI: 10.1039/d0ra07838g.


Machine learning prediction of UV-Vis spectra features of organic compounds related to photoreactive potential.

Mamede R, Pereira F, Aires-de-Sousa J Sci Rep. 2021; 11(1):23720.

PMID: 34887473 PMC: 8660842. DOI: 10.1038/s41598-021-03070-9.


Machine learning models for hydrogen bond donor and acceptor strengths using large and diverse training data generated by first-principles interaction free energies.

Bauer C, Schneider G, Goller A J Cheminform. 2021; 11(1):59.

PMID: 33430967 PMC: 6737620. DOI: 10.1186/s13321-019-0381-4.

References
1.
Dalton L, Sullivan P, Bale D . Electric field poled organic electro-optic materials: state of the art and future prospects. Chem Rev. 2009; 110(1):25-55. DOI: 10.1021/cr9000429. View

2.
Rai B, Bakken G . Fast and accurate generation of ab initio quality atomic charges using nonparametric statistical regression. J Comput Chem. 2013; 34(19):1661-71. DOI: 10.1002/jcc.23308. View

3.
Pereira F, Xiao K, Latino D, Wu C, Zhang Q, Aires-de-Sousa J . Machine Learning Methods to Predict Density Functional Theory B3LYP Energies of HOMO and LUMO Orbitals. J Chem Inf Model. 2016; 57(1):11-21. DOI: 10.1021/acs.jcim.6b00340. View

4.
Blum L, Reymond J . 970 million druglike small molecules for virtual screening in the chemical universe database GDB-13. J Am Chem Soc. 2009; 131(25):8732-3. DOI: 10.1021/ja902302h. View

5.
Yap C . PaDEL-descriptor: an open source software to calculate molecular descriptors and fingerprints. J Comput Chem. 2011; 32(7):1466-74. DOI: 10.1002/jcc.21707. View