» Articles » PMID: 23795551

Deep Architectures and Deep Learning in Chemoinformatics: the Prediction of Aqueous Solubility for Drug-like Molecules

Overview
Date 2013 Jun 26
PMID 23795551
Citations 146
Authors
Affiliations
Soon will be listed here.
Abstract

Shallow machine learning methods have been applied to chemoinformatics problems with some success. As more data becomes available and more complex problems are tackled, deep machine learning methods may also become useful. Here, we present a brief overview of deep learning methods and show in particular how recursive neural network approaches can be applied to the problem of predicting molecular properties. However, molecules are typically described by undirected cyclic graphs, while recursive approaches typically use directed acyclic graphs. Thus, we develop methods to address this discrepancy, essentially by considering an ensemble of recursive neural networks associated with all possible vertex-centered acyclic orientations of the molecular graph. One advantage of this approach is that it relies only minimally on the identification of suitable molecular descriptors because suitable representations are learned automatically from the data. Several variants of this approach are applied to the problem of predicting aqueous solubility and tested on four benchmark data sets. Experimental results show that the performance of the deep learning methods matches or exceeds the performance of other state-of-the-art methods according to several evaluation metrics and expose the fundamental limitations arising from training sets that are too small or too noisy. A Web-based predictor, AquaSol, is available online through the ChemDB portal ( cdb.ics.uci.edu ) together with additional material.

Citing Articles

hERGAT: predicting hERG blockers using graph attention mechanism through atom- and molecule-level interaction analyses.

Lee D, Yoo S J Cheminform. 2025; 17(1):11.

PMID: 39875959 PMC: 11776176. DOI: 10.1186/s13321-025-00957-x.


Chemically Informed Deep Learning for Interpretable Radical Reaction Prediction.

Tavakoli M, Chiu Y, Carlton A, Van Vranken D, Baldi P J Chem Inf Model. 2025; 65(3):1228-1242.

PMID: 39871741 PMC: 11815866. DOI: 10.1021/acs.jcim.4c01901.


Role of Artificial Intelligence in Drug Discovery to Revolutionize the Pharmaceutical Industry: Resources, Methods and Applications.

Singh P, Sachan K, Khandelwal V, Singh S, Singh S Recent Pat Biotechnol. 2025; 19(1):35-52.

PMID: 39840410 DOI: 10.2174/0118722083297406240313090140.


Optimization of HIV drugs through MCDM technique Analytic Hierarchy Process(AHP).

Farooq F, Sultana S, Alqahtani N, Imran M PLoS One. 2025; 20(1):e0316617.

PMID: 39823527 PMC: 11741607. DOI: 10.1371/journal.pone.0316617.


Topological indices based VIKOR assisted multi-criteria decision technique for lung disorders.

Ashraf T, Idrees N Front Chem. 2024; 12:1407911.

PMID: 39380949 PMC: 11459094. DOI: 10.3389/fchem.2024.1407911.


References
1.
Kamlet M, Doherty R, Abboud J, Abraham M, Taft R . Linear solvation energy relationships: 36. Molecular properties governing solubilities of organic nonelectrolytes in water. J Pharm Sci. 1986; 75(4):338-49. DOI: 10.1002/jps.2600750405. View

2.
Hinton G, Osindero S, Teh Y . A fast learning algorithm for deep belief nets. Neural Comput. 2006; 18(7):1527-54. DOI: 10.1162/neco.2006.18.7.1527. View

3.
Azencott C, Ksikes A, Swamidass S, Chen J, Ralaivola L, Baldi P . One- to four-dimensional kernels for virtual screening and the prediction of physical, chemical, and biological properties. J Chem Inf Model. 2007; 47(3):965-74. DOI: 10.1021/ci600397p. View

4.
Reynolds J, Gilbert D, Tanford C . Empirical correlation between hydrophobic free energy and aqueous cavity surface area. Proc Natl Acad Sci U S A. 1974; 71(8):2925-7. PMC: 388590. DOI: 10.1073/pnas.71.8.2925. View

5.
Ralaivola L, Swamidass S, Saigo H, Baldi P . Graph kernels for chemical informatics. Neural Netw. 2005; 18(8):1093-110. DOI: 10.1016/j.neunet.2005.07.009. View