» Articles » PMID: 34451887

Predicting the Skin Sensitization Potential of Small Molecules with Machine Learning Models Trained on Biologically Meaningful Descriptors

Overview
Publisher MDPI
Specialty Chemistry
Date 2021 Aug 28
PMID 34451887
Citations 2
Authors
Affiliations
Soon will be listed here.
Abstract

In recent years, a number of machine learning models for the prediction of the skin sensitization potential of small organic molecules have been reported and become available. These models generally perform well within their applicability domains but, as a result of the use of molecular fingerprints and other non-intuitive descriptors, the interpretability of the existing models is limited. The aim of this work is to develop a strategy to replace the non-intuitive features by predicted outcomes of bioassays. We show that such replacement is indeed possible and that as few as ten interpretable, predicted bioactivities are sufficient to reach competitive performance. On a holdout data set of 257 compounds, the best model ("Skin Doctor CP:Bio") obtained an efficiency of 0.82 and an MCC of 0.52 (at the significance level of 0.20). Skin Doctor CP:Bio is available free of charge for academic research. The modeling strategies explored in this work are easily transferable and could be adopted for the development of more interpretable machine learning models for the prediction of the bioactivity and toxicity of small organic compounds.

Citing Articles

The Good, The Bad, and The Perplexing: Structural Alerts and Read-Across for Predicting Skin Sensitization Using Human Data.

Golden E, Ukaegbu D, Ranslow P, Brown R, Hartung T, Maertens A Chem Res Toxicol. 2023; 36(5):734-746.

PMID: 37126467 PMC: 10189792. DOI: 10.1021/acs.chemrestox.2c00383.


Predicting the Skin Sensitization Potential of Small Molecules with Machine Learning Models Trained on Biologically Meaningful Descriptors.

Wilm A, Garcia de Lomana M, Stork C, Mathai N, Hirte S, Norinder U Pharmaceuticals (Basel). 2021; 14(8).

PMID: 34451887 PMC: 8402010. DOI: 10.3390/ph14080790.

References
1.
Di P, Yin Y, Jiang C, Cai Y, Li W, Tang Y . Prediction of the skin sensitising potential and potency of compounds via mechanism-based binary and ternary classification models. Toxicol In Vitro. 2019; 59:204-214. DOI: 10.1016/j.tiv.2019.01.004. View

2.
Zhang J, Hsieh J, Zhu H . Profiling animal toxicants by automatically mining public bioassay data: a big data approach for computational toxicology. PLoS One. 2014; 9(6):e99863. PMC: 4064997. DOI: 10.1371/journal.pone.0099863. View

3.
Ribay K, Kim M, Wang W, Pinolini D, Zhu H . Predictive Modeling of Estrogen Receptor Binding Agents Using Advanced Cheminformatics Tools and Massive Public Data. Front Environ Sci. 2016; 4. PMC: 5023020. DOI: 10.3389/fenvs.2016.00012. View

4.
Matthews B . Comparison of the predicted and observed secondary structure of T4 phage lysozyme. Biochim Biophys Acta. 1975; 405(2):442-51. DOI: 10.1016/0005-2795(75)90109-9. View

5.
Mehling A, Eriksson T, Eltze T, Kolle S, Ramirez T, Teubner W . Non-animal test methods for predicting skin sensitization potentials. Arch Toxicol. 2012; 86(8):1273-95. DOI: 10.1007/s00204-012-0867-6. View