» Articles » PMID: 33356152

-A New, Extensible Set of Molecular Substructures for QSAR/QSPR and Read-Across Predictions

Overview
Specialty Toxicology
Date 2020 Dec 28
PMID 33356152
Citations 10
Authors
Affiliations
Soon will be listed here.
Abstract

Molecular structure-based predictive models provide a proven alternative to costly and inefficient animal testing. However, due to a lack of interpretability of predictive models built with abstract molecular descriptors they have earned the notoriety of being black boxes. Interpretable models require interpretable descriptors to provide chemistry-backed predictive reasoning and facilitate intelligent molecular design. We developed a novel set of extensible chemistry-aware substructures, , to support interpretable predictive models and read-across protocols. Performance of in chemical characterization and search for structurally similar actives for read-across applications was compared with four publicly available fingerprint sets (MACCS (166), PubChem (881), ECFP4 (1024), ToxPrint (729)) in three benchmark sets (MUV, ULS, and Tox21) spanning ∼145 000 compounds and 78 molecular targets at 1%, 2%, 5%, and 10% false discovery rates. In 18 of the 20 comparisons, interpretable features performed better than the publicly available, but less interpretable and fixed-bit length, fingerprints. Examples are provided to show the enhanced capability of in extracting compounds with higher scaffold similarity. features are interpretable and efficiently characterize diverse chemical collections, thus making them a better choice for building interpretable predictive models and read-across protocols.

Citing Articles

A Novel Machine Learning Model and a Web Portal for Predicting the Human Skin Sensitization Effects of Chemical Agents.

Tieghi R, Moreira-Filho J, Martin H, Wellnitz J, Otoch M, Rath M Toxics. 2024; 12(11).

PMID: 39590983 PMC: 11598222. DOI: 10.3390/toxics12110803.


An in vitro and machine learning framework for quantifying serum albumin binding of per- and polyfluoroalkyl substances.

Starnes H, Green A, Reif D, Belcher S Toxicol Sci. 2024; 203(1):67-78.

PMID: 39298512 PMC: 11664106. DOI: 10.1093/toxsci/kfae124.


A Systematic Analysis of Read-Across Adaptations in Testing Proposal Evaluations by the European Chemicals Agency.

Roe H, Tsai H, Ball N, Wright F, Chiu W, Rusyn I bioRxiv. 2024; .

PMID: 39257792 PMC: 11384022. DOI: 10.1101/2024.08.29.610278.


Characterizing PFAS hazards and risks: a human population-based in vitro cardiotoxicity assessment strategy.

Ford L, Lin H, Zhou Y, Wright F, Gombar V, Sedykh A Hum Genomics. 2024; 18(1):92.

PMID: 39218963 PMC: 11368000. DOI: 10.1186/s40246-024-00665-x.


Hazard and risk characterization of 56 structurally diverse PFAS using a targeted battery of broad coverage assays using six human cell types.

Ford L, Lin H, Tsai H, Zhou Y, Wright F, Sedykh A Toxicology. 2024; 503:153763.

PMID: 38423244 PMC: 11214689. DOI: 10.1016/j.tox.2024.153763.