» Articles » PMID: 35767129

First Report of Q-RASAR Modeling Toward an Approach of Easy Interpretability and Efficient Transferability

Overview
Journal Mol Divers
Date 2022 Jun 29
PMID 35767129
Authors
Affiliations
Soon will be listed here.
Abstract

Quantitative structure-activity relationship (QSAR) and read-across techniques have recently been merged into a new emerging field of read-across structure-activity relationship (RASAR) that uses the chemical similarity concepts of read-across (an unsupervised step) and finally develops a supervised learning model (like QSAR). The RASAR method has so far been used only in case of graded predictions or classification modeling. In this work, we attempt, for the first time, to apply RASAR for quantitative predictions (q-RASAR) using a case study of androgen receptor binding affinity data. We have computed a number of error-based and similarity-based measures such as weighted standard deviation of the predicted values, coefficient of variation of the computed predictions, average similarity level of close training compounds for each query molecule, standard deviation and coefficient of variation of similarity levels, maximum similarity levels to positive and negative close training compounds, a concordance measure indicating similarity to positive, negative or both classes of close training compounds, etc. We have clubbed these additional measures along with the selected chemical descriptors from the previously developed QSAR model and redeveloped new partial least squares models from the training set, and predicted the endpoint using the query data set. Interestingly, these new models outperform the internal and external validation quality of the original QSAR model. In this study, we have also introduced a new similarity-based concordance measure (Banerjee-Roy coefficient) that can significantly contribute to the model quality. A q-RASAR model also has the advantage over read-across predictions in providing easy interpretation and indicating quantitative contributions of important chemical features. The strategy described here should be applicable to other biological/toxicological/property data modeling for enhanced quality of predictions, easy interpretability, and efficient transferability.

Citing Articles

The q-RASPR approach for predicting the property and fate of persistent organic pollutants.

Chang C, Banerjee A, Kumar V, Roy K, Benfenati E Sci Rep. 2025; 15(1):1344.

PMID: 39779742 PMC: 11711441. DOI: 10.1038/s41598-024-84778-2.


Machine learning assisted classification RASAR modeling for the nephrotoxicity potential of a curated set of orally active drugs.

Banerjee A, Roy K Sci Rep. 2025; 15(1):808.

PMID: 39755865 PMC: 11700179. DOI: 10.1038/s41598-024-85063-y.


Integrating traditional QSAR and read-across-based regression models for predicting potential anti-leishmanial azole compounds.

Nandi R, Sharma A, Priya A, Kumar D Mol Divers. 2024; .

PMID: 39653961 DOI: 10.1007/s11030-024-11070-w.


Development of hybrid models by the integration of the read-across hypothesis with the QSAR framework for the assessment of developmental and reproductive toxicity (DART) tested according to OECD TG 414.

Pandey S, Roy K Toxicol Rep. 2024; 13:101822.

PMID: 39649380 PMC: 11621937. DOI: 10.1016/j.toxrep.2024.101822.


The round-robin approach applied to nanoinformatics: consensus prediction of nanomaterials zeta potential.

Varsou D, Banerjee A, Roy J, Roy K, Savvas G, Sarimveis H Beilstein J Nanotechnol. 2024; 15:1536-1553.

PMID: 39624206 PMC: 11610486. DOI: 10.3762/bjnano.15.121.


References
1.
Knapen D, Angrish M, Fortin M, Katsiadaki I, Leonard M, Margiotta-Casaluci L . Adverse outcome pathway networks I: Development and applications. Environ Toxicol Chem. 2018; 37(6):1723-1733. PMC: 6004608. DOI: 10.1002/etc.4125. View

2.
Maldonado A, Doucet J, Petitjean M, Fan B . Molecular similarity and diversity in chemoinformatics: from theory to applications. Mol Divers. 2006; 10(1):39-79. DOI: 10.1007/s11030-006-8697-1. View

3.
Schultz T, Amcoff P, Berggren E, Gautier F, Klaric M, Knight D . A strategy for structuring and reporting a read-across prediction of toxicity. Regul Toxicol Pharmacol. 2015; 72(3):586-601. DOI: 10.1016/j.yrtph.2015.05.016. View

4.
Luechtefeld T, Maertens A, Russo D, Rovida C, Zhu H, Hartung T . Analysis of publically available skin sensitization data from REACH registrations 2008-2014. ALTEX. 2016; 33(2):135-48. PMC: 5546098. DOI: 10.14573/altex.1510055. View

5.
Luechtefeld T, Marsh D, Rowlands C, Hartung T . Machine Learning of Toxicological Big Data Enables Read-Across Structure Activity Relationships (RASAR) Outperforming Animal Test Reproducibility. Toxicol Sci. 2018; 165(1):198-212. PMC: 6135638. DOI: 10.1093/toxsci/kfy152. View