» Articles » PMID: 32361862

Interpretation of Machine Learning Models Using Shapley Values: Application to Compound Potency and Multi-target Activity Predictions

Overview
Publisher Springer
Date 2020 May 4
PMID 32361862
Citations 104
Authors
Affiliations
Soon will be listed here.
Abstract

Difficulties in interpreting machine learning (ML) models and their predictions limit the practical applicability of and confidence in ML in pharmaceutical research. There is a need for agnostic approaches aiding in the interpretation of ML models regardless of their complexity that is also applicable to deep neural network (DNN) architectures and model ensembles. To these ends, the SHapley Additive exPlanations (SHAP) methodology has recently been introduced. The SHAP approach enables the identification and prioritization of features that determine compound classification and activity prediction using any ML model. Herein, we further extend the evaluation of the SHAP methodology by investigating a variant for exact calculation of Shapley values for decision tree methods and systematically compare this variant in compound activity and potency value predictions with the model-independent SHAP method. Moreover, new applications of the SHAP analysis approach are presented including interpretation of DNN models for the generation of multi-target activity profiles and ensemble regression models for potency prediction.

Citing Articles

KinasePred: A Computational Tool for Small-Molecule Kinase Target Prediction.

Di Stefano M, Piazza L, Poles C, Galati S, Granchi C, Giordano A Int J Mol Sci. 2025; 26(5).

PMID: 40076779 PMC: 11900317. DOI: 10.3390/ijms26052157.


Prediction of contrast-associated acute kidney injury with machine-learning in patients undergoing contrast-enhanced computed tomography in emergency department.

Lee K, Jung W, Jeon J, Chang H, Lee J, Huh W Sci Rep. 2025; 15(1):7088.

PMID: 40016350 PMC: 11868533. DOI: 10.1038/s41598-025-86933-9.


Development and validation of a machine learning approach for screening new leprosy cases based on the leprosy suspicion questionnaire.

Mendonca Ramos Simoes M, Rocha Lima F, Barbosa Lugao H, de Paula N, Lincoln Silva C, Ramos A Sci Rep. 2025; 15(1):6912.

PMID: 40011614 PMC: 11865526. DOI: 10.1038/s41598-025-91462-6.


Explanatory AI Predicts the Diet Adopted Based on Nutritional and Lifestyle Habits in the Spanish Population.

Sandri E, Cerda Olmedo G, Piredda M, Werner L, Dentamaro V Eur J Investig Health Psychol Educ. 2025; 15(2).

PMID: 39997075 PMC: 11854735. DOI: 10.3390/ejihpe15020011.


Writing the Signs: An Explainable Machine Learning Approach for Alzheimer's Disease Classification from Handwriting.

Ho N, Gonzalez P, Gogovi G Healthc Technol Lett. 2025; 12(1):e70006.

PMID: 39949642 PMC: 11822997. DOI: 10.1049/htl2.70006.


References
1.
Lundberg S, Erion G, Chen H, DeGrave A, Prutkin J, Nair B . From Local Explanations to Global Understanding with Explainable AI for Trees. Nat Mach Intell. 2020; 2(1):56-67. PMC: 7326367. DOI: 10.1038/s42256-019-0138-9. View

2.
Dimova D, Bajorath J . Assessing Scaffold Diversity of Kinase Inhibitors Using Alternative Scaffold Concepts and Estimating the Scaffold Hopping Potential for Different Kinases. Molecules. 2017; 22(5). PMC: 6154288. DOI: 10.3390/molecules22050730. View

3.
Rodriguez-Perez R, Vogt M, Bajorath J . Support Vector Machine Classification and Regression Prioritize Different Structural Features for Binary Compound Activity and Potency Value Prediction. ACS Omega. 2018; 2(10):6371-6379. PMC: 6045367. DOI: 10.1021/acsomega.7b01079. View

4.
Polishchuk P . Interpretation of Quantitative Structure-Activity Relationship Models: Past, Present, and Future. J Chem Inf Model. 2017; 57(11):2618-2639. DOI: 10.1021/acs.jcim.7b00274. View

5.
Baskin I, Ait A, Halberstam N, Palyulin V, Zefirov N . An approach to the interpretation of backpropagation neural network models in QSAR studies. SAR QSAR Environ Res. 2002; 13(1):35-41. DOI: 10.1080/10629360290002073. View