» Articles » PMID: 37166179

A Small Step Toward Generalizability: Training a Machine Learning Scoring Function for Structure-Based Virtual Screening

Abstract

Over the past few years, many machine learning-based scoring functions for predicting the binding of small molecules to proteins have been developed. Their objective is to approximate the distribution which takes two molecules as input and outputs the energy of their interaction. Only a scoring function that accounts for the interatomic interactions involved in binding can accurately predict binding affinity on unseen molecules. However, many scoring functions make predictions based on data set biases rather than an understanding of the physics of binding. These scoring functions perform well when tested on similar targets to those in the training set but fail to generalize to dissimilar targets. To test what a machine learning-based scoring function has learned, input attribution, a technique for learning which features are important to a model when making a prediction on a particular data point, can be applied. If a model successfully learns something beyond data set biases, attribution should give insight into the important binding interactions that are taking place. We built a machine learning-based scoring function that aimed to avoid the influence of bias via thorough train and test data set filtering and show that it achieves comparable performance on the Comparative Assessment of Scoring Functions, 2016 (CASF-2016) benchmark to other leading methods. We then use the CASF-2016 test set to perform attribution and find that the bonds identified as important by PointVS, unlike those extracted from other scoring functions, have a high correlation with those found by a distance-based interaction profiler. We then show that attribution can be used to extract important binding pharmacophores from a given protein target when supplied with a number of bound structures. We use this information to perform fragment elaboration and see improvements in docking scores compared to using structural information from a traditional, data-based approach. This not only provides definitive proof that the scoring function has learned to identify some important binding interactions but also constitutes the first deep learning-based method for extracting structural information from a target for molecule design.

Citing Articles

Narrowing the gap between machine learning scoring functions and free energy perturbation using augmented data.

Valsson I, Warren M, Deane C, Magarkar A, Morris G, Biggin P Commun Chem. 2025; 8(1):41.

PMID: 39922899 PMC: 11807228. DOI: 10.1038/s42004-025-01428-y.


The physics-AI dialogue in drug design.

Vargas-Rosales P, Caflisch A RSC Med Chem. 2025; .

PMID: 39906313 PMC: 11788922. DOI: 10.1039/d4md00869c.


Benchmarking the robustness of the correct identification of flexible 3D objects using common machine learning models.

Zhang Y, Vitalis A Patterns (N Y). 2025; 6(1):101147.

PMID: 39896260 PMC: 11783895. DOI: 10.1016/j.patter.2024.101147.


Robustly interrogating machine learning-based scoring functions: what are they learning?.

Durant G, Boyles F, Birchall K, Marsden B, Deane C Bioinformatics. 2025; 41(2).

PMID: 39874452 PMC: 11821266. DOI: 10.1093/bioinformatics/btaf040.


Identification of Potential Selective PAK4 Inhibitors Through Shape and Protein Conformation Ensemble Screening and Electrostatic-Surface-Matching Optimization.

Zhang X, Zhang M, Li Y, Deng P Curr Issues Mol Biol. 2025; 47(1.

PMID: 39852144 PMC: 11764389. DOI: 10.3390/cimb47010029.


References
1.
Moon S, Zhung W, Yang S, Lim J, Kim W . PIGNet: a physics-informed deep learning model toward generalized drug-target interaction predictions. Chem Sci. 2022; 13(13):3661-3673. PMC: 8966633. DOI: 10.1039/d1sc06946b. View

2.
Imprachim N, Yosaatmadja Y, Newman J . Crystal structures and fragment screening of SARS-CoV-2 NSP14 reveal details of exoribonuclease activation and mRNA capping and provide starting points for antiviral drug development. Nucleic Acids Res. 2022; 51(1):475-487. PMC: 9841433. DOI: 10.1093/nar/gkac1207. View

3.
Shim M, Lee S, Hwang H . Inflated prediction accuracy of neuropsychiatric biomarkers caused by data leakage in feature selection. Sci Rep. 2021; 11(1):7980. PMC: 8042090. DOI: 10.1038/s41598-021-87157-3. View

4.
Liu Z, Su M, Han L, Liu J, Yang Q, Li Y . Forging the Basis for Developing Protein-Ligand Interaction Scoring Functions. Acc Chem Res. 2017; 50(2):302-309. DOI: 10.1021/acs.accounts.6b00491. View

5.
Hochuli J, Helbling A, Skaist T, Ragoza M, Koes D . Visualizing convolutional neural network protein-ligand scoring. J Mol Graph Model. 2018; 84:96-108. PMC: 6343664. DOI: 10.1016/j.jmgm.2018.06.005. View