An Extensive Test of 14 Scoring Functions Using the PDBbind Refined Set of 800 Protein-ligand Complexes
Overview
Medical Informatics
Affiliations
Fourteen popular scoring functions, i.e., X-Score, DrugScore, five scoring functions in the Sybyl software (D-Score, PMF-Score, G-Score, ChemScore, and F-Score), four scoring functions in the Cerius2 software (LigScore, PLP, PMF, and LUDI), two scoring functions in the GOLD program (GoldScore and ChemScore), and HINT, were tested on the refined set of the PDBbind database, a set of 800 diverse protein-ligand complexes with high-resolution crystal structures and experimentally determined Ki or Kd values. The focus of our study was to assess the ability of these scoring functions to predict binding affinities based on the experimentally determined high-resolution crystal structures of proteins in complex with their ligands. The quantitative correlation between the binding scores produced by each scoring function and the known binding constants of the 800 complexes was computed. X-Score, DrugScore, Sybyl::ChemScore, and Cerius2::PLP provided better correlations than the other scoring functions with standard deviations of 1.8-2.0 log units. These four scoring functions were also found to be robust enough to carry out computation directly on unaltered crystal structures. To examine how well scoring functions predict the binding affinities for ligands bound to the same target protein, the performance of these 14 scoring functions were evaluated on three subsets of protein-ligand complexes from the test set: HIV-1 protease complexes (82 entries), trypsin complexes (45 entries), and carbonic anhydrase II complexes (40 entries). Although the results for the HIV-1 protease subset are less than desirable, several scoring functions are able to satisfactorily predict the binding affinities for the trypsin and the carbonic anhydrase II subsets with standard deviation as low as 1.0 log unit (corresponding to 1.3-1.4 kcal/mol at room temperature). Our results demonstrate the strengths as well as the weaknesses of current scoring functions for binding affinity prediction.
Improved Prediction of Ligand-Protein Binding Affinities by Meta-modeling.
Lee H, Emani P, Gerstein M J Chem Inf Model. 2024; 64(23):8684-8704.
PMID: 39576762 PMC: 11632770. DOI: 10.1021/acs.jcim.4c01116.
Breznik M, Ge Y, Bluck J, Briem H, Hahn D, Christ C ChemMedChem. 2022; 18(1):e202200425.
PMID: 36240514 PMC: 9868080. DOI: 10.1002/cmdc.202200425.
Riahi S, Lee J, Wei S, Cost R, Masiero A, Prades C Antib Ther. 2021; 4(2):109-122.
PMID: 34396040 PMC: 8344454. DOI: 10.1093/abt/tbab011.
Bender A, Jenkins J, Li Q, Adams S, Cannon E, Glen R Annu Rep Comput Chem. 2020; 2:141-168.
PMID: 32362803 PMC: 7185533. DOI: 10.1016/S1574-1400(06)02009-3.
do Carmo A, Bettanin F, Oliveira Almeida M, Pantaleao S, Rodrigues T, Homem-de-Mello P Front Chem. 2020; 8:235.
PMID: 32309275 PMC: 7145989. DOI: 10.3389/fchem.2020.00235.