iScore: a Novel Graph Kernel-based Function for Scoring Protein-protein Docking Models
Motivation: Protein complexes play critical roles in many biological functions. Three-dimensional (3D) structures of protein complexes are essential for understanding the structural bases of interactions and their roles in the biomolecular pathways that orchestrate key cellular processes. Because experimental determination of 3D protein complex structures is expensive and laborious, computational docking has become a valuable tool for predicting the 3D structures of biomolecular complexes. Despite recent progress, reliably distinguishing near-native docking conformations from a large number of candidate conformations, the so-called scoring problem, remains a major challenge.
Results: Here we present iScore, a novel approach to scoring docked conformations that combines HADDOCK energy terms with a score obtained using a graph representation of the protein-protein interfaces and a measure of evolutionary conservation. It achieves a scoring performance competitive with, or superior to, that of state-of-the-art scoring functions on two independent datasets: (i) docking software-specific models and (ii) the CAPRI score set, which is generated by a wide variety of docking approaches (i.e. docking software-non-specific). iScore ranks among the top scoring approaches on the CAPRI score set (13 targets) when compared with the 37 scoring groups in CAPRI. The results demonstrate the utility of combining evolutionary, topological and energetic information for scoring docked conformations. This work represents the first successful application of graph kernels to protein interfaces for effective discrimination of near-native and non-native conformations of protein complexes.
Availability and Implementation: The iScore code is freely available from GitHub: https://github.com/DeepRank/iScore (DOI: 10.5281/zenodo.2630567). The docking models used are available from SBGrid: https://data.sbgrid.org/dataset/684.
Supplementary Information: Supplementary data are available at Bioinformatics online.
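To make the graph-kernel idea in the Results concrete, the following is a minimal sketch of a generic walk-based kernel between two node-labelled graphs. It is not the iScore implementation: the function name `walk_kernel`, the string node labels, and the `max_len` and `decay` parameters are assumptions chosen for illustration, and the abstract does not specify how iScore labels interface nodes or weights walks. The sketch counts label-matching walks shared by two graphs, so that interface graphs with similar labelled topology receive a higher similarity value.

```python
from itertools import product


def walk_kernel(labels_a, edges_a, labels_b, edges_b, max_len=3, decay=0.5):
    """Count label-matching walks shared by two undirected graphs.

    Illustrative only: nodes are integers 0..n-1, labels are arbitrary
    hashable values, and walks of length k are weighted by decay**k.
    """

    def adjacency(n, edges):
        # Build neighbour sets for an undirected graph.
        adj = [set() for _ in range(n)]
        for u, v in edges:
            adj[u].add(v)
            adj[v].add(u)
        return adj

    adj_a = adjacency(len(labels_a), edges_a)
    adj_b = adjacency(len(labels_b), edges_b)

    # Nodes of the direct product graph: pairs of nodes with equal labels.
    nodes = [
        (i, j)
        for i, j in product(range(len(labels_a)), range(len(labels_b)))
        if labels_a[i] == labels_b[j]
    ]
    node_set = set(nodes)

    # counts[p] = number of walks of the current length that end at p.
    counts = {p: 1.0 for p in nodes}
    total = float(len(nodes))  # walks of length 0
    weight = 1.0

    for _ in range(max_len):
        weight *= decay
        extended = {}
        for i, j in nodes:
            # Extend every walk by one step that is valid in both graphs.
            extended[(i, j)] = sum(
                counts[(k, l)]
                for k in adj_a[i]
                for l in adj_b[j]
                if (k, l) in node_set
            )
        counts = extended
        total += weight * sum(counts.values())

    return total


# Hypothetical two-node graphs with made-up labels.
same = walk_kernel(["H", "P"], [(0, 1)], ["H", "P"], [(0, 1)], max_len=2)
different = walk_kernel(["H", "P"], [(0, 1)], ["H", "H"], [(0, 1)], max_len=2)
print(same, different)
```

In a scoring setting, kernel values of this kind between a candidate model's interface graph and graphs of known near-native and non-native interfaces could feed a kernel classifier, whose output would then be combined with energy terms. The specific classifier and weighting are not given in the abstract and are left open here.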