InterPepRank: Assessment of Docked Peptide Conformations by a Deep Graph Network
Peptide-protein interactions between a smaller or disordered peptide stretch and a folded receptor make up a large part of all protein-protein interactions. A common approach for modeling such interactions is to exhaustively sample the conformational space by fast-Fourier-transform docking, and then refine a top percentage of the decoys. Methods that can rank decoys for selection fast enough for larger-scale studies commonly rely on first-principle energy terms, such as electrostatics and van der Waals forces, or on pre-calculated statistical potentials. We present InterPepRank for peptide-protein complex scoring and ranking. InterPepRank is a machine learning-based method that encodes the structure of the complex as a graph, with physical pairwise interactions as edges and evolutionary and sequence features as nodes. The graph network is trained to predict the ligand root-mean-square deviation (LRMSD) of decoys by using edge-conditioned graph convolutions on a large set of peptide-protein complex decoys. InterPepRank is tested on a massive independent test set in which no target shares CATH annotation or more than 30% sequence identity with any target in the training or validation data. On this set, InterPepRank has a median AUC of 0.86 for finding coarse peptide-protein complexes with LRMSD < 4 Å. This is an improvement over other state-of-the-art ranking methods, which have a median AUC between 0.65 and 0.79. When used to select decoys for refinement in a previously established peptide docking pipeline, InterPepRank improves the number of medium and high quality models produced by 80% and 40%, respectively. The InterPepRank program as well as all scripts for reproducing and retraining it are available from: .
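To make the reported metric concrete, the following is a minimal sketch of how a per-target ranking AUC with an LRMSD < 4 Å cutoff, and its median over targets, could be computed. It is not taken from the InterPepRank code base: the function names `ranking_auc` and `median_auc`, the toy numbers, and the choice to treat a lower predicted LRMSD as a better score are all assumptions made for illustration.

```python
from statistics import median


def ranking_auc(predicted_lrmsd, true_lrmsd, cutoff=4.0):
    """ROC AUC for separating decoys with true LRMSD < cutoff from the rest.

    Assumption: a lower predicted LRMSD means a better-ranked decoy.
    Returns None when a target has no positives or no negatives.
    """
    pairs = list(zip(predicted_lrmsd, true_lrmsd))
    pos = [p for p, t in pairs if t < cutoff]   # acceptable decoys
    neg = [p for p, t in pairs if t >= cutoff]  # incorrect decoys
    if not pos or not neg:
        return None
    # AUC equals the fraction of (positive, negative) pairs ranked correctly,
    # with ties counted as half a correct pair.
    wins = 0.0
    for p in pos:
        for n in neg:
            if p < n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def median_auc(targets, cutoff=4.0):
    """Median of per-target AUCs, skipping targets where AUC is undefined."""
    aucs = [ranking_auc(pred, true, cutoff) for pred, true in targets.values()]
    return median(a for a in aucs if a is not None)


# Hypothetical toy data: target name -> (predicted LRMSD, true LRMSD) in Å.
toy_targets = {
    "target_A": ([2.0, 5.0, 3.0, 8.0], [1.5, 6.0, 7.0, 9.0]),
    "target_B": ([6.0, 3.0, 4.0, 9.0], [2.0, 3.5, 10.0, 12.0]),
}

auc_a = ranking_auc(*toy_targets["target_A"])
auc_b = ranking_auc(*toy_targets["target_B"])
overall = median_auc(toy_targets)
print(auc_a, auc_b, overall)
```

The pairwise count is quadratic in the number of decoys, which is fine for a sketch; for the decoy counts produced by exhaustive docking, a rank-based formulation or a library routine would be the more practical choice.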