Learning Spatial Structures of Proteins Improves Protein-protein Interaction Prediction
Overview
Affiliations
Spatial structures of proteins are closely related to protein functions. Integrating protein structures improves the performance of protein-protein interaction (PPI) prediction. However, the limited quantity of known protein structures restricts the application of structure-based prediction methods. Utilizing the predicted protein structure information is a promising method to improve the performance of sequence-based prediction methods. We propose a novel end-to-end framework, TAGPPI, to predict PPIs using protein sequence alone. TAGPPI extracts multi-dimensional features by employing 1D convolution operation on protein sequences and graph learning method on contact maps constructed from AlphaFold. A contact map contains abundant spatial structure information, which is difficult to obtain from 1D sequence data directly. We further demonstrate that the spatial information learned from contact maps improves the ability of TAGPPI in PPI prediction tasks. We compare the performance of TAGPPI with those of nine state-of-the-art sequence-based methods, and TAGPPI outperforms such methods in all metrics. To the best of our knowledge, this is the first method to use the predicted protein topology structure graph for sequence-based PPI prediction. More importantly, our proposed architecture could be extended to other prediction tasks related to proteins.
PbImpute: Precise Zero Discrimination and Balanced Imputation in Single-Cell RNA Sequencing Data.
Zhang Y, Wang Y, Liu X, Feng X J Chem Inf Model. 2025; 65(5):2670-2684.
PMID: 39957720 PMC: 11898086. DOI: 10.1021/acs.jcim.4c02125.
Zhang K, Tao Y, Wang F Brief Bioinform. 2025; 26(1.
PMID: 39831890 PMC: 11744619. DOI: 10.1093/bib/bbaf008.
Wang L, Li R, Guan X, Yan S Front Plant Sci. 2024; 15:1489116.
PMID: 39687321 PMC: 11646721. DOI: 10.3389/fpls.2024.1489116.
TPepPro: a deep learning model for predicting peptide-protein interactions.
Jin X, Chen Z, Yu D, Jiang Q, Chen Z, Yan B Bioinformatics. 2024; 41(1).
PMID: 39585721 PMC: 11681936. DOI: 10.1093/bioinformatics/btae708.
Chen H, Liu J, Tang G, Hao G, Yang G Genomics Proteomics Bioinformatics. 2024; 22(5).
PMID: 39404802 PMC: 11658832. DOI: 10.1093/gpbjnl/qzae075.