
Simultaneous Identification of Specifically Interacting Paralogs and Interprotein Contacts by Direct Coupling Analysis

Overview
Specialty Science
Date 2016 Oct 13
PMID 27729520
Citations 50
Abstract

Understanding protein-protein interactions is central to our understanding of almost all complex biological processes. Computational tools exploiting rapidly growing genomic databases to characterize protein-protein interactions are urgently needed. Such methods should connect multiple scales from evolutionarily conserved interactions between families of homologous proteins, over the identification of specifically interacting proteins in the case of multiple paralogs inside a species, down to the prediction of residues being in physical contact across interaction interfaces. Statistical inference methods detecting residue-residue coevolution have recently triggered considerable progress in using sequence data for quaternary protein structure prediction; they require, however, large joint alignments of homologous protein pairs known to interact. The generation of such alignments is a complex computational task on its own; application of coevolutionary modeling has, in turn, been restricted to proteins without paralogs, or to bacterial systems with the corresponding coding genes being colocalized in operons. Here we show that the direct coupling analysis of residue coevolution can be extended to connect the different scales, and simultaneously to match interacting paralogs, to identify interprotein residue-residue contacts and to discriminate interacting from noninteracting families in a multiprotein system. Our results extend the potential applications of coevolutionary analysis far beyond cases treatable so far.
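
The paralog-matching step mentioned in the abstract can be illustrated with a toy example. The sketch below is not the authors' procedure: it assumes that a matrix of pairwise interaction scores between the paralogs of two families within one species is already available (for instance, derived from a coevolutionary model), and it simply picks the one-to-one pairing with the highest total score by exhaustive search. The function name `best_pairing` and the score values are hypothetical.

```python
from itertools import permutations


def best_pairing(score):
    """Return (pairing, total) for the one-to-one matching with the highest summed score.

    score[i][j] is a hypothetical interaction score between paralog i of
    family A and paralog j of family B in the same species. The matrix is
    assumed to be square (equal numbers of paralogs in both families).
    """
    n = len(score)
    best_perm, best_total = None, float("-inf")
    # Exhaustive search over all n! matchings; only sensible for small n.
    for perm in permutations(range(n)):
        total = sum(score[i][perm[i]] for i in range(n))
        if total > best_total:
            best_perm, best_total = perm, total
    return list(enumerate(best_perm)), best_total


# Toy scores for three paralogs per family (illustrative values only).
scores = [
    [0.9, 0.1, 0.2],
    [0.2, 0.1, 0.8],
    [0.1, 0.7, 0.3],
]

pairing, total = best_pairing(scores)
print(pairing)  # list of (index in family A, index in family B)
```

Exhaustive search scales factorially with the number of paralogs, so a realistic implementation would more plausibly use an assignment solver such as `scipy.optimize.linear_sum_assignment` with `maximize=True`, and would re-estimate the scores as the matched alignment grows.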

Citing Articles

Direct coupling analysis and the attention mechanism.

Caredda F, Pagnani A. BMC Bioinformatics. 2025; 26(1):41.

PMID: 39915710 PMC: 11804077. DOI: 10.1186/s12859-025-06062-y.


The Historical Evolution and Significance of Multiple Sequence Alignment in Molecular Structure and Function Prediction.

Zhang C, Wang Q, Li Y, Teng A, Hu G, Wuyun Q. Biomolecules. 2025; 14(12).

PMID: 39766238 PMC: 11673352. DOI: 10.3390/biom14121531.


DiffPaSS-high-performance differentiable pairing of protein sequences using soft scores.

Lupo U, Sgarbossa D, Milighetti M, Bitbol A. Bioinformatics. 2024; 41(1).

PMID: 39672677 PMC: 11676329. DOI: 10.1093/bioinformatics/btae738.


Impact of phylogeny on the inference of functional sectors from protein sequence data.

Dietler N, Abbara A, Choudhury S, Bitbol A. PLoS Comput Biol. 2024; 20(9):e1012091.

PMID: 39312591 PMC: 11449291. DOI: 10.1371/journal.pcbi.1012091.


Pairing interacting protein sequences using masked language modeling.

Lupo U, Sgarbossa D, Bitbol A. Proc Natl Acad Sci U S A. 2024; 121(27):e2311887121.

PMID: 38913900 PMC: 11228504. DOI: 10.1073/pnas.2311887121.

