» Articles » PMID: 38472223

Enhancing Coevolutionary Signals in Protein-protein Interaction Prediction Through Clade-wise Alignment Integration

Overview
Journal Sci Rep
Specialty Science
Date 2024 Mar 13
PMID 38472223
Authors
Affiliations
Soon will be listed here.
Abstract

Protein-protein interactions (PPIs) play essential roles in most biological processes. The binding interfaces between interacting proteins impose evolutionary constraints that have successfully been employed to predict PPIs from multiple sequence alignments (MSAs). To construct MSAs, critical choices have to be made: how to ensure the reliable identification of orthologs, and how to optimally balance the need for large alignments versus sufficient alignment quality. Here, we propose a divide-and-conquer strategy for MSA generation: instead of building a single, large alignment for each protein, multiple distinct alignments are constructed under distinct clades in the tree of life. Coevolutionary signals are searched separately within these clades, and are only subsequently integrated using machine learning techniques. We find that this strategy markedly improves overall prediction performance, concomitant with better alignment quality. Using the popular DCA algorithm to systematically search pairs of such alignments, a genome-wide all-against-all interaction scan in a bacterial genome is demonstrated. Given the recent successes of AlphaFold in predicting direct PPIs at atomic detail, a discover-and-refine approach is proposed: our method could provide a fast and accurate strategy for pre-screening the entire genome, submitting to AlphaFold only promising interaction candidates-thus reducing false positives as well as computation time.

References
1.
Lesk A, Chothia C . How different amino acid sequences determine similar protein structures: the structure and evolutionary dynamics of the globins. J Mol Biol. 1980; 136(3):225-70. DOI: 10.1016/0022-2836(80)90373-3. View

2.
Marsh J, Teichmann S . Parallel dynamics and evolution: Protein conformational fluctuations and assembly reflect evolutionary changes in sequence and structure. Bioessays. 2013; 36(2):209-18. DOI: 10.1002/bies.201300134. View

3.
Haney P, Badger J, Buldak G, Reich C, Woese C, Olsen G . Thermal adaptation analyzed by comparison of protein sequences from mesophilic and extremely thermophilic Methanococcus species. Proc Natl Acad Sci U S A. 1999; 96(7):3578-83. PMC: 22336. DOI: 10.1073/pnas.96.7.3578. View

4.
Pal C, Papp B, Lercher M . An integrated view of protein evolution. Nat Rev Genet. 2006; 7(5):337-48. DOI: 10.1038/nrg1838. View

5.
Brininger C, Spradlin S, Cobani L, Evilia C . The more adaptive to change, the more likely you are to survive: Protein adaptation in extremophiles. Semin Cell Dev Biol. 2017; 84:158-169. DOI: 10.1016/j.semcdb.2017.12.016. View