» Articles » PMID: 34139218

Extending the Horizon of Homology Detection with Coevolution-based Structure Prediction

Overview
Journal J Mol Biol
Publisher Elsevier
Date 2021 Jun 17
PMID 34139218
Citations 4
Authors
Affiliations
Soon will be listed here.
Abstract

Traditional sequence analysis algorithms fail to identify distant homologies when they lie beyond a detection horizon. In this review, we discuss how co-evolution-based contact and distance prediction methods are pushing back this homology detection horizon, thereby yielding new functional insights and experimentally testable hypotheses. Based on correlated substitutions, these methods divine three-dimensional constraints among amino acids in protein sequences that were previously devoid of all annotated domains and repeats. The new algorithms discern hidden structure in an otherwise featureless sequence landscape. Their revelatory impact promises to be as profound as the use, by archaeologists, of ground-penetrating radar to discern long-hidden, subterranean structures. As examples of this, we describe how triplicated structures reflecting longin domains in MON1A-like proteins, or UVR-like repeats in DISC1, emerge from their predicted contact and distance maps. These methods also help to resolve structures that do not conform to a "beads-on-a-string" model of protein domains. In one such example, we describe CFAP298 whose ubiquitin-like domain was previously challenging to perceive owing to a large sequence insertion within it. More generally, the new algorithms permit an easier appreciation of domain families and folds whose evolution involved structural insertion or rearrangement. As we exemplify with α1-antitrypsin, coevolution-based predicted contacts may also yield insights into protein dynamics and conformational change. This new combination of structure prediction (using innovative co-evolution based methods) and homology inference (using more traditional sequence analysis approaches) shows great promise for bringing into view a sea of evolutionary relationships that had hitherto lain far beyond the horizon of homology detection.

Citing Articles

Genome-Wide Analysis of Proteases and Protease Inhibitors Using Advanced Informatics Provides Insights into Parasite Biology and Host-Parasite Interactions.

Zheng Y, Young N, Song J, Gasser R Int J Mol Sci. 2023; 24(15).

PMID: 37569696 PMC: 10418638. DOI: 10.3390/ijms241512320.


OAF: a new member of the BRICHOS family.

Sanchez-Pulido L, Ponting C Bioinform Adv. 2023; 2(1):vbac087.

PMID: 36699367 PMC: 9714404. DOI: 10.1093/bioadv/vbac087.


MES-3 is a highly divergent ortholog of the canonical PRC2 component SUZ12.

Snel B, van den Heuvel S, Seidl M iScience. 2022; 25(7):104633.

PMID: 35800781 PMC: 9253717. DOI: 10.1016/j.isci.2022.104633.


Collective Variable for Metadynamics Derived From AlphaFold Output.

Spiwok V, Kurecka M, Krenek A Front Mol Biosci. 2022; 9:878133.

PMID: 35769910 PMC: 9234394. DOI: 10.3389/fmolb.2022.878133.

References
1.
Letunic I, Khedkar S, Bork P . SMART: recent updates, new developments and status in 2020. Nucleic Acids Res. 2020; 49(D1):D458-D460. PMC: 7778883. DOI: 10.1093/nar/gkaa937. View

2.
Yang J, Anishchenko I, Park H, Peng Z, Ovchinnikov S, Baker D . Improved protein structure prediction using predicted interresidue orientations. Proc Natl Acad Sci U S A. 2020; 117(3):1496-1503. PMC: 6983395. DOI: 10.1073/pnas.1914677117. View

3.
KROGH A, Larsson B, von Heijne G, Sonnhammer E . Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J Mol Biol. 2001; 305(3):567-80. DOI: 10.1006/jmbi.2000.4315. View

4.
Gobel U, Sander C, Schneider R, Valencia A . Correlated mutations and residue contacts in proteins. Proteins. 1994; 18(4):309-17. DOI: 10.1002/prot.340180402. View

5.
Bateman A, Coggill P, Finn R . DUFs: families in search of function. Acta Crystallogr Sect F Struct Biol Cryst Commun. 2010; 66(Pt 10):1148-52. PMC: 2954198. DOI: 10.1107/S1744309110001685. View