» Articles » PMID: 24037213

Protein Evolution Along Phylogenetic Histories Under Structurally Constrained Substitution Models

Overview
Journal Bioinformatics
Specialty Biology
Date 2013 Sep 17
PMID 24037213
Citations 29
Authors
Affiliations
Soon will be listed here.
Abstract

Motivation: Models of molecular evolution aim at describing the evolutionary processes at the molecular level. However, current models rarely incorporate information from protein structure. Conversely, structure-based models of protein evolution have not been commonly applied to simulate sequence evolution in a phylogenetic framework, and they often ignore relevant evolutionary processes such as recombination. A simulation evolutionary framework that integrates substitution models that account for protein structure stability should be able to generate more realistic in silico evolved proteins for a variety of purposes.

Results: We developed a method to simulate protein evolution that combines models of protein folding stability, such that the fitness depends on the stability of the native state both with respect to unfolding and misfolding, with phylogenetic histories that can be either specified by the user or simulated with the coalescent under complex evolutionary scenarios, including recombination, demographics and migration. We have implemented this framework in a computer program called ProteinEvolver. Remarkably, comparing these models with empirical amino acid replacement models, we found that the former produce amino acid distributions closer to distributions observed in real protein families, and proteins that are predicted to be more stable. Therefore, we conclude that evolutionary models that consider protein stability and realistic evolutionary histories constitute a better approximation of the real evolutionary process.

Citing Articles

nT4X and nT4M: Novel Time Non-reversible Mixture Amino Acid Substitution Models.

Tinh N, Dang C, Vinh L J Mol Evol. 2025; 93(1):136-148.

PMID: 39832000 DOI: 10.1007/s00239-024-10230-8.


Genome-wide identification of the key kinesin genes during fiber and boll development in upland cotton (Gossypium hirsutum L.).

Zhu H, Xu J, Yu K, Wu J, Xu H, Wang S Mol Genet Genomics. 2024; 299(1):38.

PMID: 38517563 DOI: 10.1007/s00438-024-02093-x.


Selection among site-dependent structurally constrained substitution models of protein evolution by approximate Bayesian computation.

Ferreiro D, Branco C, Arenas M Bioinformatics. 2024; 40(3).

PMID: 38374231 PMC: 10914458. DOI: 10.1093/bioinformatics/btae096.


Substitution Models of Protein Evolution with Selection on Enzymatic Activity.

Ferreiro D, Khalil R, Sousa S, Arenas M Mol Biol Evol. 2024; 41(2).

PMID: 38314876 PMC: 10873502. DOI: 10.1093/molbev/msae026.


Genome-wide identification of the key Kinesin genes during fiber and boll development in upland cotton (Gossypium hirsutum L).

Zhu H, Xu J, Yu K, Wu J, Xu H, Wang S Mol Genet Genomics. 2024; 299(1):2.

PMID: 38200363 DOI: 10.1007/s00438-023-02087-1.


References
1.
Grahnen J, Nandakumar P, Kubelka J, Liberles D . Biophysical and structural considerations for protein sequence evolution. BMC Evol Biol. 2011; 11:361. PMC: 3292521. DOI: 10.1186/1471-2148-11-361. View

2.
Nasrallah C, Mathews D, Huelsenbeck J . Quantifying the impact of dependent evolution among sites in phylogenetic inference. Syst Biol. 2010; 60(1):60-73. PMC: 2997629. DOI: 10.1093/sysbio/syq074. View

3.
Vendruscolo M, Kussell E, Domany E . Recovery of protein structure from contact maps. Fold Des. 1997; 2(5):295-306. DOI: 10.1016/S1359-0278(97)00041-2. View

4.
Simon-Loriere E, Galetto R, Hamoudi M, Archer J, Lefeuvre P, Martin D . Molecular mechanisms of recombination restriction in the envelope gene of the human immunodeficiency virus. PLoS Pathog. 2009; 5(5):e1000418. PMC: 2671596. DOI: 10.1371/journal.ppat.1000418. View

5.
Arenas M, Posada D . The effect of recombination on the reconstruction of ancestral sequences. Genetics. 2010; 184(4):1133-9. PMC: 2865913. DOI: 10.1534/genetics.109.113423. View