» Articles » PMID: 20176949

Mutation-selection Models of Coding Sequence Evolution with Site-heterogeneous Amino Acid Fitness Profiles

Overview
Specialty Science
Date 2010 Feb 24
PMID 20176949
Citations 71
Authors
Affiliations
Soon will be listed here.
Abstract

Modeling the interplay between mutation and selection at the molecular level is key to evolutionary studies. To this end, codon-based evolutionary models have been proposed as pertinent means of studying long-range evolutionary patterns and are widely used. However, these approaches have not yet consolidated results from amino acid level phylogenetic studies showing that selection acting on proteins displays strong site-specific effects, which translate into heterogeneous amino acid propensities across the columns of alignments; related codon-level studies have instead focused on either modeling a single selective context for all codon columns, or a separate selective context for each codon column, with the former strategy deemed too simplistic and the latter deemed overparameterized. Here, we integrate recent developments in nonparametric statistical approaches to propose a probabilistic model that accounts for the heterogeneity of amino acid fitness profiles across the coding positions of a gene. We apply the model to a dozen real protein-coding gene alignments and find it to produce biologically plausible inferences, for instance, as pertaining to site-specific amino acid constraints, as well as distributions of scaled selection coefficients. In their account of mutational features as well as the heterogeneous regimes of selection at the amino acid level, the modeling approaches studied here can form a backdrop for several extensions, accounting for other selective features, for variable population size, or for subtleties of mutational features, all with parameterizations couched within population-genetic theory.

Citing Articles

Estimating the proportion of beneficial mutations that are not adaptive in mammals.

Latrille T, Joseph J, Hartasanchez D, Salamin N PLoS Genet. 2024; 20(12):e1011536.

PMID: 39724093 PMC: 11709321. DOI: 10.1371/journal.pgen.1011536.


Increased Positive Selection in Highly Recombining Genes Does not Necessarily Reflect an Evolutionary Advantage of Recombination.

Joseph J Mol Biol Evol. 2024; 41(6).

PMID: 38829800 PMC: 11173204. DOI: 10.1093/molbev/msae107.


CAAStools: a toolbox to identify and test Convergent Amino Acid Substitutions.

Barteri F, Valenzuela A, Farre X, de Juan D, Muntane G, Esteve-Altava B Bioinformatics. 2023; 39(10).

PMID: 37846039 PMC: 10598582. DOI: 10.1093/bioinformatics/btad623.


Interlocus Gene Conversion, Natural Selection, and Paralog Homogenization.

Yang Y, Xu T, Conant G, Kishino H, Thorne J, Ji X Mol Biol Evol. 2023; 40(9).

PMID: 37675606 PMC: 10503786. DOI: 10.1093/molbev/msad198.


Beginner's Guide on the Use of PAML to Detect Positive Selection.

Alvarez-Carretero S, Kapli P, Yang Z Mol Biol Evol. 2023; 40(4).

PMID: 37096789 PMC: 10127084. DOI: 10.1093/molbev/msad041.


References
1.
Thorne J, Choi S, Yu J, Higgs P, Kishino H . Population genetics without intraspecific data. Mol Biol Evol. 2007; 24(8):1667-77. DOI: 10.1093/molbev/msm085. View

2.
OBrien J, Minin V, Suchard M . Learning to count: robust estimates for labeled distances between molecular sequences. Mol Biol Evol. 2009; 26(4):801-14. PMC: 2734148. DOI: 10.1093/molbev/msp003. View

3.
Yang Z, Nielsen R . Mutation-selection models of codon substitution and their use to estimate selective strengths on codon usage. Mol Biol Evol. 2008; 25(3):568-79. DOI: 10.1093/molbev/msm284. View

4.
Lartillot N . Conjugate Gibbs sampling for Bayesian phylogenetic models. J Comput Biol. 2007; 13(10):1701-22. DOI: 10.1089/cmb.2006.13.1701. View

5.
Poux C, Chevret P, Huchon D, de Jong W, Douzery E . Arrival and diversification of caviomorph rodents and platyrrhine primates in South America. Syst Biol. 2006; 55(2):228-44. DOI: 10.1080/10635150500481390. View