» Articles » PMID: 16752207

Efficient Methods for Estimating Amino Acid Replacement Rates

Overview
Journal J Mol Evol
Specialty Biochemistry
Date 2006 Jun 6
PMID 16752207
Citations 8
Authors
Affiliations
Soon will be listed here.
Abstract

Replacement rate matrices describe the process of evolution at one position in a protein and are used in many applications where proteins are studied with an evolutionary perspective. Several general matrices have been suggested and have proved to be good approximations of the real process. However, there are data for which general matrices are inappropriate, for example, special protein families, certain lineages in the tree of life, or particular parts of proteins. Analysis of such data could benefit from adaption of a data-specific rate matrix. This paper suggests two new methods for estimating replacement rate matrices from independent pairwise protein sequence alignments and also carefully studies Müller-Vingron's resolvent method. Comprehensive tests on synthetic datasets show that both new methods perform better than the resolvent method in a variety of settings. The best method is furthermore demonstrated to be robust on small datasets as well as practical on very large datasets of real data. Neither short nor divergent sequence pairs have to be discarded, making the method economical with data. A generalization to multialignment data is suggested and used in a test on protein-domain family phylogenies, where it is shown that the method offers family-specific rate matrices that often have a significantly better likelihood than a general matrix.

Citing Articles

Past and present giant viruses diversity explored through permafrost metagenomics.

Rigou S, Santini S, Abergel C, Claverie J, Legendre M Nat Commun. 2022; 13(1):5853.

PMID: 36207343 PMC: 9546926. DOI: 10.1038/s41467-022-33633-x.


Genome-Wide Insights Into the Organelle Translocation of Photosynthetic NDH-1 Genes During Evolution.

Yu J, Ran Z, Zhang J, Wei L, Ma W Front Microbiol. 2022; 13:956578.

PMID: 35910652 PMC: 9326235. DOI: 10.3389/fmicb.2022.956578.


Evolutionary and functional lessons from human-specific amino acid substitution matrices.

Shauli T, Brandes N, Linial M NAR Genom Bioinform. 2021; 3(3):lqab079.

PMID: 34541526 PMC: 8445205. DOI: 10.1093/nargab/lqab079.


Stress-induced changes in the expression of antioxidant system genes for rice ( L.) and bread wheat ( L.).

Ermakov A, Bobrovskikh A, Zubairova U, Konstantinov D, Doroshkov A PeerJ. 2019; 7:e7791.

PMID: 31803533 PMC: 6886489. DOI: 10.7717/peerj.7791.


The evolution of gene regulatory networks controlling Arabidopsis thaliana L. trichome development.

Doroshkov A, Konstantinov D, Afonnikov D, Gunbin K BMC Plant Biol. 2019; 19(Suppl 1):53.

PMID: 30813891 PMC: 6393967. DOI: 10.1186/s12870-019-1640-2.


References
1.
Devauchelle C, Grossmann A, Henaut A, Holschneider M, Monnerot M, Risler J . Rate matrices for analyzing large families of protein sequences. J Comput Biol. 2001; 8(4):381-99. DOI: 10.1089/106652701752236205. View

2.
Yu Y, Wootton J, Altschul S . The compositional adjustment of amino acid substitution matrices. Proc Natl Acad Sci U S A. 2003; 100(26):15688-93. PMC: 307629. DOI: 10.1073/pnas.2533904100. View

3.
Yang Z, Kumar S, Nei M . A new method of inference of ancestral nucleotide and amino acid sequences. Genetics. 1995; 141(4):1641-50. PMC: 1206894. DOI: 10.1093/genetics/141.4.1641. View

4.
Goldman N, Whelan S . A novel use of equilibrium frequencies in models of sequence evolution. Mol Biol Evol. 2002; 19(11):1821-31. DOI: 10.1093/oxfordjournals.molbev.a004007. View

5.
Pearson W, Lipman D . Improved tools for biological sequence comparison. Proc Natl Acad Sci U S A. 1988; 85(8):2444-8. PMC: 280013. DOI: 10.1073/pnas.85.8.2444. View