» Articles » PMID: 12679548

Heterogeneity of Nucleotide Frequencies Among Evolutionary Lineages and Phylogenetic Inference

Overview
Journal Mol Biol Evol
Specialty Biology
Date 2003 Apr 8
PMID 12679548
Citations 24
Authors
Affiliations
Soon will be listed here.
Abstract

A major assumption of many molecular phylogenetic methods is the homogeneity of nucleotide frequencies among taxa, which refers to the equality of the nucleotide frequency bias among species. Changes in nucleotide frequency among different lineages in a data set are thought to lead to erroneous phylogenetic inference because unrelated clades may appear similar because of evolutionarily unrelated similarities in nucleotide frequencies. We tested the effects of the heterogeneity of nucleotide frequency bias on phylogenetic inference, along with the interaction between this heterogeneity and stratified taxon sampling, by means of computer simulations using evolutionary parameters derived from genomic databases. We found that the phylogenetic trees inferred from data sets simulated under realistic, observed levels of heterogeneity for mammalian genes were reconstructed with accuracy comparable to those simulated with homogeneous nucleotide frequencies; the results hold for Neighbor-Joining, minimum evolution, maximum parsimony, and maximum-likelihood methods. The LogDet distance method, specifically designed to deal with heterogeneous nucleotide frequencies, does not perform better than distance methods that assume substitution pattern homogeneity among sequences. In these specific simulation conditions, we did not find a significant interaction between phylogenetic accuracy and substitution pattern heterogeneity among lineages, even when the taxon sampling is increased.

Citing Articles

Salmonidae Genome: Features, Evolutionary and Phylogenetic Characteristics.

Dysin A, Shcherbakov Y, Nikolaeva O, Terletskii V, Tyshchenko V, Dementieva N Genes (Basel). 2022; 13(12).

PMID: 36553488 PMC: 9778375. DOI: 10.3390/genes13122221.


Fast and accurate bootstrap confidence limits on genome-scale phylogenies using little bootstraps.

Sharma S, Kumar S Nat Comput Sci. 2021; 1(9):573-577.

PMID: 34734192 PMC: 8560003. DOI: 10.1038/s43588-021-00129-5.


Using a GTR+Γ substitution model for dating sequence divergence when stationarity and time-reversibility assumptions are violated.

Barba-Montoya J, Tao Q, Kumar S Bioinformatics. 2020; 36(Suppl_2):i884-i894.

PMID: 33381826 PMC: 7773479. DOI: 10.1093/bioinformatics/btaa820.


Molecular dating for phylogenies containing a mix of populations and species by using Bayesian and RelTime approaches.

Mello B, Tao Q, Barba-Montoya J, Kumar S Mol Ecol Resour. 2020; 21(1):122-136.

PMID: 32881388 PMC: 8152102. DOI: 10.1111/1755-0998.13249.


Quantifying the Error of Secondary vs. Distant Primary Calibrations in a Simulated Environment.

Powell C, Waskin S, Battistuzzi F Front Genet. 2020; 11:252.

PMID: 32265987 PMC: 7099002. DOI: 10.3389/fgene.2020.00252.