» Articles » PMID: 37243541

"Correcting" Gene Trees to Be More Like Species Trees Frequently Increases Topological Error

Overview
Date 2023 May 27
PMID 37243541
Authors
Affiliations
Soon will be listed here.
Abstract

The evolutionary histories of individual loci in a genome can be estimated independently, but this approach is error-prone due to the limited amount of sequence data available for each gene, which has led to the development of a diverse array of gene tree error correction methods which reduce the distance to the species tree. We investigate the performance of two representatives of these methods: TRACTION and TreeFix. We found that gene tree error correction frequently increases the level of error in gene tree topologies by "correcting" them to be closer to the species tree, even when the true gene and species trees are discordant. We confirm that full Bayesian inference of the gene trees under the multispecies coalescent model is more accurate than independent inference. Future gene tree correction approaches and methods should incorporate an adequately realistic model of evolution instead of relying on oversimplified heuristics.

References
1.
Xu L, Chen H, Hu X, Zhang R, Zhang Z, Luo Z . Average gene length is highly conserved in prokaryotes and eukaryotes and diverges only between the two kingdoms. Mol Biol Evol. 2006; 23(6):1107-8. DOI: 10.1093/molbev/msk019. View

2.
McDonald M, McGinness L, Hane J, Williams A, Milgate A, Solomon P . Utilizing Gene Tree Variation to Identify Candidate Effector Genes in Zymoseptoria tritici. G3 (Bethesda). 2016; 6(4):779-91. PMC: 4825649. DOI: 10.1534/g3.115.025197. View

3.
Christensen S, Molloy E, Vachaspati P, Yammanuru A, Warnow T . Non-parametric correction of estimated gene trees using TRACTION. Algorithms Mol Biol. 2020; 15:1. PMC: 6942343. DOI: 10.1186/s13015-019-0161-8. View

4.
Sjostrand J, Sennblad B, Arvestad L, Lagergren J . DLRS: gene tree evolution in light of a species tree. Bioinformatics. 2012; 28(22):2994-5. DOI: 10.1093/bioinformatics/bts548. View

5.
Morel B, Kozlov A, Stamatakis A, Szollosi G . GeneRax: A Tool for Species-Tree-Aware Maximum Likelihood-Based Gene  Family Tree Inference under Gene Duplication, Transfer, and Loss. Mol Biol Evol. 2020; 37(9):2763-2774. PMC: 8312565. DOI: 10.1093/molbev/msaa141. View