» Articles » PMID: 29745866

ASTRAL-III: Polynomial Time Species Tree Reconstruction from Partially Resolved Gene Trees

Overview
Publisher Biomed Central
Specialty Biology
Date 2018 May 11
PMID 29745866
Citations 664
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Evolutionary histories can be discordant across the genome, and such discordances need to be considered in reconstructing the species phylogeny. ASTRAL is one of the leading methods for inferring species trees from gene trees while accounting for gene tree discordance. ASTRAL uses dynamic programming to search for the tree that shares the maximum number of quartet topologies with input gene trees, restricting itself to a predefined set of bipartitions.

Results: We introduce ASTRAL-III, which substantially improves the running time of ASTRAL-II and guarantees polynomial running time as a function of both the number of species (n) and the number of genes (k). ASTRAL-III limits the bipartition constraint set (X) to grow at most linearly with n and k. Moreover, it handles polytomies more efficiently than ASTRAL-II, exploits similarities between gene trees better, and uses several techniques to avoid searching parts of the search space that are mathematically guaranteed not to include the optimal tree. The asymptotic running time of ASTRAL-III in the presence of polytomies is [Formula: see text] where D=O(nk) is the sum of degrees of all unique nodes in input trees. The running time improvements enable us to test whether contracting low support branches in gene trees improves the accuracy by reducing noise. In extensive simulations, we show that removing branches with very low support (e.g., below 10%) improves accuracy while overly aggressive filtering is harmful. We observe on a biological avian phylogenomic dataset of 14K genes that contracting low support branches greatly improve results.

Conclusions: ASTRAL-III is a faster version of the ASTRAL method for phylogenetic reconstruction and can scale up to 10,000 species. With ASTRAL-III, low support branches can be removed, resulting in improved accuracy.

Citing Articles

Resolving phylogenetic conflicts in Pandanales: the dual roles of gene flow and whole-genome duplication.

Shi T, He J Front Plant Sci. 2025; 16:1511582.

PMID: 40065784 PMC: 11891173. DOI: 10.3389/fpls.2025.1511582.


Phylogenetic analysis of Asiatic species in the tropical genus Beilschmiedia (Lauraceae).

Zhu W, Ma J, Tan Y, Song Y, Xin P BMC Genomics. 2025; 26(1):226.

PMID: 40057694 PMC: 11889841. DOI: 10.1186/s12864-025-11354-x.


Comparative genomics and phylogenetic analysis of mitochondrial genomes of Neocinnamomum.

Zhu W, Zhang D, Xu W, Gan Y, Huang J, Liu Y BMC Plant Biol. 2025; 25(1):289.

PMID: 40045193 PMC: 11883965. DOI: 10.1186/s12870-025-06238-x.


Comparative analyses of chloroplast genomes of Theobroma cacao from northern Peru.

Tineo D, Bustamante D, Calderon M, Oliva M PLoS One. 2025; 20(3):e0316148.

PMID: 40043011 PMC: 11882073. DOI: 10.1371/journal.pone.0316148.


New insights into the phylogeny and infrageneric taxonomy of based on hybrid capture phylogenomics (Hyb-Seq).

Xu L, Song Z, Li T, Jin Z, Zhang B, Du S Plant Divers. 2025; 47(1):21-33.

PMID: 40041562 PMC: 11873585. DOI: 10.1016/j.pld.2024.10.003.


References
1.
Song S, Liu L, Edwards S, Wu S . Resolving conflict in eutherian mammal phylogeny using phylogenomics and the multispecies coalescent model. Proc Natl Acad Sci U S A. 2012; 109(37):14942-7. PMC: 3443116. DOI: 10.1073/pnas.1211733109. View

2.
Meiklejohn K, Faircloth B, Glenn T, Kimball R, Braun E . Analysis of a Rapid Evolutionary Radiation Using Ultraconserved Elements: Evidence for a Bias in Some Multispecies Coalescent Methods. Syst Biol. 2016; 65(4):612-27. DOI: 10.1093/sysbio/syw014. View

3.
Liu L, Yu L, Pearl D, Edwards S . Estimating species phylogenies using coalescence times among sequences. Syst Biol. 2010; 58(5):468-77. DOI: 10.1093/sysbio/syp031. View

4.
Yu Y, Warnow T, Nakhleh L . Algorithms for MDC-based multi-locus phylogeny inference: beyond rooted binary gene trees on single alleles. J Comput Biol. 2011; 18(11):1543-59. PMC: 3216099. DOI: 10.1089/cmb.2011.0174. View

5.
Gatesy J, Springer M . Phylogenetic analysis at deep timescales: unreliable gene trees, bypassed hidden support, and the coalescence/concatalescence conundrum. Mol Phylogenet Evol. 2014; 80:231-66. DOI: 10.1016/j.ympev.2014.08.013. View