ASTRAL-III: Polynomial Time Species Tree Reconstruction from Partially Resolved Gene Trees
Overview
Affiliations
Background: Evolutionary histories can be discordant across the genome, and such discordances need to be considered in reconstructing the species phylogeny. ASTRAL is one of the leading methods for inferring species trees from gene trees while accounting for gene tree discordance. ASTRAL uses dynamic programming to search for the tree that shares the maximum number of quartet topologies with input gene trees, restricting itself to a predefined set of bipartitions.
Results: We introduce ASTRAL-III, which substantially improves the running time of ASTRAL-II and guarantees polynomial running time as a function of both the number of species (n) and the number of genes (k). ASTRAL-III limits the bipartition constraint set (X) to grow at most linearly with n and k. Moreover, it handles polytomies more efficiently than ASTRAL-II, exploits similarities between gene trees better, and uses several techniques to avoid searching parts of the search space that are mathematically guaranteed not to include the optimal tree. The asymptotic running time of ASTRAL-III in the presence of polytomies is [Formula: see text] where D=O(nk) is the sum of degrees of all unique nodes in input trees. The running time improvements enable us to test whether contracting low support branches in gene trees improves the accuracy by reducing noise. In extensive simulations, we show that removing branches with very low support (e.g., below 10%) improves accuracy while overly aggressive filtering is harmful. We observe on a biological avian phylogenomic dataset of 14K genes that contracting low support branches greatly improve results.
Conclusions: ASTRAL-III is a faster version of the ASTRAL method for phylogenetic reconstruction and can scale up to 10,000 species. With ASTRAL-III, low support branches can be removed, resulting in improved accuracy.
Shi T, He J Front Plant Sci. 2025; 16:1511582.
PMID: 40065784 PMC: 11891173. DOI: 10.3389/fpls.2025.1511582.
Phylogenetic analysis of Asiatic species in the tropical genus Beilschmiedia (Lauraceae).
Zhu W, Ma J, Tan Y, Song Y, Xin P BMC Genomics. 2025; 26(1):226.
PMID: 40057694 PMC: 11889841. DOI: 10.1186/s12864-025-11354-x.
Comparative genomics and phylogenetic analysis of mitochondrial genomes of Neocinnamomum.
Zhu W, Zhang D, Xu W, Gan Y, Huang J, Liu Y BMC Plant Biol. 2025; 25(1):289.
PMID: 40045193 PMC: 11883965. DOI: 10.1186/s12870-025-06238-x.
Comparative analyses of chloroplast genomes of Theobroma cacao from northern Peru.
Tineo D, Bustamante D, Calderon M, Oliva M PLoS One. 2025; 20(3):e0316148.
PMID: 40043011 PMC: 11882073. DOI: 10.1371/journal.pone.0316148.
Xu L, Song Z, Li T, Jin Z, Zhang B, Du S Plant Divers. 2025; 47(1):21-33.
PMID: 40041562 PMC: 11873585. DOI: 10.1016/j.pld.2024.10.003.