» Articles » PMID: 28437136

Enumeration of Ancestral Configurations for Matching Gene Trees and Species Trees

Overview
Journal J Comput Biol
Date 2017 Apr 25
PMID 28437136
Citations 7
Authors
Affiliations
Soon will be listed here.
Abstract

Given a gene tree and a species tree, ancestral configurations represent the combinatorially distinct sets of gene lineages that can reach a given node of the species tree. They have been introduced as a data structure for use in the recursive computation of the conditional probability under the multispecies coalescent model of a gene tree topology given a species tree, the cost of this computation being affected by the number of ancestral configurations of the gene tree in the species tree. For matching gene trees and species trees, we obtain enumerative results on ancestral configurations. We study ancestral configurations in balanced and unbalanced families of trees determined by a given seed tree, showing that for seed trees with more than one taxon, the number of ancestral configurations increases for both families exponentially in the number of taxa n. For fixed n, the maximal number of ancestral configurations tabulated at the species tree root node and the largest number of labeled histories possible for a labeled topology occur for trees with precisely the same unlabeled shape. For ancestral configurations at the root, the maximum increases with [Formula: see text], where [Formula: see text] is a quadratic recurrence constant. Under a uniform distribution over the set of labeled trees of given size, the mean number of root ancestral configurations grows with [Formula: see text] and the variance with ∼[Formula: see text]. The results provide a contribution to the combinatorial study of gene trees and species trees.

Citing Articles

A lattice structure for ancestral configurations arising from the relationship between gene trees and species trees.

Lappo E, Rosenberg N Discrete Appl Math. 2023; 343:65-81.

PMID: 38078045 PMC: 10704929. DOI: 10.1016/j.dam.2023.09.033.


Enumeration of binary trees compatible with a perfect phylogeny.

Palacios J, Bhaskar A, Disanto F, Rosenberg N J Math Biol. 2022; 84(6):54.

PMID: 35552538 PMC: 9098623. DOI: 10.1007/s00285-022-01748-w.


On the Colijn-Plazzotta numbering scheme for unlabeled binary rooted trees.

Rosenberg N Discrete Appl Math. 2020; 291:88-98.

PMID: 33364668 PMC: 7751944. DOI: 10.1016/j.dam.2020.11.021.


Counting and sampling gene family evolutionary histories in the duplication-loss and duplication-loss-transfer models.

Chauve C, Ponty Y, Wallner M J Math Biol. 2020; 80(5):1353-1388.

PMID: 32060618 PMC: 7052048. DOI: 10.1007/s00285-019-01465-x.


Enumeration of compact coalescent histories for matching gene trees and species trees.

Disanto F, Rosenberg N J Math Biol. 2018; 78(1-2):155-188.

PMID: 30116881 PMC: 7661175. DOI: 10.1007/s00285-018-1271-5.


References
1.
Steel M, Mckenzie A . Properties of phylogenetic trees generated by Yule-type speciation models. Math Biosci. 2001; 170(1):91-112. DOI: 10.1016/s0025-5564(00)00061-4. View

2.
Disanto F, Wiehe T . Exact enumeration of cherries and pitchforks in ranked trees under the coalescent model. Math Biosci. 2013; 242(2):195-200. DOI: 10.1016/j.mbs.2013.01.010. View

3.
Than C, Nakhleh L . Species tree inference by minimizing deep coalescences. PLoS Comput Biol. 2009; 5(9):e1000501. PMC: 2729383. DOI: 10.1371/journal.pcbi.1000501. View

4.
Degnan J, Salter L . Gene tree distributions under the coalescent process. Evolution. 2005; 59(1):24-37. View

5.
Rosenberg N, Degnan J . Coalescent histories for discordant gene trees and species trees. Theor Popul Biol. 2010; 77(3):145-51. DOI: 10.1016/j.tpb.2009.12.004. View