A Metric for Phylogenetic Trees Based on Matching

Overview
Specialty: Biology
Date: 2011 Dec 21
PMID: 22184263
Citations: 30
Abstract

Comparing two or more phylogenetic trees is a fundamental task in computational biology. The simplest outcome of such a comparison is a pairwise measure of similarity, dissimilarity, or distance. Many such measures have been proposed, but all of them suffer from problems ranging from computational cost to lack of robustness, and many can be shown to behave unexpectedly on certain plausible inputs. For instance, the widely used Robinson-Foulds distance is poorly distributed and thus affords little discrimination. It also lacks robustness in the face of very small changes: reattaching a single leaf elsewhere in a tree of any size can instantly maximize the distance. In this paper, we introduce a new pairwise distance measure for phylogenetic trees, based on matching. We prove that our measure induces a metric on the space of trees, show how to compute it in low polynomial time, verify through statistical testing that it is robust, and note that it does not exhibit unexpected behavior on the inputs that cause problems for other measures. We also illustrate its usefulness in clustering trees, demonstrating significant improvements in the quality of hierarchical clustering over the same collections of trees clustered using the Robinson-Foulds distance.
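
The robustness problem described in the abstract can be reproduced on a small example. The Python sketch below is illustrative only and is not the authors' implementation. It represents trees as nested tuples (an assumed encoding), computes the Robinson-Foulds distance as the number of bipartitions found in exactly one tree, and shows that moving one leaf of a caterpillar tree to the opposite end yields the maximum value 2(n - 3). It also sketches a matching-style distance. The cost between two bipartitions (the number of leaves that would have to switch sides), the padding with an empty bipartition, and all function names are assumptions made for illustration, since the abstract does not give the exact definition. A brute-force search over permutations stands in for a polynomial-time assignment algorithm, so it is only suitable for very small trees.

```python
from itertools import permutations

def clusters(node, acc):
    """Append the leaf set below every internal node to acc; return this node's leaf set."""
    if not isinstance(node, tuple):
        return frozenset([node])
    below = frozenset().union(*(clusters(child, acc) for child in node))
    acc.append(below)
    return below

def splits(tree):
    """Nontrivial bipartitions of the unrooted tree, each stored as the side
    that does not contain the smallest leaf label."""
    acc = []
    leaves = clusters(tree, acc)
    ref = min(leaves)
    result = set()
    for cluster in acc:
        side = leaves - cluster if ref in cluster else cluster
        if 2 <= len(side) <= len(leaves) - 2:
            result.add(side)
    return result

def rf_distance(t1, t2):
    """Robinson-Foulds distance: bipartitions present in exactly one tree."""
    return len(splits(t1) ^ splits(t2))

def split_cost(a, b, leaves):
    """Assumed cost: fewest leaves that must change sides to turn one split into the other."""
    return min(len(a ^ b), len(a ^ (leaves - b)))

def matching_distance(t1, t2):
    """Minimum-cost perfect matching between the two split sets (brute force).
    The smaller set is padded with empty splits so both sides have equal size."""
    leaves = clusters(t1, [])
    s1, s2 = list(splits(t1)), list(splits(t2))
    size = max(len(s1), len(s2))
    s1 += [frozenset()] * (size - len(s1))
    s2 += [frozenset()] * (size - len(s2))
    return min(
        sum(split_cost(a, b, leaves) for a, b in zip(s1, perm))
        for perm in permutations(s2)
    )

def caterpillar(order):
    """Build a caterpillar tree whose leaves appear in the given order."""
    tree = (order[0], order[1])
    for leaf in order[2:]:
        tree = (tree, leaf)
    return tree

n = 8
original = caterpillar(list(range(1, n + 1)))
moved = caterpillar(list(range(2, n + 1)) + [1])  # leaf 1 reattached at the far end

print(rf_distance(original, moved), 2 * (n - 3))  # both values are 10: RF is maximal
print(matching_distance(original, moved))
```

In this sketch a single leaf move leaves the two trees with no bipartition in common, so the Robinson-Foulds distance cannot distinguish this pair from two unrelated trees, whereas a matching-style cost still registers how close the individual bipartitions are.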
