
Polynomial Algorithms for the Maximal Pairing Problem: Efficient Phylogenetic Targeting on Arbitrary Trees

Overview
Publisher Biomed Central
Date 2010 Jun 8
PMID 20525185
Citations 1
Abstract

Background: The Maximal Pairing Problem (MPP) is the prototype of a class of combinatorial optimization problems that are of considerable interest in bioinformatics: given an arbitrary phylogenetic tree T and weights $\omega_{xy}$ for the paths between any two leaves x and y, which collection of edge-disjoint paths between pairs of leaves maximizes the total weight? Special cases of the MPP for binary trees and equal weights have been described previously; algorithms that solve the general MPP, however, are still missing.
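To make the problem statement concrete, the following is a minimal brute-force sketch in Python. It is not the algorithm of the paper: it simply enumerates every set of leaf pairs whose connecting paths share no edge, so it is only usable on very small trees. The tree encoding (a child-to-parent dictionary), the function names, and the example weights are all assumptions made for illustration.

```python
from itertools import combinations

def path_edges(parent, x, y):
    """Edges on the tree path between x and y.

    Each edge is identified by the pair (child, parent).
    """
    def edges_to_root(v):
        edges = set()
        while v in parent:          # the root has no parent entry
            edges.add((v, parent[v]))
            v = parent[v]
        return edges
    # Edges above the lowest common ancestor occur in both sets and cancel.
    return edges_to_root(x) ^ edges_to_root(y)

def brute_force_mpp(parent, leaves, weight):
    """Exhaustively search for a maximum-weight set of edge-disjoint leaf paths."""
    pairs = list(combinations(leaves, 2))
    best = (0, [])

    def extend(start, used, total, chosen):
        nonlocal best
        if total > best[0]:
            best = (total, list(chosen))
        for j in range(start, len(pairs)):
            x, y = pairs[j]
            edges = path_edges(parent, x, y)
            if used.isdisjoint(edges):   # keep the paths edge-disjoint
                chosen.append((x, y))
                extend(j + 1, used | edges,
                       total + weight[frozenset((x, y))], chosen)
                chosen.pop()

    extend(0, set(), 0, [])
    return best

# Hypothetical example: root r has children u, c, d; u has children a, b.
parent = {"a": "u", "b": "u", "u": "r", "c": "r", "d": "r"}
leaves = ["a", "b", "c", "d"]
weight = {frozenset(p): w for p, w in [
    (("a", "b"), 1), (("c", "d"), 1), (("a", "c"), 5),
    (("b", "d"), 5), (("a", "d"), 1), (("b", "c"), 1),
]}

best_weight, best_pairs = brute_force_mpp(parent, leaves, weight)
print(best_weight, best_pairs)
```

In this example the two heavy pairs (a, c) and (b, d) cannot be chosen together, because both paths would use the edge between u and r. The search therefore settles on a single heavy pair rather than the two light pairs (a, b) and (c, d).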

Results: We describe a relatively simple dynamic programming algorithm for the special case of binary trees. We then show that the general case of multifurcating trees can be treated by interleaving solutions to certain auxiliary Maximum Weighted Matching problems with an extension of this dynamic programming approach, resulting in an overall polynomial-time solution of complexity $O(n^4 \log n)$ with respect to the number n of leaves. The source code of a C implementation can be obtained under the GNU Public License from http://www.bioinf.uni-leipzig.de/Software/Targeting. For binary trees, we furthermore discuss several constrained variants of the MPP as well as a partition function approach to the probabilistic version of the MPP.
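For the binary case, one plausible way to organize such a dynamic program is sketched below in Python. This is an illustrative recursion under stated assumptions, not necessarily the formulation used in the paper or in its C implementation: the tree is assumed to be rooted and given as nested 2-tuples with string leaf labels, and the names `max_pairing_binary`, `closed`, and `open_` are hypothetical. For every subtree it records the best score when the edge above the subtree is unused, and the best score when the path starting at a given leaf leaves the subtree through that edge.

```python
def max_pairing_binary(tree, weight):
    """Best total weight of edge-disjoint leaf paths in a rooted binary tree.

    tree   : a leaf label (str) or a 2-tuple (left, right) of subtrees
    weight : function (x, y) -> weight of the path between leaves x and y
    """
    def solve(node):
        # Returns (closed, open_):
        #   closed   : best score inside the subtree if the edge above it is unused
        #   open_[x] : best score inside the subtree if the path starting at
        #              leaf x continues upward through the edge above it
        if isinstance(node, str):
            return 0.0, {node: 0.0}
        left, right = node
        closed_l, open_l = solve(left)
        closed_r, open_r = solve(right)

        # Case 1: no path passes through this node.
        closed = closed_l + closed_r
        # Case 2: a path from a left leaf x meets a path from a right leaf y here.
        for x, score_x in open_l.items():
            for y, score_y in open_r.items():
                closed = max(closed, score_x + score_y + weight(x, y))

        # Case 3: a path from one side continues upward; the other side is closed.
        open_ = {x: score_x + closed_r for x, score_x in open_l.items()}
        open_.update({y: score_y + closed_l for y, score_y in open_r.items()})
        return closed, open_

    return solve(tree)[0]

# Hypothetical example: the rooted binary tree ((a, b), (c, d)).
example_tree = (("a", "b"), ("c", "d"))
example_weights = {frozenset(p): w for p, w in [
    (("a", "b"), 1), (("c", "d"), 1), (("a", "c"), 5),
    (("b", "d"), 5), (("a", "d"), 1), (("b", "c"), 1),
]}

def w(x, y):
    return example_weights[frozenset((x, y))]

result = max_pairing_binary(example_tree, w)
print(result)
```

Every pair of leaves is examined exactly once, at the node where their two subtrees meet, so this sketch does a quadratic amount of work in the number of leaves. At a multifurcating vertex more than one path can pass through, which is where the auxiliary matching problems mentioned above come in; the sketch does not cover that case.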

Conclusions: The algorithms introduced here make it possible to solve the MPP even for large trees with high-degree vertices. This has practical relevance in the field of comparative phylogenetics and, for example, in the context of phylogenetic targeting, i.e., data collection with resource limitations.

Citing Articles

Molecular function limits divergent protein evolution on planetary timescales.

Konate M, Plata G, Park J, Usmanova D, Wang H, Vitkup D Elife. 2019; 8.

PMID: 31532392 PMC: 6750897. DOI: 10.7554/eLife.39705.
