» Articles » PMID: 37056516

GPTree Cluster: Phylogenetic Tree Cluster Generator in the Context of Supertree Inference

Overview
Journal Bioinform Adv
Specialty Biology
Date 2023 Apr 14
PMID 37056516
Authors
Affiliations
Soon will be listed here.
Abstract

Summary: For many years, evolutionary and molecular biologists have been working with phylogenetic supertrees, which are oriented acyclic graph structures. In the standard approaches, supertrees are obtained by concatenating a set of phylogenetic trees defined on different but overlapping sets of taxa (i.e. species). More recent approaches propose alternative solutions for supertree inference. The testing of new metrics for comparing supertrees and adapting clustering algorithms to overlapping phylogenetic trees with different numbers of leaves requires large amounts of data. In this context, designing a new approach and developing a computer program to generate phylogenetic tree clusters with different numbers of overlapping leaves are key elements to advance research on phylogenetic supertrees and evolution. The main objective of the project is to propose a new approach to simulate clusters of phylogenetic trees defined on different, but mutually overlapping, sets of taxa, with biological events. The proposed generator can be used to generate a certain number of clusters of phylogenetic trees in Newick format with a variable number of leaves and with a defined level of overlap between trees in clusters.

Availability And Implementation: A Python script version 3.7, called GPTree Cluster, which implements the discussed approach, is freely available at: https://github.com/tahiri-lab/GPTree/tree/GPTreeCluster.

Citing Articles

Comparison of phylogenetic trees defined on different but mutually overlapping sets of taxa: A review.

Li W, Koshkarov A, Tahiri N Ecol Evol. 2024; 14(8):e70054.

PMID: 39119174 PMC: 11307105. DOI: 10.1002/ece3.70054.

References
1.
Louca S, Doebeli M . Efficient comparative phylogenetics on large trees. Bioinformatics. 2017; 34(6):1053-1055. DOI: 10.1093/bioinformatics/btx701. View

2.
Mallo D, de Oliveira Martins L, Posada D . SimPhy: Phylogenomic Simulation of Gene, Locus, and Species Trees. Syst Biol. 2015; 65(2):334-44. PMC: 4748750. DOI: 10.1093/sysbio/syv082. View

3.
Boc A, Philippe H, Makarenkov V . Inferring and validating horizontal gene transfer events using bipartition dissimilarity. Syst Biol. 2010; 59(2):195-211. DOI: 10.1093/sysbio/syp103. View

4.
Sjostrand J, Arvestad L, Lagergren J, Sennblad B . GenPhyloData: realistic simulation of gene family evolution. BMC Bioinformatics. 2013; 14:209. PMC: 3703295. DOI: 10.1186/1471-2105-14-209. View

5.
Wolfe J, Fournier G . Horizontal gene transfer constrains the timing of methanogen evolution. Nat Ecol Evol. 2018; 2(5):897-903. DOI: 10.1038/s41559-018-0513-7. View