» Articles » PMID: 22909249

Bio.Phylo: a Unified Toolkit for Processing, Analyzing and Visualizing Phylogenetic Trees in Biopython

Overview
Publisher Biomed Central
Specialty Biology
Date 2012 Aug 23
PMID 22909249
Citations 64
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Ongoing innovation in phylogenetics and evolutionary biology has been accompanied by a proliferation of software tools, data formats, analytical techniques and web servers. This brings with it the challenge of integrating phylogenetic and other related biological data found in a wide variety of formats, and underlines the need for reusable software that can read, manipulate and transform this information into the various forms required to build computational pipelines.

Results: We built a Python software library for working with phylogenetic data that is tightly integrated with Biopython, a broad-ranging toolkit for computational biology. Our library, Bio.Phylo, is highly interoperable with existing libraries, tools and standards, and is capable of parsing common file formats for phylogenetic trees, performing basic transformations and manipulations, attaching rich annotations, and visualizing trees. We unified the modules for working with the standard file formats Newick, NEXUS and phyloXML behind a consistent and simple API, providing a common set of functionality independent of the data source.

Conclusions: Bio.Phylo meets a growing need in bioinformatics for working with heterogeneous types of phylogenetic data. By supporting interoperability with multiple file formats and leveraging existing Biopython features, this library simplifies the construction of phylogenetic workflows. We also provide examples of the benefits of building a community around a shared open-source project. Bio.Phylo is included with Biopython, available through the Biopython website, http://biopython.org.

Citing Articles

Structural variation, selection, and diversification of the gene family from the human pangenome.

Dishuck P, Munson K, Lewis A, Dougherty M, Underwood J, Harvey W bioRxiv. 2025; .

PMID: 39975192 PMC: 11838601. DOI: 10.1101/2025.02.04.636496.


Simplifying and Characterizing DAGs and Phylogenetic Networks via Least Common Ancestor Constraints.

Lindeberg A, Hellmuth M Bull Math Biol. 2025; 87(3):44.

PMID: 39937386 PMC: 11821803. DOI: 10.1007/s11538-025-01419-z.


Unravelling genomic drivers of speciation in Musa through genome assemblies of wild banana ancestors.

Martin G, Istace B, Baurens F, Belser C, Hervouet C, Labadie K Nat Commun. 2025; 16(1):961.

PMID: 39843949 PMC: 11754795. DOI: 10.1038/s41467-025-56329-4.


OGU: A Toolbox for Better Utilising Organelle Genomic Data.

Wu P, Xue N, Yang J, Zhang Q, Sun Y, Zhang W Mol Ecol Resour. 2024; 25(3):e14044.

PMID: 39523951 PMC: 11887606. DOI: 10.1111/1755-0998.14044.


Evolution shapes and conserves genomic signatures in viruses.

Holmudden M, Gustafsson J, Bertrand Y, Schliep A, Norberg P Commun Biol. 2024; 7(1):1412.

PMID: 39478059 PMC: 11526014. DOI: 10.1038/s42003-024-07098-1.


References
1.
Sukumaran J, Holder M . DendroPy: a Python library for phylogenetic computing. Bioinformatics. 2010; 26(12):1569-71. DOI: 10.1093/bioinformatics/btq228. View

2.
Zmasek C, Zhang Q, Ye Y, Godzik A . Surprising complexity of the ancestral apoptosis network. Genome Biol. 2007; 8(10):R226. PMC: 2246300. DOI: 10.1186/gb-2007-8-10-r226. View

3.
Cock P, Antao T, Chang J, Chapman B, Cox C, Dalke A . Biopython: freely available Python tools for computational molecular biology and bioinformatics. Bioinformatics. 2009; 25(11):1422-3. PMC: 2682512. DOI: 10.1093/bioinformatics/btp163. View

4.
Huerta-Cepas J, Dopazo J, Gabaldon T . ETE: a python Environment for Tree Exploration. BMC Bioinformatics. 2010; 11:24. PMC: 2820433. DOI: 10.1186/1471-2105-11-24. View

5.
Rice P, Longden I, Bleasby A . EMBOSS: the European Molecular Biology Open Software Suite. Trends Genet. 2000; 16(6):276-7. DOI: 10.1016/s0168-9525(00)02024-2. View