
BuddySuite: Command-Line Toolkits for Manipulating Sequences, Alignments, and Phylogenetic Trees

Overview
Journal: Mol Biol Evol
Specialty: Biology
Date: 2017 Mar 24
PMID: 28333216
Citations: 5
Abstract

The ability to manipulate sequence, alignment, and phylogenetic tree files has become an increasingly important skill in the life sciences, whether to generate summary information or to prepare data for further downstream analysis. The command line can be an extremely powerful environment for interacting with these resources, but only if the user has the appropriate general-purpose tools on hand. BuddySuite is a collection of four independent yet interrelated command-line toolkits that facilitate each step in the workflow of sequence discovery, curation, alignment, and phylogenetic reconstruction. Most common sequence, alignment, and tree file formats are automatically detected and parsed, and over 100 tools have been implemented for manipulating these data. The project has been engineered to easily accommodate the addition of new tools, is written in the popular programming language Python, and is hosted on the Python Package Index and GitHub to maximize accessibility. Documentation for each BuddySuite tool, including usage examples, is available at http://tiny.cc/buddysuite_wiki. All software is open source and freely available through http://research.nhgri.nih.gov/software/BuddySuite.

Citing Articles

Phylogeny of section subsection (Saxifragaceae) and the origin of low elevation shade-dwelling species.

Gerschwitz-Eidt M, Dillenberger M, Kadereit J. Ecol Evol. 2023; 13(1):e9728.

PMID: 36636428 PMC: 9829489. DOI: 10.1002/ece3.9728.


Gotree/Goalign: toolkit and Go API to facilitate the development of phylogenetic workflows.

Lemoine F, Gascuel O. NAR Genom Bioinform. 2021; 3(3):lqab075.

PMID: 34396097 PMC: 8356961. DOI: 10.1093/nargab/lqab075.


Linking a Gene Cluster to Atranorin, a Major Cortical Substance of Lichens, through Genetic Dereplication and Heterologous Expression.

Kim W, Liu R, Woo S, Kang K, Park H, Yu Y. mBio. 2021; 12(3):e0111121.

PMID: 34154413 PMC: 8262933. DOI: 10.1128/mBio.01111-21.


PhySpeTree: an automated pipeline for reconstructing phylogenetic species trees.

Fang Y, Liu C, Lin J, Li X, Alavian K, Yang Y. BMC Evol Biol. 2019; 19(1):219.

PMID: 31791235 PMC: 6889546. DOI: 10.1186/s12862-019-1541-x.


Functional and phylogenetic characterization of noncanonical vitamin B12-binding proteins in zebrafish suggests involvement in cobalamin transport.

Benoit C, Stanton A, Tartanian A, Motzer A, McGaughey D, Bond S. J Biol Chem. 2018; 293(45):17606-17621.

PMID: 30237171 PMC: 6231144. DOI: 10.1074/jbc.RA118.005323.
