» Articles » PMID: 19377059

FastTree: Computing Large Minimum Evolution Trees with Profiles Instead of a Distance Matrix

Overview
Journal Mol Biol Evol
Specialty Biology
Date 2009 Apr 21
PMID 19377059
Citations 2363
Authors
Affiliations
Soon will be listed here.
Abstract

Gene families are growing rapidly, but standard methods for inferring phylogenies do not scale to alignments with over 10,000 sequences. We present FastTree, a method for constructing large phylogenies and for estimating their reliability. Instead of storing a distance matrix, FastTree stores sequence profiles of internal nodes in the tree. FastTree uses these profiles to implement Neighbor-Joining and uses heuristics to quickly identify candidate joins. FastTree then uses nearest neighbor interchanges to reduce the length of the tree. For an alignment with N sequences, L sites, and a different characters, a distance matrix requires O(N(2)) space and O(N(2)L) time, but FastTree requires just O(NLa + N ) memory and O(N log (N)La) time. To estimate the tree's reliability, FastTree uses local bootstrapping, which gives another 100-fold speedup over a distance matrix. For example, FastTree computed a tree and support values for 158,022 distinct 16S ribosomal RNAs in 17 h and 2.4 GB of memory. Just computing pairwise Jukes-Cantor distances and storing them, without inferring a tree or bootstrapping, would require 17 h and 50 GB of memory. In simulations, FastTree was slightly more accurate than Neighbor-Joining, BIONJ, or FastME; on genuine alignments, FastTree's topologies had higher likelihoods. FastTree is available at http://microbesonline.org/fasttree.

Citing Articles

High prevalence of carbapenem-resistant and identification of a novel VIM-type metallo-β-lactamase, VIM-92, in clinical isolates from northern China.

Zhao L, Pu J, Liu Y, Cai H, Han M, Yu Y Front Microbiol. 2025; 16:1543509.

PMID: 40078538 PMC: 11897005. DOI: 10.3389/fmicb.2025.1543509.


Characterization of a novel lytic phage vB_AbaM_AB4P2 encoding depolymerase and its application in eliminating biofilms formed by Acinetobacter baumannii.

Su J, Tan Y, Liu S, Zou H, Huang X, Chen S BMC Microbiol. 2025; 25(1):123.

PMID: 40057696 PMC: 11889872. DOI: 10.1186/s12866-025-03854-3.


The universal accumulation of p-aminophenol during the microbial degradation of analgesic and antipyretic acetaminophen in WWTPs: a novel metagenomic perspective.

Yin C, Pan P, Li T, Song X, Xu Y, Zhou N Microbiome. 2025; 13(1):68.

PMID: 40055835 PMC: 11887370. DOI: 10.1186/s40168-025-02065-2.


Terrestrial-aquatic connectivity structures microbial communities during the formation of thermokarst lakes.

Leroy M, Burnett M, Laurion I, Douglas P, Kallenbach C, Comte J ISME Commun. 2025; 5(1):ycaf027.

PMID: 40041706 PMC: 11879182. DOI: 10.1093/ismeco/ycaf027.


Genomically defined hypervirulent Klebsiella pneumoniae contributed to early-onset increased mortality.

Tang Y, Du P, Du C, Yang P, Shen N, Russo T Nat Commun. 2025; 16(1):2096.

PMID: 40025046 PMC: 11873152. DOI: 10.1038/s41467-025-57379-4.


References
1.
Henikoff S, Henikoff J . Amino acid substitution matrices from protein blocks. Proc Natl Acad Sci U S A. 1992; 89(22):10915-9. PMC: 50453. DOI: 10.1073/pnas.89.22.10915. View

2.
Muller T, Rahmann S, Dandekar T, Wolf M . Accurate and robust phylogeny estimation based on profile distances: a study of the Chlorophyceae (Chlorophyta). BMC Evol Biol. 2004; 4:20. PMC: 449703. DOI: 10.1186/1471-2148-4-20. View

3.
Engelhardt B, Jordan M, Muratore K, Brenner S . Protein molecular function prediction by Bayesian phylogenomics. PLoS Comput Biol. 2005; 1(5):e45. PMC: 1246806. DOI: 10.1371/journal.pcbi.0010045. View

4.
Studier J, Keppler K . A note on the neighbor-joining algorithm of Saitou and Nei. Mol Biol Evol. 1988; 5(6):729-31. DOI: 10.1093/oxfordjournals.molbev.a040527. View

5.
Saitou N, Nei M . The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol. 1987; 4(4):406-25. DOI: 10.1093/oxfordjournals.molbev.a040454. View