» Articles » PMID: 37081138

Inference of Phylogenetic Trees Directly from Raw Sequencing Reads Using Read2Tree

Overview
Journal Nat Biotechnol
Specialty Biotechnology
Date 2023 Apr 20
PMID 37081138
Authors
Affiliations
Soon will be listed here.
Abstract

Current methods for inference of phylogenetic trees require running complex pipelines at substantial computational and labor costs, with additional constraints in sequencing coverage, assembly and annotation quality, especially for large datasets. To overcome these challenges, we present Read2Tree, which directly processes raw sequencing reads into groups of corresponding genes and bypasses traditional steps in phylogeny inference, such as genome assembly, annotation and all-versus-all sequence comparisons, while retaining accuracy. In a benchmark encompassing a broad variety of datasets, Read2Tree is 10-100 times faster than assembly-based approaches and in most cases more accurate-the exception being when sequencing coverage is high and reference species very distant. Here, to illustrate the broad applicability of the tool, we reconstruct a yeast tree of life of 435 species spanning 590 million years of evolution. We also apply Read2Tree to >10,000 Coronaviridae samples, accurately classifying highly diverse animal samples and near-identical severe acute respiratory syndrome coronavirus 2 sequences on a single tree. The speed, accuracy and versatility of Read2Tree enable comparative genomics at scale.

Citing Articles

EvANI benchmarking workflow for evolutionary distance estimation.

Majidian S, Hwang S, Zakeri M, Langmead B bioRxiv. 2025; .

PMID: 40027788 PMC: 11870633. DOI: 10.1101/2025.02.23.639716.


WASTER: Practical phylogenomics from low-coverage short reads.

Zhang C, Nielsen R bioRxiv. 2025; .

PMID: 39896589 PMC: 11785061. DOI: 10.1101/2025.01.20.633983.


Orthology inference at scale with FastOMA.

Majidian S, Nevers Y, Yazdizadeh Kharrazi A, Warwick Vesztrocy A, Pascarelli S, Moi D Nat Methods. 2025; 22(2):269-272.

PMID: 39753922 PMC: 11810774. DOI: 10.1038/s41592-024-02552-8.


Subfamily evolution analysis using nuclear and chloroplast data from the same reads.

Witharana E, Iwasaki T, San M, Jayawardana N, Kotoda N, Yamamoto M Sci Rep. 2025; 15(1):687.

PMID: 39753617 PMC: 11698846. DOI: 10.1038/s41598-024-83292-9.


Multiple Horizontal Mini-chromosome Transfers Drive Genome Evolution of Clonal Blast Fungus Lineages.

Barragan A, Latorre S, Malmgren A, Harant A, Win J, Sugihara Y Mol Biol Evol. 2024; 41(8).

PMID: 39107250 PMC: 11346369. DOI: 10.1093/molbev/msae164.


References
1.
Woese C, Fox G . Phylogenetic structure of the prokaryotic domain: the primary kingdoms. Proc Natl Acad Sci U S A. 1977; 74(11):5088-90. PMC: 432104. DOI: 10.1073/pnas.74.11.5088. View

2.
Ciccarelli F, Doerks T, von Mering C, Creevey C, Snel B, Bork P . Toward automatic reconstruction of a highly resolved tree of life. Science. 2006; 311(5765):1283-7. DOI: 10.1126/science.1123061. View

3.
Williams T, Foster P, Cox C, Embley T . An archaeal origin of eukaryotes supports only two primary domains of life. Nature. 2013; 504(7479):231-6. DOI: 10.1038/nature12779. View

4.
Hug L, Baker B, Anantharaman K, Brown C, Probst A, Castelle C . A new view of the tree of life. Nat Microbiol. 2016; 1:16048. DOI: 10.1038/nmicrobiol.2016.48. View

5.
Abbosh C, Birkbak N, Wilson G, Jamal-Hanjani M, Constantin T, Salari R . Phylogenetic ctDNA analysis depicts early-stage lung cancer evolution. Nature. 2017; 545(7655):446-451. PMC: 5812436. DOI: 10.1038/nature22364. View