» Articles » PMID: 36304302

Metagenomic Analysis Using Phylogenetic Placement-A Review of the First Decade

Overview
Journal Front Bioinform
Specialty Biology
Date 2022 Oct 28
PMID 36304302
Authors
Affiliations
Soon will be listed here.
Abstract

Phylogenetic placement refers to a family of tools and methods to analyze, visualize, and interpret the tsunami of metagenomic sequencing data generated by high-throughput sequencing. Compared to alternative (e. g., similarity-based) methods, it puts metabarcoding sequences into a phylogenetic context using a set of known reference sequences and taking evolutionary history into account. Thereby, one can increase the accuracy of metagenomic surveys and eliminate the requirement for having exact or close matches with existing sequence databases. Phylogenetic placement constitutes a valuable analysis tool , but also entails a plethora of downstream tools to interpret its results. A common use case is to analyze species communities obtained from metagenomic sequencing, for example via taxonomic assignment, diversity quantification, sample comparison, and identification of correlations with environmental variables. In this review, we provide an overview over the methods developed during the first 10 years. In particular, the goals of this review are 1) to motivate the usage of phylogenetic placement and illustrate some of its use cases, 2) to outline the full workflow, from raw sequences to publishable figures, including best practices, 3) to introduce the most common tools and methods and their capabilities, 4) to point out common placement pitfalls and misconceptions, 5) to showcase typical placement-based analyses, and how they can help to analyze, visualize, and interpret phylogenetic placement data.

Citing Articles

Scalable method for exploring phylogenetic placement uncertainty with custom visualizations using and .

Chen M, Luo X, Xu S, Li L, Li J, Xie Z Imeta. 2025; 4(1):e269.

PMID: 40027482 PMC: 11865327. DOI: 10.1002/imt2.269.


Exploring microbial players for metagenomic profiling of carbon cycling bacteria in sundarban mangrove soils.

Das B, Gadnayak A, Chakraborty H, Pradhan S, Raut S, Das S Sci Rep. 2025; 15(1):4784.

PMID: 39922935 PMC: 11807184. DOI: 10.1038/s41598-025-89418-x.


Read Length Dominates Phylogenetic Placement Accuracy of Ancient DNA Reads.

Bettisworth B, Psonis N, Poulakakis N, Pavlidis P, Stamatakis A Mol Biol Evol. 2025; 42(2).

PMID: 39823473 PMC: 11839404. DOI: 10.1093/molbev/msaf006.


Testing Phylogenetic Placement Accuracy of DNA Barcode Sequences on a Fish Backbone Tree: Implications of Backbone Tree Completeness and Species Representation.

Fernando M, Fu J, Adamowicz S Ecol Evol. 2025; 15(1):e70817.

PMID: 39781258 PMC: 11706799. DOI: 10.1002/ece3.70817.


Applications of Next-Generation Sequencing Technologies and Statistical Tools in Identifying Pathways and Biomarkers for Heat Tolerance in Livestock.

Kalaignazhal G, Sejian V, Velayudhan S, Mishra C, Rebez E, Chauhan S Vet Sci. 2024; 11(12).

PMID: 39728955 PMC: 11680151. DOI: 10.3390/vetsci11120616.


References
1.
Muhlemann B, Vinner L, Margaryan A, Wilhelmson H, de la Fuente Castro C, Allentoft M . Diverse variola virus (smallpox) strains were widespread in northern Europe in the Viking Age. Science. 2020; 369(6502). DOI: 10.1126/science.aaw8977. View

2.
Carbone I, White J, Miadlikowska J, Arnold A, Miller M, Magain N . T-BAS Version 2.1: Tree-Based Alignment Selector Toolkit for Evolutionary Placement of DNA Sequences and Viewing Alignments and Specimen Metadata on Curated and Custom Trees. Microbiol Resour Announc. 2019; 8(29). PMC: 6639605. DOI: 10.1128/MRA.00328-19. View

3.
Fu L, Niu B, Zhu Z, Wu S, Li W . CD-HIT: accelerated for clustering the next-generation sequencing data. Bioinformatics. 2012; 28(23):3150-2. PMC: 3516142. DOI: 10.1093/bioinformatics/bts565. View

4.
Heather J, Chain B . The sequence of sequencers: The history of sequencing DNA. Genomics. 2015; 107(1):1-8. PMC: 4727787. DOI: 10.1016/j.ygeno.2015.11.003. View

5.
Piredda R, Grimm G, Schulze E, Denk T, Simeone M . High-throughput sequencing of 5S-IGS in oaks: Exploring intragenomic variation and algorithms to recognize target species in pure and mixed samples. Mol Ecol Resour. 2020; 21(2):495-510. DOI: 10.1111/1755-0998.13264. View