
The Impact of Bioinformatics Pipelines on Microbiota Studies: Does the Analytical "Microscope" Affect the Biological Interpretation?

Overview
Journal Microorganisms
Specialty Microbiology
Date 2019 Sep 29
PMID 31561435
Citations 10
Abstract

Targeted metagenomics is the solution of choice to reveal differential microbial profiles (defined by richness, diversity and composition) as part of case-control studies. It is well documented that each data processing step can introduce bias into the results. Selecting a bioinformatics pipeline to analyze high-throughput sequencing data from start to finish therefore remains one of the critical considerations in case-control microbiota study design. Consequently, the aim of this study was to assess whether the same biological conclusions regarding human gut microbiota composition and diversity could be reached using different bioinformatics pipelines. In this work, we considered four pipelines (mothur, QIIME, kraken and CLARK) with different versions and databases, and examined their impact on the outcome of metagenetic analysis of Ion Torrent 16S sequencing data. We re-analyzed a case-control study evaluating the impact of colonization by the intestinal protozoa sp. on the human gut microbial profile. Although most pipelines reported the same trends in this case-control study, we demonstrated how the use of different pipelines affects the biological conclusions that can be drawn. Targeted metagenomics must therefore be considered a profiling tool for obtaining a broad sense of the variations of the microbiota, rather than an accurate identification tool.
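As an illustration of what "richness, diversity and composition" mean in practice, the following minimal Python sketch compares two taxonomic profiles of the same sample. It is not the authors' method: the genus names, the read counts, and the labels `pipeline_a` and `pipeline_b` are hypothetical, and the Shannon index and Bray-Curtis dissimilarity are used here only as common example metrics, not as the ones reported in the study.

```python
import math

def richness(counts):
    """Number of taxa observed with at least one read."""
    return sum(1 for c in counts.values() if c > 0)

def shannon(counts):
    """Shannon diversity index (natural log) of a taxon -> read-count profile."""
    total = sum(counts.values())
    return -sum((c / total) * math.log(c / total)
                for c in counts.values() if c > 0)

def bray_curtis(a, b):
    """Bray-Curtis dissimilarity between two profiles, on relative abundances.

    0 means identical composition, 1 means no shared taxa.
    """
    total_a, total_b = sum(a.values()), sum(b.values())
    taxa = set(a) | set(b)
    return 0.5 * sum(abs(a.get(t, 0) / total_a - b.get(t, 0) / total_b)
                     for t in taxa)

# Hypothetical genus-level read counts for ONE sample, as two different
# pipelines might report them (values invented for illustration only).
pipeline_a = {"Bacteroides": 50, "Faecalibacterium": 30, "Prevotella": 20}
pipeline_b = {"Bacteroides": 50, "Faecalibacterium": 50}

for name, profile in (("pipeline_a", pipeline_a), ("pipeline_b", pipeline_b)):
    print(name, "richness:", richness(profile),
          "Shannon:", round(shannon(profile), 3))
print("Bray-Curtis between pipelines:",
      round(bray_curtis(pipeline_a, pipeline_b), 3))
```

In this toy case the same sample yields a different richness and a non-zero between-pipeline dissimilarity, which is the kind of discrepancy the abstract warns can change the biological interpretation.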

Citing Articles

Integrating Metagenomic and Culture-Based Techniques to Detect Foodborne Pathogens and Antimicrobial Resistance Genes in Malaysian Produce.

Quek J, Wong J, Tan J, Yeo C, Saw S Foods. 2025; 14(3).

PMID: 39941945 PMC: 11817458. DOI: 10.3390/foods14030352.


Comparison of different microbiome analysis pipelines to validate their reproducibility of gastric mucosal microbiome composition.

Lehr K, Oosterlinck B, Then C, Gemmell M, Gedgaudas R, Bornschein J mSystems. 2025; 10(2):e0135824.

PMID: 39873520 PMC: 11834405. DOI: 10.1128/msystems.01358-24.


Microbial and Metabolic Gut Profiling across Seven Malignancies Identifies Fecal and Formic Acid as Commonly Altered in Cancer Patients.

Kulecka M, Czarnowski P, Balabas A, Turkot M, Kruczkowska-Tarantowicz K, Zeber-Lubecka N Int J Mol Sci. 2024; 25(15).

PMID: 39125593 PMC: 11311272. DOI: 10.3390/ijms25158026.


Alterations in gut microbiota caused by major depressive disorder or a low FODMAP diet and where they overlap.

ONeill S, Minehan M, Knight-Agarwal C, Pyne D Front Nutr. 2024; 10:1303405.

PMID: 38260072 PMC: 10800578. DOI: 10.3389/fnut.2023.1303405.


A next-generation sequencing approach for the detection of mixed species in canned tuna.

Klapper R, Velasco A, Doring M, Schroder U, Sotelo C, Brinks E Food Chem X. 2023; 17:100560.

PMID: 36845509 PMC: 9943852. DOI: 10.1016/j.fochx.2023.100560.

