The Impact of Bioinformatics Pipelines on Microbiota Studies: Does the Analytical "Microscope" Affect the Biological Interpretation?
Overview
Targeted metagenomics is the solution of choice to reveal differential microbial profiles (defined by richness, diversity and composition) as part of case-control studies. It is well documented that each data processing step can introduce bias into the results. However, the choice of a complete bioinformatics pipeline, covering the analysis of high-throughput sequencing data from start to finish, remains one of the critical considerations when designing a case-control microbiota study. Consequently, the aim of this study was to assess whether the same biological conclusions regarding human gut microbiota composition and diversity could be reached using different bioinformatics pipelines. In this work, we considered four pipelines (mothur, QIIME, kraken and CLARK) with different versions and databases, and examined their impact on the outcome of metagenetic analysis of Ion Torrent 16S sequencing data. We re-analyzed a case-control study evaluating the impact of colonization by the intestinal protozoa sp. on the human gut microbial profile. Although most pipelines reported the same trends in this case-control study, we demonstrated how the use of different pipelines affects the biological conclusions that can be drawn. Targeted metagenomics must therefore be considered a profiling tool that gives a broad sense of the variations of the microbiota, rather than an accurate identification tool.