» Articles » PMID: 39363203

Design and Implementation of a Metagenomic Analytical Pipeline for Respiratory Pathogen Detection

Overview
Journal BMC Res Notes
Publisher Biomed Central
Date 2024 Oct 3
PMID 39363203
Authors
Affiliations
Soon will be listed here.
Abstract

Objective: We developed an in-house bioinformatics pipeline to improve the detection of respiratory pathogens in metagenomic sequencing data. This pipeline addresses the need for short-time analysis, high accuracy, scalability, and reproducibility in a high-performance computing environment.

Results: We evaluated our pipeline using ninety synthetic metagenomes designed to simulate nasopharyngeal swab samples. The pipeline successfully identified 177 out of 204 respiratory pathogens present in the compositions, with an average processing time of approximately 4 min per sample (processing 1 million paired-end reads of 150 base pairs). For the estimation of all the 470 taxa included in the compositions, the pipeline demonstrated high accuracy, identifying 420 and achieving a correlation of 0.9 between their actual and predicted relative abundances. Among the identified taxa, 27 were significantly underestimated or overestimated, including only three clinically relevant pathogens. We also validated the pipeline by applying it to a clinical dataset from a study on metagenomic pathogen characterization in patients with acute respiratory infections and successfully identified all pathogens responsible for the diagnosed infections. These findings underscore the pipeline's effectiveness in pathogen detection and highlight its potential utility in respiratory pathogen surveillance.

References
1.
Stephens Z, Lee S, Faghri F, Campbell R, Zhai C, Efron M . Big Data: Astronomical or Genomical?. PLoS Biol. 2015; 13(7):e1002195. PMC: 4494865. DOI: 10.1371/journal.pbio.1002195. View

2.
Lindgreen S, Adair K, Gardner P . An evaluation of the accuracy and speed of metagenome analysis tools. Sci Rep. 2016; 6:19233. PMC: 4726098. DOI: 10.1038/srep19233. View

3.
Gourle H, Karlsson-Lindsjo O, Hayer J, Bongcam-Rudloff E . Simulating Illumina metagenomic data with InSilicoSeq. Bioinformatics. 2018; 35(3):521-522. PMC: 6361232. DOI: 10.1093/bioinformatics/bty630. View

4.
Kim D, Paggi J, Park C, Bennett C, Salzberg S . Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat Biotechnol. 2019; 37(8):907-915. PMC: 7605509. DOI: 10.1038/s41587-019-0201-4. View

5.
Li C, Li W, Zhou J, Zhang B, Feng Y, Xu C . High resolution metagenomic characterization of complex infectomes in paediatric acute respiratory infection. Sci Rep. 2020; 10(1):3963. PMC: 7054269. DOI: 10.1038/s41598-020-60992-6. View