» Articles » PMID: 30992055

Towards Precision Quantification of Contamination in Metagenomic Sequencing Experiments

Overview
Journal Microbiome
Publisher Biomed Central
Specialties Genetics
Microbiology
Date 2019 Apr 18
PMID 30992055
Citations 50
Authors
Affiliations
Soon will be listed here.
Abstract

Metagenomic next-generation sequencing (mNGS) experiments involving small amounts of nucleic acid input are highly susceptible to erroneous conclusions resulting from unintentional sequencing of occult contaminants, especially those derived from molecular biology reagents. Recent work suggests that, for any given microbe detected by mNGS, an inverse linear relationship between microbial sequencing reads and sample mass implicates that microbe as a contaminant. By associating sequencing read output with the mass of a spike-in control, we demonstrate that contaminant nucleic acid can be quantified in order to identify the mass contributions of each constituent. In an experiment using a high-resolution (n = 96) dilution series of HeLa RNA spanning 3-logs of RNA mass input, we identified a complex set of contaminants totaling 9.1 ± 2.0 attograms. Given the competition between contamination and the true microbiome in ultra-low biomass samples such as respiratory fluid, quantification of the contamination within a given batch of biological samples can be used to determine a minimum mass input below which sequencing results may be distorted. Rather than completely censoring contaminant taxa from downstream analyses, we propose here a statistical approach that allows separation of the true microbial components from the actual contribution due to contamination. We demonstrate this approach using a batch of n = 97 human serum samples and note that despite E. coli contamination throughout the dataset, we are able to identify a patient sample with significantly more E. coli than expected from contamination alone. Importantly, our method assumes no prior understanding of possible contaminants, does not rely on any prior collection of environmental or reagent-only sequencing samples, and does not censor potentially clinically relevant taxa, thus making it a generalized approach to any kind of metagenomic sequencing, for any purpose, clinical or otherwise.

Citing Articles

Non-Targeted RNA Sequencing: Towards the Development of Universal Clinical Diagnosis Methods for Human and Veterinary Infectious Diseases.

Spatz S, Afonso C Vet Sci. 2024; 11(6).

PMID: 38921986 PMC: 11209166. DOI: 10.3390/vetsci11060239.


A metagenomic analysis of the phase 2 Anopheles gambiae 1000 genomes dataset reveals a wide diversity of cobionts associated with field collected mosquitoes.

Pastusiak A, Reddy M, Chen X, Hoyer I, Dorman J, Gebhardt M Commun Biol. 2024; 7(1):667.

PMID: 38816486 PMC: 11139907. DOI: 10.1038/s42003-024-06337-9.


Pathobiological signatures of dysbiotic lung injury in pediatric patients undergoing stem cell transplantation.

Zinter M, Dvorak C, Mayday M, Reyes G, Simon M, Pearce E Nat Med. 2024; 30(7):1982-1993.

PMID: 38783139 PMC: 11271406. DOI: 10.1038/s41591-024-02999-4.


Enhancing Clinical Utility: Utilization of International Standards and Guidelines for Metagenomic Sequencing in Infectious Disease Diagnosis.

Kan C, Tsang H, Pei X, Ng S, Yim A, Yu A Int J Mol Sci. 2024; 25(6).

PMID: 38542307 PMC: 10970082. DOI: 10.3390/ijms25063333.


Hybrid capture shotgun sequencing detected unexpected viruses in the cerebrospinal fluid of children with acute meningitis and encephalitis.

Launes C, Camacho J, Pons-Espinal M, Lopez-Labrador F, Esteva C, Cabrerizo M Eur J Clin Microbiol Infect Dis. 2024; 43(5):863-873.

PMID: 38438704 PMC: 11108891. DOI: 10.1007/s10096-024-04795-x.


References
1.
van der Zee A, Peeters M, de Jong C, Verbakel H, Crielaard J, Claas E . Qiagen DNA extraction kits for sample preparation for legionella PCR are not suitable for diagnostic purposes. J Clin Microbiol. 2002; 40(3):1126. PMC: 120297. DOI: 10.1128/JCM.40.3.1128.2002. View

2.
Kennedy K, Hall M, Lynch M, Moreno-Hagelsieb G, Neufeld J . Evaluating bias of illumina-based bacterial 16S rRNA gene profiles. Appl Environ Microbiol. 2014; 80(18):5717-22. PMC: 4178620. DOI: 10.1128/AEM.01451-14. View

3.
Zinter M, Dvorak C, Mayday M, Iwanaga K, Ly N, McGarry M . Pulmonary Metagenomic Sequencing Suggests Missed Infections in Immunocompromised Children. Clin Infect Dis. 2018; 68(11):1847-1855. PMC: 6784263. DOI: 10.1093/cid/ciy802. View

4.
Davis N, Proctor D, Holmes S, Relman D, Callahan B . Simple statistical identification and removal of contaminant sequences in marker-gene and metagenomics data. Microbiome. 2018; 6(1):226. PMC: 6298009. DOI: 10.1186/s40168-018-0605-2. View

5.
Weiss S, Amir A, Hyde E, Metcalf J, Song S, Knight R . Tracking down the sources of experimental contamination in microbiome studies. Genome Biol. 2015; 15(12):564. PMC: 4311479. DOI: 10.1186/s13059-014-0564-2. View