De Novo Identification of Microbial Contaminants in Low Microbial Biomass Microbiomes with Squeegee
Overview
Affiliations
Computational analysis of host-associated microbiomes has opened the door to numerous discoveries relevant to human health and disease. However, contaminant sequences in metagenomic samples can potentially impact the interpretation of findings reported in microbiome studies, especially in low-biomass environments. Contamination from DNA extraction kits or sampling lab environments leaves taxonomic "bread crumbs" across multiple distinct sample types. Here we describe Squeegee, a de novo contamination detection tool that is based upon this principle, allowing the detection of microbial contaminants when negative controls are unavailable. On the low-biomass samples, we compare Squeegee predictions to experimental negative control data and show that Squeegee accurately recovers putative contaminants. We analyze samples of varying biomass from the Human Microbiome Project and identify likely, previously unreported kit contamination. Collectively, our results highlight that Squeegee can identify microbial contaminants with high precision and thus represents a computational approach for contaminant detection when negative controls are unavailable.
Polyphonia: detecting inter-sample contamination in viral genomic sequencing data.
Krasilnikova L, Tomkins-Tinch C, Gayton A, Schaffner S, Dobbins S, Gladden-Young A Bioinformatics. 2024; 40(12).
PMID: 39673434 PMC: 11652266. DOI: 10.1093/bioinformatics/btae698.
Standardization of gut microbiome analysis in sports.
Mancin L, Paoli A, Berry S, Gonzalez J, Collins A, Lizarraga M Cell Rep Med. 2024; 5(10):101759.
PMID: 39368478 PMC: 11514603. DOI: 10.1016/j.xcrm.2024.101759.
A brain microbiome in salmonids at homeostasis.
Mani A, Henn C, Couch C, Patel S, Lieke T, Chan J Sci Adv. 2024; 10(38):eado0277.
PMID: 39292785 PMC: 11409976. DOI: 10.1126/sciadv.ado0277.
Chopra A, Franco-Duarte R, Rajagopal A, Choowong P, Soares P, Rito T Sci Rep. 2024; 14(1):1476.
PMID: 38233502 PMC: 10794416. DOI: 10.1038/s41598-023-50891-x.
Simon S, Schmidt K, Griesdorn L, Soares A, Bornemann T, Probst A BMC Genomics. 2023; 24(1):727.
PMID: 38041056 PMC: 10693096. DOI: 10.1186/s12864-023-09853-w.