» Articles » PMID: 29875188

Control of Artifactual Variation in Reported Intersample Relatedness During Clinical Use of a Mycobacterium Tuberculosis Sequencing Pipeline

Overview
Specialty Microbiology
Date 2018 Jun 8
PMID 29875188
Citations 8
Authors
Affiliations
Soon will be listed here.
Abstract

Contact tracing requires reliable identification of closely related bacterial isolates. When we noticed the reporting of artifactual variation between isolates during routine next-generation sequencing of spp., we investigated its basis in 2,018 consecutive isolates. In the routine process used, clinical samples were decontaminated and inoculated into broth cultures; from positive broth cultures DNA was extracted and sequenced, reads were mapped, and consensus sequences were determined. We investigated the process of consensus sequence determination, which selects the most common nucleotide at each position. Having determined the high-quality read depth and depth of minor variants across 8,006 genomic regions, we quantified the relationship between the minor variant depth and the amount of nonmycobacterial bacterial DNA, which originates from commensal microbes killed during sample decontamination. In the presence of nonmycobacterial bacterial DNA, we found significant increases in minor variant frequencies, of more than 1.5-fold, in 242 regions covering 5.1% of the genome. Included within these were four high-variation regions strongly influenced by the amount of nonmycobacterial bacterial DNA. Excluding these four regions from pairwise distance comparisons reduced biologically implausible variation from 5.2% to 0% in an independent validation set derived from 226 individuals. Thus, we demonstrated an approach identifying critical genomic regions contributing to clinically relevant artifactual variation in bacterial similarity searches. The approach described monitors the outputs of the complex multistep laboratory and bioinformatics process, allows periodic process adjustments, and will have application to quality control of routine bacterial genomics.

Citing Articles

Description of Bacterial RNA Transcripts Detected in - Infected Cells from Peripheral Human Granulomas using Single Cell RNA Sequencing.

Moos P, Carey A, Joseph J, Kialo S, Norrie J, Moyarelce J bioRxiv. 2024; .

PMID: 39229107 PMC: 11370423. DOI: 10.1101/2024.08.20.608852.


Pangenome databases improve host removal and mycobacteria classification from clinical metagenomic data.

Hall M, Coin L Gigascience. 2024; 13.

PMID: 38573185 PMC: 10993716. DOI: 10.1093/gigascience/giae010.


Genomic insights into anthropozoonotic tuberculosis in captive sun bears (Helarctos malayanus) and an Asiatic black bear (Ursus thibetanus) in Cambodia.

Officer K, Walker T, Cheng S, Heng S, Hide M, Banuls A Sci Rep. 2024; 14(1):7343.

PMID: 38538629 PMC: 10973429. DOI: 10.1038/s41598-024-57318-1.


transmission in Birmingham, UK, 2009-19: An observational study.

Walker T, Choisy M, Dedicoat M, Drennan P, Wyllie D, Yang-Turner F Lancet Reg Health Eur. 2022; 17:100361.

PMID: 35345560 PMC: 8956939. DOI: 10.1016/j.lanepe.2022.100361.


High precision variant and antimicrobial resistance calling from metagenomic Nanopore sequencing.

Sanderson N, Swann J, Barker L, Kavanagh J, Hoosdally S, Crook D Genome Res. 2020; 30(9):1354-1363.

PMID: 32873606 PMC: 7545138. DOI: 10.1101/gr.262865.120.


References
1.
Langmead B, Trapnell C, Pop M, Salzberg S . Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 2009; 10(3):R25. PMC: 2690996. DOI: 10.1186/gb-2009-10-3-r25. View

2.
Morgulis A, Gertz E, Schaffer A, Agarwala R . A fast and symmetric DUST implementation to mask low-complexity DNA sequences. J Comput Biol. 2006; 13(5):1028-40. DOI: 10.1089/cmb.2006.13.1028. View

3.
Pankhurst L, Del Ojo Elias C, Votintseva A, Walker T, Cole K, Davies J . Rapid, comprehensive, and affordable mycobacterial diagnosis with whole-genome sequencing: a prospective study. Lancet Respir Med. 2015; 4(1):49-58. PMC: 4698465. DOI: 10.1016/S2213-2600(15)00466-X. View

4.
Bedell J, Korf I, Gish W . MaskerAid: a performance enhancement to RepeatMasker. Bioinformatics. 2001; 16(11):1040-1. DOI: 10.1093/bioinformatics/16.11.1040. View

5.
Quan T, Bawa Z, Foster D, Walker T, Del Ojo Elias C, Rathod P . Evaluation of Whole-Genome Sequencing for Mycobacterial Species Identification and Drug Susceptibility Testing in a Clinical Setting: a Large-Scale Prospective Assessment of Performance against Line Probe Assays and Phenotyping. J Clin Microbiol. 2017; 56(2). PMC: 5786738. DOI: 10.1128/JCM.01480-17. View