PMID: 23303777

Discovery and Characterization of Artifactual Mutations in Deep Coverage Targeted Capture Sequencing Data Due to Oxidative DNA Damage During Sample Preparation

Abstract

As researchers begin probing deep-coverage sequencing data for increasingly rare mutations and subclonal events, the fidelity of next-generation sequencing (NGS) laboratory methods will become increasingly critical. Although error rates for sequencing and polymerase chain reaction (PCR) are well documented, the effects that DNA extraction and other library preparation steps could have on downstream sequence integrity have not been thoroughly evaluated. Here, we describe the discovery of novel C>A/G>T transversion artifacts found at low allelic fractions in targeted capture data. Characteristics such as sequencer read orientation and presence in both tumor and normal samples strongly indicated a non-biological mechanism. We identified the source as oxidation of DNA during acoustic shearing in samples containing reactive contaminants from the extraction process. We show generation of 8-oxoguanine (8-oxoG) lesions during DNA shearing, present analysis tools to detect oxidation in sequencing data, and suggest methods to reduce DNA oxidation through the introduction of antioxidants. Further, informatics methods are presented to confidently filter these artifacts from sequencing data sets. Though only seen in a low percentage of reads in affected samples, such artifacts could have profoundly deleterious effects on the ability to confidently call rare mutations, and eliminating other possible sources of artifacts should become a priority for the research community.
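The read-orientation signal mentioned in the abstract can be illustrated with a small sketch. This is not the authors' tool or filter; it is a minimal Python example under stated assumptions: each read supporting the alternate allele is reduced to a hypothetical pair (read number within the pair, alignment strand), and the function names and the `min_reads` and `skew_threshold` values are illustrative choices, not values from the paper. The idea is that a true variant should be supported by both pair orientations in roughly equal numbers, whereas damage introduced on one strand before amplification would tend to concentrate C>A/G>T support in a single orientation.

```python
from collections import Counter
from typing import Iterable, Optional, Tuple

# One alt-supporting read: (read_number, strand).
# read_number is 1 or 2 (first or second read of the pair);
# strand is "+" or "-" (alignment strand). Both fields are assumptions
# about how the input has been summarized upstream.
Obs = Tuple[int, str]


def orientation_class(read_number: int, strand: str) -> str:
    """Collapse (read-of-pair, strand) into the pair orientation F1R2 or F2R1."""
    if read_number not in (1, 2) or strand not in ("+", "-"):
        raise ValueError(f"unexpected observation: {(read_number, strand)}")
    # Read 1 forward and read 2 reverse both belong to an F1R2 pair.
    forward_is_read1 = (read_number == 1) == (strand == "+")
    return "F1R2" if forward_is_read1 else "F2R1"


def orientation_skew(alt_reads: Iterable[Obs]) -> Optional[float]:
    """Fraction of alt reads in the dominant orientation (0.5 to 1.0).

    Returns None when there are no alt-supporting reads.
    """
    counts = Counter(orientation_class(n, s) for n, s in alt_reads)
    total = counts["F1R2"] + counts["F2R1"]
    if total == 0:
        return None
    return max(counts["F1R2"], counts["F2R1"]) / total


def is_suspect(ref: str, alt: str, alt_reads: Iterable[Obs],
               min_reads: int = 4, skew_threshold: float = 0.9) -> bool:
    """Flag a C>A or G>T call whose support is concentrated in one orientation.

    min_reads and skew_threshold are illustrative defaults only.
    """
    if (ref.upper(), alt.upper()) not in {("C", "A"), ("G", "T")}:
        return False
    reads = list(alt_reads)
    skew = orientation_skew(reads)
    return skew is not None and len(reads) >= min_reads and skew >= skew_threshold
```

For example, `is_suspect("G", "T", [(1, "+")] * 5)` flags the call because all five supporting reads share one orientation, while a G>T call supported evenly by both orientations is left alone. In practice such a flag would be one input among several (allelic fraction, presence in matched normal samples), not a standalone filter.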

Citing Articles

Clonal dynamics and somatic evolution of haematopoiesis in mouse.

Kapadia C, Williams N, Dawson K, Watson C, Yousefzadeh M, Le D Nature. 2025; .

PMID: 40044850 DOI: 10.1038/s41586-025-08625-8.


Fast and efficient method for parallel construction of targeted exome and methylome single-stranded DNA sequencing libraries.

Kim E, An S, Ahn H, Lim J, Kim S, Park A Sci Rep. 2025; 15(1):7144.

PMID: 40021910 PMC: 11871346. DOI: 10.1038/s41598-025-91537-4.


Exome sequencing of UK birth cohorts.

Koko M, Fabian L, Popov I, Eberhardt R, Zakharov G, Huang Q Wellcome Open Res. 2025; 9:390.

PMID: 39839975 PMC: 11747307. DOI: 10.12688/wellcomeopenres.22697.2.


Immunogenomic determinants of exceptional response to immune checkpoint inhibition in renal cell carcinoma.

Jammihal T, Saliby R, Labaki C, Soulati H, Gallegos J, Peris A Nat Cancer. 2025; 6(2):372-384.

PMID: 39789182 DOI: 10.1038/s43018-024-00896-w.


Genomic heterogeneity and ploidy identify patients with intrinsic resistance to PD-1 blockade in metastatic melanoma.

Tarantino G, Ricker C, Wang A, Ge W, Aprati T, Huang A Sci Adv. 2024; 10(48):eadp4670.

PMID: 39602539 PMC: 11601251. DOI: 10.1126/sciadv.adp4670.

