Consistent Errors in First Strand cDNA Due to Random Hexamer Mispriming

Overview
Journal PLoS One
Date 2014 Jan 4
PMID 24386481
Citations 25
Abstract

Priming of random hexamers in cDNA synthesis is known to show sequence bias. In addition, it has recently been suggested that mismatches in random hexamer priming could cause mismatches between the original RNA fragment and the observed sequence reads. To explore random hexamer mispriming as a potential source of these errors, we analyzed two independently generated RNA-seq datasets of synthetic ERCC spikes for which the reference is known. First strand cDNA synthesized by random hexamer priming on RNA showed consistent position- and nucleotide-specific mismatch errors in the first seven nucleotides. The mismatch errors found in both datasets are consistent in distribution, and thermodynamically stable mismatches are more common. This strongly indicates that RNA-DNA mispriming of specific random hexamers causes these errors. Because of their consistency and specificity, mispriming errors can have profound implications for downstream applications if not dealt with properly.
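
The kind of analysis the abstract describes, tallying mismatches by read position and by nucleotide substitution over the first seven bases, can be illustrated with a minimal Python sketch. This is not the authors' pipeline. The function name `mismatch_profile`, the input format (gap-free read/reference string pairs with the read's 5' end first), and the toy sequences are all assumptions made for illustration. Real data would need an aligner and strand handling, both of which are omitted here.

```python
from collections import Counter


def mismatch_profile(pairs, n_positions=7):
    """Count substitutions per read position over the first n_positions bases.

    pairs: iterable of (read, reference) strings that are already aligned
           without gaps, with the 5' end of the read at index 0 (assumed).
    Returns a list of Counters, one per position, keyed by
    (reference_base, read_base).
    """
    profile = [Counter() for _ in range(n_positions)]
    for read, ref in pairs:
        window = zip(read[:n_positions], ref[:n_positions])
        for i, (read_base, ref_base) in enumerate(window):
            if read_base != ref_base:
                profile[i][(ref_base, read_base)] += 1
    return profile


# Made-up toy data, not taken from the ERCC datasets.
pairs = [
    ("ACGTACGTAC", "ACGTACGTAC"),  # perfect match
    ("TCGTACGTAC", "ACGTACGTAC"),  # A>T at position 1
    ("TCGTACGTAC", "ACGTACGTAC"),  # A>T at position 1 again
    ("ACGAACGTAC", "ACGTACGTAC"),  # T>A at position 4
    ("ACGTACGTAG", "ACGTACGTAC"),  # mismatch at position 10, outside the window
]

profile = mismatch_profile(pairs)
for position, counts in enumerate(profile, start=1):
    print(position, dict(counts))
```

A substitution that recurs at the same position across independent datasets, such as the repeated A>T at position 1 in the toy input, is the sort of consistent, position-specific signal the abstract attributes to mispriming rather than to random sequencing error.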

Citing Articles

Artifacts and biases of the reverse transcription reaction in RNA sequencing.

Verwilt J, Mestdagh P, Vandesompele J RNA. 2023; 29(7):889-897.

PMID: 36990512 PMC: 10275267. DOI: 10.1261/rna.079623.123.


See-N-Seq: RNA sequencing of target single cells identified by microscopy via micropatterning of hydrogel porosity.

Lee J, Park E, Choi J, Matthews K, Lam A, Deng X Commun Biol. 2022; 5(1):768.

PMID: 35908100 PMC: 9338959. DOI: 10.1038/s42003-022-03703-3.


Trimming and Validation of Illumina Short Reads Using Trimmomatic, Trinity Assembly, and Assessment of RNA-Seq Data.

Sewe S, Silva G, Sicat P, Seal S, Visendi P Methods Mol Biol. 2022; 2443:211-232.

PMID: 35037208 DOI: 10.1007/978-1-0716-2067-0_11.


Barcoded oligonucleotides ligated on RNA amplified for multiplexed and parallel in situ analyses.

Liu S, Punthambaker S, Iyer E, Ferrante T, Goodwin D, Furth D Nucleic Acids Res. 2021; 49(10):e58.

PMID: 33693773 PMC: 8191787. DOI: 10.1093/nar/gkab120.


A comparison of unamplified and massively multiplexed PCR amplification for murine antibody repertoire sequencing.

Rettig T, Pecaut M, Chapes S FASEB Bioadv. 2020; 1(1):6-17.

PMID: 32123808 PMC: 6996338. DOI: 10.1096/fba.1017.

