» Articles » PMID: 27672352

Monitoring Error Rates In Illumina Sequencing

Overview
Journal J Biomol Tech
Date 2016 Sep 28
PMID 27672352
Citations 32
Authors
Affiliations
Soon will be listed here.
Abstract

Guaranteeing high-quality next-generation sequencing data in a rapidly changing environment is an ongoing challenge. The introduction of the Illumina NextSeq 500 and the depreciation of specific metrics from Illumina's Sequencing Analysis Viewer (SAV; Illumina, San Diego, CA, USA) have made it more difficult to determine directly the baseline error rate of sequencing runs. To improve our ability to measure base quality, we have created an open-source tool to construct the Percent Perfect Reads (PPR) plot, previously provided by the Illumina sequencers. The PPR program is compatible with HiSeq 2000/2500, MiSeq, and NextSeq 500 instruments and provides an alternative to Illumina's quality value (Q) scores for determining run quality. Whereas Q scores are representative of run quality, they are often overestimated and are sourced from different look-up tables for each platform. The PPR's unique capabilities as a cross-instrument comparison device, as a troubleshooting tool, and as a tool for monitoring instrument performance can provide an increase in clarity over SAV metrics that is often crucial for maintaining instrument health. These capabilities are highlighted.

Citing Articles

Effect of sequencing platforms on the sensitivity of chemical mutation detection using Hawk-Seq™.

Hosoi S, Hirose T, Matsumura S, Otsubo Y, Saito K, Miyazawa M Genes Environ. 2024; 46(1):20.

PMID: 39385252 PMC: 11462924. DOI: 10.1186/s41021-024-00313-9.


Sequencing by binding rivals SMOR error-corrected sequencing by synthesis technology for accurate detection and quantification of minor (< 0.1%) subpopulation variants.

Allender C, Wike C, Porter W, Ellis D, Lemmer D, Pond S BMC Genomics. 2024; 25(1):789.

PMID: 39160478 PMC: 11331594. DOI: 10.1186/s12864-024-10697-1.


Limitations of current high-throughput sequencing technologies lead to biased expression estimates of endogenous retroviral elements.

Kitsou K, Katzourakis A, Magiorkinis G NAR Genom Bioinform. 2024; 6(3):lqae081.

PMID: 38984066 PMC: 11231582. DOI: 10.1093/nargab/lqae081.


Sequencing by Binding rivals error-corrected Sequencing by Synthesis technology for accurate detection and quantification of minor (<0.1%) subpopulation variants.

Allender C, Wike C, Ellis D, Lemmer D, Porter T, Pond S Res Sq. 2024; .

PMID: 38826386 PMC: 11142358. DOI: 10.21203/rs.3.rs-4391713/v1.


TrieDedup: a fast trie-based deduplication algorithm to handle ambiguous bases in high-throughput sequencing.

Hu J, Luo S, Tian M, Ye A BMC Bioinformatics. 2024; 25(1):154.

PMID: 38637756 PMC: 11025179. DOI: 10.1186/s12859-024-05775-w.


References
1.
Dohm J, Lottaz C, Borodina T, Himmelbauer H . Substantial biases in ultra-short read data sets from high-throughput DNA sequencing. Nucleic Acids Res. 2008; 36(16):e105. PMC: 2532726. DOI: 10.1093/nar/gkn425. View

2.
Langmead B, Salzberg S . Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012; 9(4):357-9. PMC: 3322381. DOI: 10.1038/nmeth.1923. View

3.
Ewing B, Green P . Base-calling of automated sequencer traces using phred. II. Error probabilities. Genome Res. 1998; 8(3):186-94. View

4.
Nakamura K, Oshima T, Morimoto T, Ikeda S, Yoshikawa H, Shiwa Y . Sequence-specific error profile of Illumina sequencers. Nucleic Acids Res. 2011; 39(13):e90. PMC: 3141275. DOI: 10.1093/nar/gkr344. View

5.
DePristo M, Banks E, Poplin R, Garimella K, Maguire J, Hartl C . A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat Genet. 2011; 43(5):491-8. PMC: 3083463. DOI: 10.1038/ng.806. View