» Articles » PMID: 28582401

DeepNano: Deep Recurrent Neural Networks for Base Calling in MinION Nanopore Reads

Overview
Journal PLoS One
Date 2017 Jun 6
PMID 28582401
Citations 83
Authors
Affiliations
Soon will be listed here.
Abstract

The MinION device by Oxford Nanopore produces very long reads (reads over 100 kBp were reported); however it suffers from high sequencing error rate. We present an open-source DNA base caller based on deep recurrent neural networks and show that the accuracy of base calling is much dependent on the underlying software and can be improved by considering modern machine learning methods. By employing carefully crafted recurrent neural networks, our tool significantly improves base calling accuracy on data from R7.3 version of the platform compared to the default base caller supplied by the manufacturer. On R9 version, we achieve results comparable to Nanonet base caller provided by Oxford Nanopore. Availability of an open source tool with high base calling accuracy will be useful for development of new applications of the MinION device, including infectious disease detection and custom target enrichment during sequencing.

Citing Articles

Artificial intelligence and machine learning in cell-free-DNA-based diagnostics.

Tsui W, Ding S, Jiang P, Lo Y Genome Res. 2025; 35(1):1-19.

PMID: 39843210 PMC: 11789496. DOI: 10.1101/gr.278413.123.


DeepCorr: a novel error correction method for 3GS long reads based on deep learning.

Wang R, Chen J PeerJ Comput Sci. 2024; 10:e2160.

PMID: 39678285 PMC: 11639150. DOI: 10.7717/peerj-cs.2160.


GCRTcall: a transformer based basecaller for nanopore RNA sequencing enhanced by gated convolution and relative position embedding via joint loss training.

Li Q, Sun C, Wang D, Lou J Front Genet. 2024; 15:1443532.

PMID: 39649096 PMC: 11621211. DOI: 10.3389/fgene.2024.1443532.


BaseNet: A transformer-based toolkit for nanopore sequencing signal decoding.

Li Q, Sun C, Wang D, Lou J Comput Struct Biotechnol J. 2024; 23:3430-3444.

PMID: 39391372 PMC: 11465205. DOI: 10.1016/j.csbj.2024.09.016.


A generalized protein identification method for novel and diverse sequencing technologies.

Bhandari B, Goldman N NAR Genom Bioinform. 2024; 6(3):lqae126.

PMID: 39296929 PMC: 11409062. DOI: 10.1093/nargab/lqae126.


References
1.
Loose M, Malla S, Stout M . Real-time selective sequencing using nanopore technology. Nat Methods. 2016; 13(9):751-4. PMC: 5008457. DOI: 10.1038/nmeth.3930. View

2.
Szalay T, Golovchenko J . De novo sequencing and variant calling with nanopores using PoreSeq. Nat Biotechnol. 2015; 33(10):1087-91. PMC: 4877053. DOI: 10.1038/nbt.3360. View

3.
Goodwin S, Gurtowski J, Ethe-Sayers S, Deshpande P, Schatz M, McCombie W . Oxford Nanopore sequencing, hybrid error correction, and de novo assembly of a eukaryotic genome. Genome Res. 2015; 25(11):1750-6. PMC: 4617970. DOI: 10.1101/gr.191395.115. View

4.
Quick J, Loman N, Duraffour S, Simpson J, Severi E, Cowley L . Real-time, portable genome sequencing for Ebola surveillance. Nature. 2016; 530(7589):228-232. PMC: 4817224. DOI: 10.1038/nature16996. View

5.
David M, Dursi L, Yao D, Boutros P, Simpson J . Nanocall: an open source basecaller for Oxford Nanopore sequencing data. Bioinformatics. 2016; 33(1):49-55. PMC: 5408768. DOI: 10.1093/bioinformatics/btw569. View