Deepbinner: Demultiplexing Barcoded Oxford Nanopore Reads with Deep Convolutional Neural Networks

Overview
Specialty: Biology
Date: 2018 Nov 21
PMID: 30458005
Citations: 98
Abstract

Multiplexing, the simultaneous sequencing of multiple barcoded DNA samples on a single flow cell, has made Oxford Nanopore sequencing cost-effective for small genomes. However, it depends on the ability to sort the resulting sequencing reads by barcode, and current demultiplexing tools fail to classify many reads. Here we present Deepbinner, a tool for Oxford Nanopore demultiplexing that uses a deep neural network to classify reads based on the raw electrical read signal. This 'signal-space' approach allows for greater accuracy than existing 'base-space' tools (Albacore and Porechop) for which signals must first be converted to DNA base calls, itself a complex problem that can introduce noise into the barcode sequence. To assess Deepbinner and existing tools, we performed multiplex sequencing on 12 amplicons chosen for their distinguishability. This allowed us to establish a ground truth classification for each read based on internal sequence alone. Deepbinner had the lowest rate of unclassified reads (7.8%) and the highest demultiplexing precision (98.5% of classified reads were correctly assigned). It can be used alone (to maximise the number of classified reads) or in conjunction with other demultiplexers (to maximise precision and minimise false positive classifications). We also found cross-sample chimeric reads (0.3%) and evidence of barcode switching (0.3%) in our dataset, which likely arise during library preparation and may be detrimental for quantitative studies that use multiplexing. Deepbinner is open source (GPLv3) and available at https://github.com/rrwick/Deepbinner.
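
The two headline metrics in the abstract, and the idea of combining demultiplexers to trade classified reads for precision, can be illustrated with a small sketch. This is not Deepbinner's code: the function names `demux_metrics` and `consensus`, the dictionary layout (read ID mapped to barcode), and the use of `None` for an unclassified read are all assumptions made for illustration. Combining tools by requiring agreement is one plausible reading of "in conjunction with other demultiplexers", not a confirmed description of the authors' procedure.

```python
import math


def demux_metrics(truth, predicted):
    """Return (unclassified_rate, precision) for one demultiplexer.

    truth:     dict of read ID -> ground-truth barcode
    predicted: dict of read ID -> called barcode, or None if unclassified
               (hypothetical layout; a read missing from `predicted`
               is also treated as unclassified)
    """
    total = len(truth)
    if total == 0:
        raise ValueError("truth must contain at least one read")

    classified = 0
    correct = 0
    for read_id, true_barcode in truth.items():
        call = predicted.get(read_id)
        if call is None:
            continue  # unclassified read
        classified += 1
        if call == true_barcode:
            correct += 1

    unclassified_rate = (total - classified) / total
    # Precision is defined over classified reads only.
    precision = correct / classified if classified else math.nan
    return unclassified_rate, precision


def consensus(calls_a, calls_b):
    """Keep a barcode call only when both tools agree on it.

    Any disagreement, or a read left unclassified by either tool,
    becomes None (unclassified) in the combined result.
    """
    combined = {}
    for read_id in calls_a.keys() | calls_b.keys():
        a = calls_a.get(read_id)
        b = calls_b.get(read_id)
        combined[read_id] = a if (a is not None and a == b) else None
    return combined


if __name__ == "__main__":
    # Toy data, invented purely to exercise the functions.
    truth = {"r1": 1, "r2": 2, "r3": 3, "r4": 4}
    tool_a = {"r1": 1, "r2": 2, "r3": 1, "r4": None}
    tool_b = {"r1": 1, "r2": 2, "r3": 3, "r4": 4}

    print(demux_metrics(truth, tool_a))
    print(demux_metrics(truth, consensus(tool_a, tool_b)))
```

In the toy data, requiring agreement discards the one read that tool A misassigned, so precision rises while the unclassified rate also rises. This mirrors the trade-off the abstract describes between maximising classified reads and minimising false positive classifications.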

Citing Articles

Complex exchanges among plasmids and clonal expansion of lineages shape the population structure and virulence of .

Laing R, Foster M, Hassani M, Kotzen B, Huang W, Shea T bioRxiv. 2025; .

PMID: 39974970 PMC: 11838331. DOI: 10.1101/2025.01.29.635312.


Genomic characteristics and genetic manipulation of the marine yeast Scheffersomyces spartinae.

Sharma A, Liu X, Yin J, Yu P, Qi L, He M Appl Microbiol Biotechnol. 2024; 108(1):539.

PMID: 39702830 PMC: 11659333. DOI: 10.1007/s00253-024-13382-1.


TDFPS-Designer: an efficient toolkit for barcode design and selection in nanopore sequencing.

Qi J, Li Z, Zhang Y, Li G, Gao X, Han R Genome Biol. 2024; 25(1):285.

PMID: 39497190 PMC: 11533379. DOI: 10.1186/s13059-024-03423-3.


m6ATM: a deep learning framework for demystifying the m6A epitranscriptome with Nanopore long-read RNA-seq data.

Yu B, Nagae G, Midorikawa Y, Tatsuno K, Dasgupta B, Aburatani H Brief Bioinform. 2024; 25(6).

PMID: 39438075 PMC: 11495873. DOI: 10.1093/bib/bbae529.


Same same, but different: exploring the enigmatic role of the pituitary adenylate cyclase-activating polypeptide (PACAP) in invertebrate physiology.

Pirger Z, Urban P, Galik B, Kiss B, Tapodi A, Schmidt J J Comp Physiol A Neuroethol Sens Neural Behav Physiol. 2024; 210(6):909-925.

PMID: 38940930 PMC: 11551080. DOI: 10.1007/s00359-024-01706-5.

