
Fast and Accurate De Novo Genome Assembly from Long Uncorrected Reads

Overview
Journal: Genome Res
Specialty: Genetics
Date: 2017 Jan 20
PMID: 28100585
Citations: 1336
Abstract

The assembly of long reads from Pacific Biosciences and Oxford Nanopore Technologies typically requires resource-intensive error-correction and consensus-generation steps to obtain high-quality assemblies. We show that the error-correction step can be omitted and that high-quality consensus sequences can be generated efficiently with a SIMD-accelerated, partial-order alignment-based, stand-alone consensus module called Racon. Based on tests with PacBio and Oxford Nanopore data sets, we show that Racon coupled with miniasm enables consensus genomes with similar or better quality than state-of-the-art methods while being an order of magnitude faster.
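As a rough illustration of what a consensus-generation step does, the sketch below calls each column of a set of reads by majority vote. It assumes the reads have already been aligned into equal-length rows, with `-` marking gaps. This is a deliberately simplified stand-in and not Racon's SIMD-accelerated partial-order alignment; the function name `majority_consensus` and the toy reads are hypothetical.

```python
from collections import Counter

GAP = "-"  # assumed gap symbol in the pre-aligned rows


def majority_consensus(aligned_reads):
    """Column-wise majority vote over pre-aligned, equal-length reads.

    Simplified illustration only: real consensus modules such as Racon
    build a partial-order alignment graph instead of assuming the reads
    are already arranged in fixed columns.
    """
    if not aligned_reads:
        return ""
    length = len(aligned_reads[0])
    if any(len(read) != length for read in aligned_reads):
        raise ValueError("all aligned reads must have the same length")

    consensus = []
    for column in zip(*aligned_reads):
        # Pick the most frequent symbol in this column.
        symbol, _count = Counter(column).most_common(1)[0]
        # A column whose majority is a gap contributes no base.
        if symbol != GAP:
            consensus.append(symbol)
    return "".join(consensus)


if __name__ == "__main__":
    # Toy reads with one substitution, one insertion, and one deletion.
    reads = [
        "ACGT-ACGT",
        "ACGTTACGT",
        "ACCT-ACGT",
        "ACGT-AC-T",
    ]
    print(majority_consensus(reads))
```

In this toy input each error appears in only one read, so the vote in every column recovers the shared sequence. Errors in uncorrected long reads are dominated by insertions and deletions that shift columns, which is why graph-based methods like partial-order alignment are used in practice.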

Citing Articles

Contributions of interspecific hybrids to genetic variability in Glycyrrhiza uralensis and G. glabra.

Kim J, Lee J, Kang J, Shim H, Kang D, Lee S. Sci Rep. 2025; 15(1):8764.

PMID: 40082484 PMC: 11906797. DOI: 10.1038/s41598-025-92115-4.


Chromosome-level genome assembly of a specialist walnut pest Atrijuglans aristata.

Feng D, Sun C, Li Y, Gao Q, Wang G, Li H. Sci Data. 2025; 12(1):434.

PMID: 40075062 PMC: 11904212. DOI: 10.1038/s41597-025-04754-x.


A chromosomal-level genome assembly of Begonia fimbristipula (Begoniaceae).

Xiao T, Wang Z, Yan H. Sci Data. 2025; 12(1):429.

PMID: 40074751 PMC: 11904028. DOI: 10.1038/s41597-025-04768-5.


Improvements in RNA and DNA nanopore sequencing allow for rapid genetic characterization of avian influenza.

Perlas A, Reska T, Croville G, Tarres-Freixas F, Guerin J, Majo N. Virus Evol. 2025; 11(1):veaf010.

PMID: 40066328 PMC: 11892550. DOI: 10.1093/ve/veaf010.


The assembly and annotation of two teinturier grapevine varieties, Dakapo and Rubired.

Ritter E, Cochetel N, Minio A, Cousins P, Cantu D, Niederhuth C. GigaByte. 2025; 2025:gigabyte149.

PMID: 40065997 PMC: 11891882. DOI: 10.46471/gigabyte.149.


References
1. Berlin K, Koren S, Chin C, Drake J, Landolin J, Phillippy A. Assembling large genomes with single-molecule sequencing and locality-sensitive hashing. Nat Biotechnol. 2015; 33(6):623-30. DOI: 10.1038/nbt.3238.

2. Sosic M, Sikic M. Edlib: a C/C++ library for fast, exact sequence alignment using edit distance. Bioinformatics. 2017; 33(9):1394-1395. PMC: 5408825. DOI: 10.1093/bioinformatics/btw753.

3. Ye C, Ma Z. Sparc: a sparsity-based consensus algorithm for long erroneous sequencing reads. PeerJ. 2016; 4:e2016. PMC: 4906657. DOI: 10.7717/peerj.2016.

4. Sovic I, Krizanovic K, Skala K, Sikic M. Evaluation of hybrid and non-hybrid methods for de novo assembly of nanopore reads. Bioinformatics. 2016; 32(17):2582-9. DOI: 10.1093/bioinformatics/btw237.

5. Li H. Minimap and miniasm: fast mapping and de novo assembly for noisy long sequences. Bioinformatics. 2016; 32(14):2103-10. PMC: 4937194. DOI: 10.1093/bioinformatics/btw152.