
Nonhybrid, Finished Microbial Genome Assemblies from Long-read SMRT Sequencing Data

Overview
Journal Nat Methods
Date 2013 May 7
PMID 23644548
Citations 2479
Abstract

We present a hierarchical genome-assembly process (HGAP) for high-quality de novo microbial genome assemblies using only a single, long-insert shotgun DNA library in conjunction with Single Molecule, Real-Time (SMRT) DNA sequencing. Our method uses the longest reads as seeds to recruit all other reads for construction of highly accurate preassembled reads through a directed acyclic graph-based consensus procedure, which we follow with assembly using off-the-shelf long-read assemblers. In contrast to hybrid approaches, HGAP does not require highly accurate raw reads for error correction. We demonstrate efficient genome assembly for several microorganisms using as few as three SMRT Cell zero-mode waveguide arrays of sequencing and for BACs using just one SMRT Cell. Long repeat regions can be successfully resolved with this workflow. We also describe a consensus algorithm that incorporates SMRT sequencing primary quality values to produce de novo genome sequence exceeding 99.999% accuracy.
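
The seed-selection idea in the abstract (use the longest reads as seeds) and the stated accuracy figure can be illustrated with a minimal sketch. This is not the HGAP implementation: the function names, the `seed_coverage` parameter, and its default of 30 are hypothetical choices made for illustration. The sketch assumes seeds are chosen by taking reads from longest to shortest until their total length reaches a target multiple of the estimated genome size. The second function applies the standard Phred relation, QV = -10 log10(1 - accuracy), under which 99.999% accuracy corresponds to roughly QV 50.

```python
import math


def select_seed_reads(read_lengths, genome_size, seed_coverage=30.0):
    """Pick the longest reads as seeds (illustrative, not the HGAP code).

    Reads are taken from longest to shortest until their summed length
    reaches genome_size * seed_coverage. Both parameters are assumptions
    of this sketch. Returns (length_cutoff, seed_lengths).
    """
    target_bases = genome_size * seed_coverage
    seeds = []
    total = 0
    for length in sorted(read_lengths, reverse=True):
        if total >= target_bases:
            break
        seeds.append(length)
        total += length
    # The shortest selected read defines the seed length cutoff.
    cutoff = seeds[-1] if seeds else 0
    return cutoff, seeds


def accuracy_to_phred(accuracy):
    """Convert a per-base accuracy (0 <= accuracy < 1) to a Phred-scaled QV."""
    return -10.0 * math.log10(1.0 - accuracy)


if __name__ == "__main__":
    # Toy numbers only; real read sets contain many thousands of reads.
    cutoff, seeds = select_seed_reads([1000, 5000, 3000, 8000, 2000],
                                      genome_size=1000, seed_coverage=10)
    print(cutoff, seeds)
    print(round(accuracy_to_phred(0.99999), 3))
```

In this sketch, reads shorter than the cutoff are not discarded; in the workflow the abstract describes, they would be the reads recruited to the seeds for the consensus step.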

Citing Articles

Chromosome-level genome assembly of Jaguar guapote (Parachromis manguensis) by massive parallel sequencing.

Cao J, Tong Y, Xiao Z, Chen H, Liu Z Sci Data. 2025; 12(1):411.

PMID: 40064893 PMC: 11894119. DOI: 10.1038/s41597-025-04752-z.


sp. nov., a novel marine bacterium isolated from the soft coral sp. at Oceanário de Lisboa in Portugal.

da Silva D, Marques M, Couceiro J, Santos E, Baylina N, Costa R Int J Syst Evol Microbiol. 2025; 75(3).

PMID: 40042989 PMC: 11883135. DOI: 10.1099/ijsem.0.006696.


Long-read genome sequencing reveals the sequence characteristics of pear self-incompatibility locus.

Gu C, Xu Y, Wu L, Wang X, Qi K, Qiao X Mol Hortic. 2025; 5(1):13.

PMID: 40022260 PMC: 11871771. DOI: 10.1186/s43897-024-00132-0.


Complete genome analysis of Bacillus velezensis HF-14,109 with potential for broad-spectrum antimicrobial activity and high enzyme-producing ability from common carp (Cyprinus carpio L.).

Zhang J, Wu J, Chen Y, Li X, Jia Y, Zhang X Mol Genet Genomics. 2025; 300(1):26.

PMID: 40011251 DOI: 10.1007/s00438-025-02229-7.


Genomic and pathogenicity analyses to identify the causative agent from multiple serogroups of non-O1, non-O139 in foodborne outbreaks.

Morita M, Hiyoshi H, Arakawa E, Izumiya H, Ohnishi M, Ogata K Microb Genom. 2025; 11(2).

PMID: 40009544 PMC: 11865499. DOI: 10.1099/mgen.0.001364.

