Benchmarking Long-Read Assemblers for Genomic Analyses of Bacterial Pathogens Using Oxford Nanopore Sequencing
Oxford Nanopore sequencing can be used to obtain complete bacterial genomes. However, the error rates of Oxford Nanopore long reads are higher than those of Illumina short reads. Long-read assemblers using a variety of assembly algorithms have been developed to overcome this deficiency, but they have not been benchmarked for genomic analyses of bacterial pathogens using Oxford Nanopore long reads. In this study, six long-read assemblers (Canu, Flye, Miniasm/Racon, Raven, Redbean, and Shasta) were therefore benchmarked using Oxford Nanopore long reads of bacterial pathogens. Ten species were tested with mediocre- and low-quality simulated reads, and 10 species were tested with real reads. Raven was the most robust assembler, obtaining complete and accurate genomes. All Miniasm/Racon and Raven assemblies of mediocre-quality reads provided accurate antimicrobial resistance (AMR) profiles, whereas for low-quality reads a single Raven assembly was the only one with an accurate AMR profile among all assemblers and species. All assemblers performed well in predicting virulence genes from mediocre-quality and real reads, whereas only the Raven assemblies of low-quality reads had accurate numbers of virulence genes. For multilocus sequence typing (MLST), Miniasm/Racon was the most effective assembler for mediocre-quality reads; for low-quality reads, only the Raven assemblies of Escherichia coli O157:H7 and one other species gave positive MLST results. Miniasm/Racon and Raven were the best performers for MLST using real reads. The Miniasm/Racon and Raven assemblies also supported accurate phylogenetic inference. In the pan-genome analyses, Raven was the strongest assembler for simulated reads, whereas Miniasm/Racon and Raven performed best for real reads. Overall, the most robust and accurate assembler was Raven, closely followed by Miniasm/Racon.