Minimus: a Fast, Lightweight Genome Assembler

Overview
Publisher Biomed Central
Specialty Biology
Date 2007 Feb 28
PMID 17324286
Citations 247
Abstract

Background: Genome assemblers have grown very large and complex in response to the need for algorithms to handle the challenges of large whole-genome sequencing projects. Many of the most common uses of assemblers, however, are best served by a simpler type of assembler that requires fewer software components, uses less memory, and is far easier to install and run.

Results: We developed the Minimus assembler to address these issues and tested it on a range of assembly problems. We show that Minimus performs well on several small assembly tasks, including the assembly of viral genomes, individual genes, and BAC clones. In addition, we evaluate Minimus' performance in assembling bacterial genomes to assess its suitability as a component of a larger assembly pipeline. We show that, compared with other software currently used for these tasks, Minimus produces significantly fewer assembly errors, at the cost of generating a more fragmented assembly.

Conclusion: We find that for small genomes and other small assembly tasks, Minimus is faster and far more flexible than existing tools. Due to its small size and modular design, Minimus is well suited to be a component of complex assembly pipelines. Minimus is released as an open-source software project, and the code is available as part of the AMOS project at SourceForge.

Citing Articles

Near telomere-to-telomere genome assembly of Mongolian cattle: implications for population genetic variation and beef quality.

Su R, Zhou H, Yang W, Moqir S, Ritu X, Liu L. Gigascience. 2024; 13.

PMID: 39693631 PMC: 11653892. DOI: 10.1093/gigascience/giae099.


The Statistics of Parametrized Syncmers in a Simple Mutation Process Without Spurious Matches.

Spouge J, Das P, Chen Y, Frith M. J Comput Biol. 2024; 31(12):1195-1210.

PMID: 39530391 PMC: 11698668. DOI: 10.1089/cmb.2024.0508.


Fine-scale genomic analysis of the tree endophyte sp. FPYF3050 producing monoterpene 1,8-cineole.

Zhou S, Dou G, Yan D. Microbiol Resour Announc. 2024; 13(11):e0119923.

PMID: 39320091 PMC: 11556029. DOI: 10.1128/mra.01199-23.


Genomic analysis of clinical isolates reveals genetic diversity but little evidence of genetic determinants for diarrhoeal disease.

Klemm E, Nisar M, Bawn M, Nasrin D, Qamar F, Page A. Microb Genom. 2024; 10(3).

PMID: 38451244 PMC: 10999740. DOI: 10.1099/mgen.0.001211.


Creating and Using Minimizer Sketches in Computational Genomics.

Zheng H, Marcais G, Kingsford C. J Comput Biol. 2023; 30(12):1251-1276.

PMID: 37646787 PMC: 11082048. DOI: 10.1089/cmb.2023.0094.

