MetaSim: a Sequencing Simulator for Genomics and Metagenomics
Overview
Affiliations
Background: The new research field of metagenomics is providing exciting insights into various, previously unclassified ecological systems. Next-generation sequencing technologies are producing a rapid increase of environmental data in public databases. There is great need for specialized software solutions and statistical methods for dealing with complex metagenome data sets.
Methodology/principal Findings: To facilitate the development and improvement of metagenomic tools and the planning of metagenomic projects, we introduce a sequencing simulator called MetaSim. Our software can be used to generate collections of synthetic reads that reflect the diverse taxonomical composition of typical metagenome data sets. Based on a database of given genomes, the program allows the user to design a metagenome by specifying the number of genomes present at different levels of the NCBI taxonomy, and then to collect reads from the metagenome using a simulation of a number of different sequencing technologies. A population sampler optionally produces evolved sequences based on source genomes and a given evolutionary tree.
Conclusions/significance: MetaSim allows the user to simulate individual read datasets that can be used as standardized test scenarios for planning sequencing projects or for benchmarking metagenomic software.
Kohnert E, Kreutz C F1000Res. 2025; 13:1180.
PMID: 39866725 PMC: 11757917. DOI: 10.12688/f1000research.155230.2.
Rocha U, Kasmanas J, Toscan R, Sanches D, Magnusdottir S, Saraiva J PLoS Comput Biol. 2024; 20(10):e1012530.
PMID: 39436938 PMC: 11530072. DOI: 10.1371/journal.pcbi.1012530.
Espindola A Biology (Basel). 2024; 13(9).
PMID: 39336128 PMC: 11428249. DOI: 10.3390/biology13090700.
Yin H, Wu S, Tan J, Guo Q, Li M, Guo J Gigascience. 2024; 13.
PMID: 38649300 PMC: 11034026. DOI: 10.1093/gigascience/giae018.
Boquila: NGS read simulator to eliminate read nucleotide bias in sequence analysis.
Akkose U, Adebali O Turk J Biol. 2023; 47(2):158-163.
PMID: 37529166 PMC: 10387831. DOI: 10.55730/1300-0152.2650.