
VirMine: Automated Detection of Viral Sequences from Complex Metagenomic Samples

Overview
Journal: PeerJ
Date: 2019 Apr 18
PMID: 30993039
Citations: 25
Abstract

Metagenomics has enabled sequencing of viral communities from a myriad of different environments. Viral metagenomic studies routinely uncover sequences with no recognizable homology to known coding regions or genomes. Nevertheless, complete viral genomes have been constructed directly from complex community metagenomes, often through tedious manual curation. To address this, we developed the software tool virMine to identify viral genomes from raw reads representative of viral or mixed (viral and bacterial) communities. virMine automates sequence read quality control, assembly, and annotation. Researchers can easily refine their search for a specific study system and/or feature(s) of interest. In contrast to other viral genome detection tools that often rely on the recognition of viral signature sequences, virMine is not restricted by the insufficient representation of viral diversity in public data repositories. Rather, viral genomes are identified through an iterative approach, first omitting non-viral sequences. Thus, both relatives of previously characterized viruses and novel species can be detected, including both eukaryotic viruses and bacteriophages. Here we present virMine and its analysis of synthetic communities as well as metagenomic data sets from three distinctly different environments: the gut microbiota, the urinary microbiota, and freshwater viromes. Several new viral genomes were identified and annotated, thus contributing to our understanding of viral genetic diversity in these three environments.
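The exclusion-first idea in the abstract can be illustrated with a toy sketch. This is not virMine's code or interface: the `Contig` class, the `gene_hits` labels, and the "discard when non-viral hits outnumber viral hits" rule are all assumptions made for illustration. The point it shows is that a contig with no recognizable homology survives an exclusion filter, whereas a filter that requires a viral signature would discard it.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Contig:
    """Hypothetical assembled contig with one label per predicted coding region.

    Each label is the assumed best database hit for that region:
    "viral", "nonviral", or "none" (no recognizable homology).
    """
    name: str
    gene_hits: List[str]


def exclude_nonviral(contigs: List[Contig]) -> List[Contig]:
    """Keep every contig that is not dominated by non-viral hits.

    Assumed rule (illustrative only): a contig is discarded when its
    non-viral hits outnumber its viral hits. Contigs with no hits at all
    are kept, so putatively novel sequences remain candidates.
    """
    kept = []
    for contig in contigs:
        viral = contig.gene_hits.count("viral")
        nonviral = contig.gene_hits.count("nonviral")
        if nonviral > viral:
            continue  # looks bacterial or otherwise non-viral: drop it
        kept.append(contig)
    return kept


def require_viral_signature(contigs: List[Contig]) -> List[Contig]:
    """Contrast case: keep only contigs with at least one viral hit."""
    return [c for c in contigs if "viral" in c.gene_hits]


# Made-up example data, not taken from the paper.
example = [
    Contig("known_phage", ["viral", "viral", "nonviral"]),
    Contig("bacterial", ["nonviral", "nonviral", "none"]),
    Contig("novel", ["none", "none", "none"]),
]

by_exclusion = [c.name for c in exclude_nonviral(example)]
by_signature = [c.name for c in require_viral_signature(example)]
print("exclusion filter keeps:", by_exclusion)
print("signature filter keeps:", by_signature)
```

In this made-up data set the exclusion filter retains both the known phage and the contig with no database hits, while the signature filter retains only the known phage.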

Citing Articles

VITALdb: to select the best viroinformatics tools for a desired virus or application.

Koul M, Kaushik S, Singh K, Sharma D. Brief Bioinform. 2025; 26(2).

PMID: 40063348 PMC: 11892104. DOI: 10.1093/bib/bbaf084.


Blood virome research in myalgic encephalomyelitis/chronic fatigue syndrome: challenges and opportunities.

Obraitis D, Li D. Curr Opin Virol. 2024; 68-69:101437.

PMID: 39537445 PMC: 11795702. DOI: 10.1016/j.coviro.2024.101437.


Hecatomb: an integrated software platform for viral metagenomics.

Roach M, Beecroft S, Mihindukulasuriya K, Wang L, Paredes A, Cardenas L. Gigascience. 2024; 13.

PMID: 38832467 PMC: 11148595. DOI: 10.1093/gigascience/giae020.


Tips and tools to obtain and assess mosquito viromes.

Da Silva A, Bach E, Ellwanger J, Chies J. Arch Microbiol. 2024; 206(3):132.

PMID: 38436750 DOI: 10.1007/s00203-023-03813-4.


Benchmarking informatics approaches for virus discovery: caution is needed when combining identification methods.

Hegarty B, Riddell V J, Bastien E, Langenfeld K, Lindback M, Saini J. mSystems. 2024; 9(3):e0110523.

PMID: 38376167 PMC: 10949488. DOI: 10.1128/msystems.01105-23.

