» Articles » PMID: 36008967

Opportunities and Challenges of Data-Driven Virus Discovery

Overview
Journal Biomolecules
Publisher MDPI
Date 2022 Aug 26
PMID 36008967
Authors
Affiliations
Soon will be listed here.
Abstract

Virus discovery has been fueled by new technologies ever since the first viruses were discovered at the end of the 19th century. Starting with mechanical devices that provided evidence for virus presence in sick hosts, virus discovery gradually transitioned into a sequence-based scientific discipline, which, nowadays, can characterize virus identity and explore viral diversity at an unprecedented resolution and depth. Sequencing technologies are now being used routinely and at ever-increasing scales, producing an avalanche of novel viral sequences found in a multitude of organisms and environments. In this perspective article, we argue that virus discovery has started to undergo another transformation prompted by the emergence of new approaches that are sequence data-centered and primarily computational, setting them apart from previous technology-driven innovations. The data-driven virus discovery approach is largely uncoupled from the collection and processing of biological samples, and exploits the availability of massive amounts of publicly and freely accessible data from sequencing archives. We discuss open challenges to be solved in order to unlock the full potential of data-driven virus discovery, and we highlight the benefits it can bring to classical (mostly molecular) virology and molecular biology in general.

Citing Articles

Unveiling the genetic diversity of the genera Enamovirus and Polerovirus through data-driven virus discovery.

Sidharthan V, Reddy V, Krishnan N, Parameswari B Arch Virol. 2025; 170(4):76.

PMID: 40080166 DOI: 10.1007/s00705-025-06258-w.


Giant RNA genomes: Roles of host, translation elongation, genome architecture, and proteome in nidoviruses.

Neuman B, Smart A, Gilmer O, Smyth R, Vaas J, Boker N Proc Natl Acad Sci U S A. 2025; 122(7):e2413675122.

PMID: 39928875 PMC: 11848433. DOI: 10.1073/pnas.2413675122.


Vesicular Stomatitis Virus: Insights into Pathogenesis, Immune Evasion, and Technological Innovations in Oncolytic and Vaccine Development.

Ahmed M, Okesanya O, Ukoaka B, Ibrahim A, Lucero-Prisno 3rd D Viruses. 2025; 16(12.

PMID: 39772239 PMC: 11680291. DOI: 10.3390/v16121933.


The protein structurome of Orthornavirae and its dark matter.

Mutz P, Camargo A, Sahakyan H, Neri U, Butkovic A, Wolf Y mBio. 2024; 16(2):e0320024.

PMID: 39714180 PMC: 11796362. DOI: 10.1128/mbio.03200-24.


Identification of nine putative novel members of plant-infecting alphaflexiviruses in public domain plant transcriptomes.

Sravani B, Sidharthan V, Reddy V Virusdisease. 2024; 35(4):630-636.

PMID: 39677843 PMC: 11635070. DOI: 10.1007/s13337-024-00898-3.


References
1.
Roux S, Adriaenssens E, Dutilh B, Koonin E, Kropinski A, Krupovic M . Minimum Information about an Uncultivated Virus Genome (MIUViG). Nat Biotechnol. 2018; 37(1):29-37. PMC: 6871006. DOI: 10.1038/nbt.4306. View

2.
Tisza M, Pastrana D, Welch N, Stewart B, Peretti A, Starrett G . Discovery of several thousand highly diverse circular DNA viruses. Elife. 2020; 9. PMC: 7000223. DOI: 10.7554/eLife.51971. View

3.
Lauber C, Seifert M, Bartenschlager R, Seitz S . Discovery of highly divergent lineages of plant-associated astro-like viruses sheds light on the emergence of potyviruses. Virus Res. 2018; 260:38-48. DOI: 10.1016/j.virusres.2018.11.009. View

4.
Shi M, Lin X, Chen X, Tian J, Chen L, Li K . The evolutionary history of vertebrate RNA viruses. Nature. 2018; 556(7700):197-202. DOI: 10.1038/s41586-018-0012-7. View

5.
Mitra A, Skrzypczak M, Ginalski K, Rowicka M . Strategies for achieving high sequencing accuracy for low diversity samples and avoiding sample bleeding using illumina platform. PLoS One. 2015; 10(4):e0120520. PMC: 4393298. DOI: 10.1371/journal.pone.0120520. View