PMID: 38648214

Deep Mining of the Sequence Read Archive Reveals Major Genetic Innovations in Coronaviruses and Other Nidoviruses of Aquatic Vertebrates

Abstract

Virus discovery by genomics and metagenomics has empowered studies of viromes, facilitated characterization of pathogen epidemiology, and redefined our understanding of the natural genetic diversity of viruses with profound functional and structural implications. Here we employed a data-driven virus discovery approach that directly queries unprocessed sequencing data in a highly parallelized way and involves a targeted viral genome assembly strategy across a wide range of sequence similarity. By screening more than 269,000 datasets from numerous authors in the Sequence Read Archive and using two metrics that quantitatively assess assembly quality, we discovered 40 nidoviruses from six virus families whose members infect vertebrate hosts. They form 13 and 32 putative viral subfamilies and genera, respectively, and include 11 coronaviruses with bisegmented genomes from fishes and amphibians, a giant 36.1 kilobase coronavirus genome with a duplicated spike glycoprotein (S) gene, 11 tobaniviruses and 17 additional corona-, arteri-, cremega-, nanhypo- and nangoshaviruses. Genome segmentation emerged in a single evolutionary event in the monophyletic lineage encompassing the subfamily Pitovirinae. We recovered the bisegmented genome sequences of two coronaviruses from RNA samples of 69 infected fishes and validated the presence of poly(A) tails on both segments using 3'RACE PCR and subsequent Sanger sequencing. We report a genetic linkage between accessory and structural proteins whose phylogenetic relationships and evolutionary distances are incongruent with the phylogeny of replicase proteins. We rationalize these observations in a model of inter-family S recombination involving at least five ancestral corona- and tobaniviruses of aquatic hosts. In support of this model, we describe an individual fish co-infected with members of the families Coronaviridae and Tobaniviridae.
Our results expand the scale of the known extraordinary evolutionary plasticity in nidoviral genome architecture and call for revisiting fundamentals of genome expression, virus particle biology, host range and ecology of vertebrate nidoviruses.

Citing Articles

Giant RNA genomes: Roles of host, translation elongation, genome architecture, and proteome in nidoviruses.

Neuman B, Smart A, Gilmer O, Smyth R, Vaas J, Boker N. Proc Natl Acad Sci U S A. 2025; 122(7):e2413675122.

PMID: 39928875 PMC: 11848433. DOI: 10.1073/pnas.2413675122.


Genome sizes of animal RNA viruses reflect phylogenetic constraints.

Takada K, Holmes E. Virus Evol. 2025; 11(1):veaf005.

PMID: 39906303 PMC: 11792653. DOI: 10.1093/ve/veaf005.


Insect-specific Alphamesonivirus-1 in lymph node and lung tissues from two horses with acute respiratory syndrome.

Jurisic L, Auerswald H, Marcacci M, Di Giallonardo F, Coetzee L, Curini V. J Virol. 2025; 99(2):e0214424.

PMID: 39853116 PMC: 11852760. DOI: 10.1128/jvi.02144-24.


The protein structurome of Orthornavirae and its dark matter.

Mutz P, Camargo A, Sahakyan H, Neri U, Butkovic A, Wolf Y. mBio. 2024; 16(2):e0320024.

PMID: 39714180 PMC: 11796362. DOI: 10.1128/mbio.03200-24.


Insights into the RNA Virome of the Corn Leafhopper, a Major Emergent Threat of Maize in Latin America.

Debat H, Farrher E, Bejerman N. Viruses. 2024; 16(10).

PMID: 39459917 PMC: 11512364. DOI: 10.3390/v16101583.

