Rampant C→U Hypermutation in the Genomes of SARS-CoV-2 and Other Coronaviruses: Causes and Consequences for Their Short- and Long-Term Evolutionary Trajectories

Overview
Journal mSphere
Date 2020 Jun 26
PMID 32581081
Citations 151
Abstract

The pandemic of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has motivated an intensive analysis of its molecular epidemiology following its worldwide spread. To understand the early evolutionary events following its emergence, a data set of 985 complete SARS-CoV-2 sequences was assembled. Variants showed a mean of 5.5 to 9.5 nucleotide differences from each other, consistent with a midrange coronavirus substitution rate of 3 × 10⁻⁴ substitutions/site/year. Almost one-half of sequence changes were C→U transitions, with an 8-fold base-frequency-normalized directional asymmetry between C→U and U→C substitutions. Elevated ratios were observed in other recently emerged coronaviruses (SARS-CoV, Middle East respiratory syndrome [MERS]-CoV), and decreasing ratios were observed in other human coronaviruses (HCoV-NL63, -OC43, -229E, and -HKU1) proportionate to their increasing divergence. C→U transitions underpinned almost one-half of the amino acid differences between SARS-CoV-2 variants and occurred preferentially in both 5' U/A and 3' U/A flanking sequence contexts comparable to favored motifs of human APOBEC3 proteins. Marked base asymmetries observed in nonpandemic human coronaviruses (U ≫ A > G ≫ C) and low G+C contents may represent long-term effects of prolonged C→U hypermutation in their hosts. The evidence that much of the sequence change in SARS-CoV-2 and other coronaviruses may be driven by a host APOBEC-like editing process has profound implications for understanding their short- and long-term evolution. Repeated cycles of mutation and reversion in favored mutational hot spots and the widespread occurrence of amino acid changes with no adaptive value for the virus represent a quite different paradigm of virus sequence change from neutral and Darwinian evolutionary frameworks and are not incorporated by standard models used in molecular epidemiology investigations.
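The base-frequency-normalized asymmetry between C→U and U→C substitutions can be illustrated with a short sketch. It assumes the normalization is the ratio of C→U to U→C counts scaled by the ratio of U to C frequencies in a reference sequence; the abstract does not spell out the formula, and the function name and toy sequences below are hypothetical rather than taken from the study.

```python
from collections import Counter


def normalized_cu_asymmetry(reference: str, variants: list[str]) -> float:
    """Ratio of C->U to U->C changes, normalized by base composition.

    Assumed definition (not confirmed by the abstract):
        (n_CU / n_UC) * (f_U / f_C)
    where f_U and f_C are the counts of U and C in the reference.
    All sequences must already be aligned and of equal length.
    """
    ref = reference.upper().replace("T", "U")
    base_counts = Counter(ref)
    changes = Counter()
    for seq in variants:
        seq = seq.upper().replace("T", "U")
        if len(seq) != len(ref):
            raise ValueError("sequences must be aligned to the reference")
        # Record every position where the variant differs from the reference.
        for r, v in zip(ref, seq):
            if r != v:
                changes[(r, v)] += 1
    n_cu = changes[("C", "U")]
    n_uc = changes[("U", "C")]
    if n_uc == 0 or base_counts["C"] == 0:
        raise ValueError("ratio undefined: no U->C changes or no C in reference")
    return (n_cu / n_uc) * (base_counts["U"] / base_counts["C"])


# Toy data only: 4 C and 8 U in the reference, 3 C->U and 2 U->C changes.
toy_reference = "CCCCUUUUUUUUAG"
toy_variants = ["UCCCUUUUUUUUAG", "CUCCCUUUUUUUAG", "CCUCUUUUUCUUAG"]
toy_ratio = normalized_cu_asymmetry(toy_reference, toy_variants)  # 3.0
```

Counting differences directly against a reference, as done here, tallies a shared mutation once per sequence that carries it; an analysis that counts each change once on a phylogeny would give different numbers.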
The wealth of accurately curated sequence data for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), its long genome, and its low substitution rate provides a relatively blank canvas with which to investigate effects of mutational and editing processes imposed by the host cell. The finding that a large proportion of sequence change in SARS-CoV-2 in the initial months of the pandemic comprised C→U mutations in a host APOBEC-like context provides evidence for a potent host-driven antiviral editing mechanism against coronaviruses more often associated with antiretroviral defense. In evolutionary terms, the contribution of biased, convergent, and context-dependent mutations to sequence change in SARS-CoV-2 is substantial, and these processes are not incorporated by standard models used in molecular epidemiology investigations.
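The context dependence of C→U changes can be examined by tallying the bases immediately 5' and 3' of each mutated site. The following is a minimal sketch under the same assumptions of pre-aligned sequences and comparison against a single reference; the function name and toy sequences are hypothetical, and this is not presented as the method used in the study.

```python
from collections import Counter


def cu_flanking_contexts(reference: str, variants: list[str]):
    """Tally the 5' and 3' reference neighbours of every C->U change.

    Returns two Counters: (five_prime_bases, three_prime_bases).
    Sequences must already be aligned and of equal length.
    """
    ref = reference.upper().replace("T", "U")
    five_prime, three_prime = Counter(), Counter()
    for seq in variants:
        seq = seq.upper().replace("T", "U")
        if len(seq) != len(ref):
            raise ValueError("sequences must be aligned to the reference")
        for i, (r, v) in enumerate(zip(ref, seq)):
            if r == "C" and v == "U":
                # Neighbours are read from the reference, skipping the ends.
                if i > 0:
                    five_prime[ref[i - 1]] += 1
                if i < len(ref) - 1:
                    three_prime[ref[i + 1]] += 1
    return five_prime, three_prime


# Toy data only: C->U at positions 2, 5 and 8 of the reference (0-based).
toy_ref = "AUCAGCGUCU"
toy_vars = ["AUUAGCGUCU", "AUCAGUGUUU"]
five, three = cu_flanking_contexts(toy_ref, toy_vars)
```

A preference for particular contexts would only be meaningful after comparing these tallies with how often each neighbour flanks a C anywhere in the genome.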

Citing Articles

APOBEC3-Related Editing and Non-Editing Determinants of HIV-1 and HTLV-1 Restriction.

Leong S, Nasser H, Ikeda T Int J Mol Sci. 2025; 26(4).

PMID: 40004025 PMC: 11855278. DOI: 10.3390/ijms26041561.


A phylogenetic method identifies candidate drivers of the evolution of the SARS-CoV-2 mutation spectrum.

Corbett-Detig R bioRxiv. 2025.

PMID: 39896455 PMC: 11785018. DOI: 10.1101/2025.01.17.633662.


SARS-CoV-2 CoCoPUTs: analyzing GISAID and NCBI data to obtain codon statistics, mutations, and free energy over a multiyear period.

Padhiar N, Ghazanchyan T, Fumagalli S, DiCuccio M, Cohen G, Ginzburg A Virus Evol. 2025; 11(1):veae115.

PMID: 39882309 PMC: 11776705. DOI: 10.1093/ve/veae115.


The mutation rate of SARS-CoV-2 is highly variable between sites and is influenced by sequence context, genomic region, and RNA structure.

Haddox H, Angehrn G, Sesta L, Jennings-Shaffer C, Temple S, Galloway J bioRxiv. 2025.

PMID: 39829847 PMC: 11741320. DOI: 10.1101/2025.01.07.631013.


Adaptive evolution of SARS-CoV-2 during a persistent infection for 521 days in an immunocompromised patient.

Schmidt H, Schick L, Podlech J, Renzaho A, Lieb B, Diederich S NPJ Genom Med. 2025; 10(1):4.

PMID: 39820045 PMC: 11739519. DOI: 10.1038/s41525-025-00463-x.

