» Articles » PMID: 33450015

Non-B DNA: a Major Contributor to Small- and Large-scale Variation in Nucleotide Substitution Frequencies Across the Genome

Overview
Specialty Biochemistry
Date 2021 Jan 15
PMID 33450015
Citations 47
Authors
Affiliations
Soon will be listed here.
Abstract

Approximately 13% of the human genome can fold into non-canonical (non-B) DNA structures (e.g. G-quadruplexes, Z-DNA, etc.), which have been implicated in vital cellular processes. Non-B DNA also hinders replication, increasing errors and facilitating mutagenesis, yet its contribution to genome-wide variation in mutation rates remains unexplored. Here, we conducted a comprehensive analysis of nucleotide substitution frequencies at non-B DNA loci within noncoding, non-repetitive genome regions, their ±2 kb flanking regions, and 1-Megabase windows, using human-orangutan divergence and human single-nucleotide polymorphisms. Functional data analysis at single-base resolution demonstrated that substitution frequencies are usually elevated at non-B DNA, with patterns specific to each non-B DNA type. Mirror, direct and inverted repeats have higher substitution frequencies in spacers than in repeat arms, whereas G-quadruplexes, particularly stable ones, have higher substitution frequencies in loops than in stems. Several non-B DNA types also affect substitution frequencies in their flanking regions. Finally, non-B DNA explains more variation than any other predictor in multiple regression models for diversity or divergence at 1-Megabase scale. Thus, non-B DNA substantially contributes to variation in substitution frequencies at small and large scales. Our results highlight the role of non-B DNA in germline mutagenesis with implications to evolution and genetic diseases.

Citing Articles

Ribosomal DNA arrays are the most H-DNA rich element in the human genome.

Chantzi N, Chan C, Patsakis M, Nayak A, Montgomery A, Mouratidis I NAR Genom Bioinform. 2025; 7(1):lqaf012.

PMID: 40041207 PMC: 11879447. DOI: 10.1093/nargab/lqaf012.


Comparative analysis of predicted DNA secondary structures infers complex human centromere topology.

Chittoor S, Giunta S Am J Hum Genet. 2024; 111(12):2707-2719.

PMID: 39561771 PMC: 11639080. DOI: 10.1016/j.ajhg.2024.10.016.


Non-B DNA-informed mutation burden as a marker of treatment response and outcome in cancer.

Xu Q, Kowalski J Br J Cancer. 2024; 131(11):1825-1832.

PMID: 39427051 PMC: 11589871. DOI: 10.1038/s41416-024-02873-7.


Caspase-Activated DNase localizes to cancer causing translocation breakpoints during cell differentiation.

Alsowaida D, Larsen B, Hachmer S, Azimi M, Arezza E, Brunette S bioRxiv. 2024; .

PMID: 39386486 PMC: 11463586. DOI: 10.1101/2024.09.24.614809.


G-quadruplexes as pivotal components of cis-regulatory elements in the human genome.

Zhang R, Wang Y, Wang C, Sun X, Mergny J BMC Biol. 2024; 22(1):177.

PMID: 39183303 PMC: 11346177. DOI: 10.1186/s12915-024-01971-5.


References
1.
Lemmens B, van Schendel R, Tijsterman M . Mutagenic consequences of a single G-quadruplex demonstrate mitotic inheritance of DNA replication fork barriers. Nat Commun. 2015; 6:8909. PMC: 4654259. DOI: 10.1038/ncomms9909. View

2.
Bochman M, Paeschke K, Zakian V . DNA secondary structures: stability and function of G-quadruplex structures. Nat Rev Genet. 2012; 13(11):770-80. PMC: 3725559. DOI: 10.1038/nrg3296. View

3.
Duret L, Arndt P . The impact of recombination on nucleotide substitutions in the human genome. PLoS Genet. 2008; 4(5):e1000071. PMC: 2346554. DOI: 10.1371/journal.pgen.1000071. View

4.
Wilkins M, STOKES A, Wilson H . Molecular structure of deoxypentose nucleic acids. Nature. 1953; 171(4356):738-40. DOI: 10.1038/171738a0. View

5.
Miller W, Rosenbloom K, Hardison R, Hou M, Taylor J, Raney B . 28-way vertebrate alignment and conservation track in the UCSC Genome Browser. Genome Res. 2007; 17(12):1797-808. PMC: 2099589. DOI: 10.1101/gr.6761107. View