Non-B DNA: a Major Contributor to Small- and Large-scale Variation in Nucleotide Substitution Frequencies Across the Genome
Overview
Approximately 13% of the human genome can fold into non-canonical (non-B) DNA structures (e.g. G-quadruplexes and Z-DNA), which have been implicated in vital cellular processes. Non-B DNA also hinders replication, increasing errors and facilitating mutagenesis, yet its contribution to genome-wide variation in mutation rates remains unexplored. Here, we conducted a comprehensive analysis of nucleotide substitution frequencies at non-B DNA loci within noncoding, non-repetitive genome regions, their ±2 kb flanking regions, and 1-Megabase windows, using human-orangutan divergence and human single-nucleotide polymorphisms. Functional data analysis at single-base resolution demonstrated that substitution frequencies are usually elevated at non-B DNA, with patterns specific to each non-B DNA type. Mirror, direct, and inverted repeats have higher substitution frequencies in spacers than in repeat arms, whereas G-quadruplexes, particularly stable ones, have higher substitution frequencies in loops than in stems. Several non-B DNA types also affect substitution frequencies in their flanking regions. Finally, non-B DNA explains more variation than any other predictor in multiple regression models for diversity or divergence at the 1-Megabase scale. Thus, non-B DNA substantially contributes to variation in substitution frequencies at both small and large scales. Our results highlight the role of non-B DNA in germline mutagenesis, with implications for evolution and genetic diseases.