» Articles » PMID: 39738068

Short Tandem Repeats Delineate Gene Bodies Across Eukaryotes

Overview
Journal Nat Commun
Specialty Biology
Date 2024 Dec 31
PMID 39738068
Authors
Affiliations
Soon will be listed here.
Abstract

Short tandem repeats (STRs) have emerged as important and hypermutable sites where genetic variation correlates with gene expression in plant and animal systems. Recently, it has been shown that a broad range of transcription factors (TFs) are affected by STRs near or in the DNA target binding site. Despite this, the distribution of STR motif repetitiveness in eukaryote genomes is still largely unknown. Here, we identify monomer and dimer STR motif repetitiveness in 5.1 billion 10-bp windows upstream of translation starts and downstream of translation stops in 25 million genes spanning 1270 species across the eukaryotic Tree of Life. We report that all surveyed genomes have gene-proximal shifts in motif repetitiveness. Within genomes, variation in gene-proximal repetitiveness landscapes correlated to the function of genes; genes with housekeeping functions were depleted in upstream and downstream repetitiveness. Furthermore, the repetitiveness landscapes correlated with TF binding sites, indicating that gene function has evolved in conjunction with cis-regulatory STRs and TFs that recognize repetitive sites. These results suggest that the hypermutability inherent to STRs is canalized along the genome sequence and contributes to regulatory and eco-evolutionary dynamics in all eukaryotes.

References
1.
Mirkin S . Expandable DNA repeats and human disease. Nature. 2007; 447(7147):932-40. DOI: 10.1038/nature05977. View

2.
Kolberg L, Raudvere U, Kuzmin I, Adler P, Vilo J, Peterson H . g:Profiler-interoperable web service for functional enrichment analysis and gene identifier mapping (2023 update). Nucleic Acids Res. 2023; 51(W1):W207-W212. PMC: 10320099. DOI: 10.1093/nar/gkad347. View

3.
Wanford J, Green L, Aidley J, Bayliss C . Phasome analysis of pathogenic and commensal Neisseria species expands the known repertoire of phase variable genes, and highlights common adaptive strategies. PLoS One. 2018; 13(5):e0196675. PMC: 5953494. DOI: 10.1371/journal.pone.0196675. View

4.
Olson D, Wheeler T . ULTRA: A Model Based Tool to Detect Tandem Repeats. ACM BCB. 2019; 2018:37-46. PMC: 6508075. DOI: 10.1145/3233547.3233604. View

5.
Martin F, Amode M, Aneja A, Austine-Orimoloye O, Azov A, Barnes I . Ensembl 2023. Nucleic Acids Res. 2022; 51(D1):D933-D941. PMC: 9825606. DOI: 10.1093/nar/gkac958. View