Discovery of Repetitive Elements As Key Sequence Determinants of 3D Genome Folding
Overview
Affiliations
Natural and experimental genetic variants can modify DNA loops and insulating boundaries to tune transcription, but it is unknown how sequence perturbations affect chromatin organization genome wide. We developed a deep-learning strategy to quantify the effect of any insertion, deletion, or substitution on chromatin contacts and systematically scored millions of synthetic variants. While most genetic manipulations have little impact, regions with CTCF motifs and active transcription are highly sensitive, as expected. Our unbiased screen and subsequent targeted experiments also point to noncoding RNA genes and several families of repetitive elements as CTCF-motif-free DNA sequences with particularly large effects on nearby chromatin interactions, sometimes exceeding the effects of CTCF sites and explaining interactions that lack CTCF. We anticipate that our disruption tracks may be of broad interest and utility as a measure of 3D genome sensitivity, and our computational strategies may serve as a template for biological inquiry with deep learning.
Interpreting the CTCF-mediated sequence grammar of genome folding with AkitaV2.
Smaruj P, Kamulegeya F, Kelley D, Fudenberg G PLoS Comput Biol. 2025; 21(2):e1012824.
PMID: 39903776 PMC: 11828424. DOI: 10.1371/journal.pcbi.1012824.
Gjoni K, Ren X, Everitt A, Shen Y, Pollard K bioRxiv. 2024; .
PMID: 39574698 PMC: 11580890. DOI: 10.1101/2024.11.06.621353.
An integrated view of the structure and function of the human 4D nucleome.
Dekker J, Oksuz B, Zhang Y, Wang Y, Minsk M, Kuang S bioRxiv. 2024; .
PMID: 39484446 PMC: 11526861. DOI: 10.1101/2024.09.17.613111.
Machine Learning Reveals the Diversity of Human 3D Chromatin Contact Patterns.
Gilbertson E, Brand C, McArthur E, Rinker D, Kuang S, Pollard K Mol Biol Evol. 2024; 41(10).
PMID: 39404010 PMC: 11523124. DOI: 10.1093/molbev/msae209.
Sequence-Based Machine Learning Reveals 3D Genome Differences between Bonobos and Chimpanzees.
Brand C, Kuang S, Gilbertson E, McArthur E, Pollard K, Webster T Genome Biol Evol. 2024; 16(11).
PMID: 39382451 PMC: 11579661. DOI: 10.1093/gbe/evae210.