» Articles » PMID: 37868032

Discovery of Repetitive Elements As Key Sequence Determinants of 3D Genome Folding

Overview
Journal Cell Genom
Date 2023 Oct 23
PMID 37868032
Authors
Affiliations
Soon will be listed here.
Abstract

Natural and experimental genetic variants can modify DNA loops and insulating boundaries to tune transcription, but it is unknown how sequence perturbations affect chromatin organization genome wide. We developed a deep-learning strategy to quantify the effect of any insertion, deletion, or substitution on chromatin contacts and systematically scored millions of synthetic variants. While most genetic manipulations have little impact, regions with CTCF motifs and active transcription are highly sensitive, as expected. Our unbiased screen and subsequent targeted experiments also point to noncoding RNA genes and several families of repetitive elements as CTCF-motif-free DNA sequences with particularly large effects on nearby chromatin interactions, sometimes exceeding the effects of CTCF sites and explaining interactions that lack CTCF. We anticipate that our disruption tracks may be of broad interest and utility as a measure of 3D genome sensitivity, and our computational strategies may serve as a template for biological inquiry with deep learning.

Citing Articles

Interpreting the CTCF-mediated sequence grammar of genome folding with AkitaV2.

Smaruj P, Kamulegeya F, Kelley D, Fudenberg G PLoS Comput Biol. 2025; 21(2):e1012824.

PMID: 39903776 PMC: 11828424. DOI: 10.1371/journal.pcbi.1012824.


De novo structural variants in autism spectrum disorder disrupt distal regulatory interactions of neuronal genes.

Gjoni K, Ren X, Everitt A, Shen Y, Pollard K bioRxiv. 2024; .

PMID: 39574698 PMC: 11580890. DOI: 10.1101/2024.11.06.621353.


An integrated view of the structure and function of the human 4D nucleome.

Dekker J, Oksuz B, Zhang Y, Wang Y, Minsk M, Kuang S bioRxiv. 2024; .

PMID: 39484446 PMC: 11526861. DOI: 10.1101/2024.09.17.613111.


Machine Learning Reveals the Diversity of Human 3D Chromatin Contact Patterns.

Gilbertson E, Brand C, McArthur E, Rinker D, Kuang S, Pollard K Mol Biol Evol. 2024; 41(10).

PMID: 39404010 PMC: 11523124. DOI: 10.1093/molbev/msae209.


Sequence-Based Machine Learning Reveals 3D Genome Differences between Bonobos and Chimpanzees.

Brand C, Kuang S, Gilbertson E, McArthur E, Pollard K, Webster T Genome Biol Evol. 2024; 16(11).

PMID: 39382451 PMC: 11579661. DOI: 10.1093/gbe/evae210.


References
1.
Avsec Z, Agarwal V, Visentin D, Ledsam J, Grabska-Barwinska A, Taylor K . Effective gene expression prediction from sequence by integrating long-range interactions. Nat Methods. 2021; 18(10):1196-1203. PMC: 8490152. DOI: 10.1038/s41592-021-01252-x. View

2.
Kentepozidou E, Aitken S, Feig C, Stefflova K, Ibarra-Soria X, Odom D . Clustered CTCF binding is an evolutionary mechanism to maintain topologically associating domains. Genome Biol. 2020; 21(1):5. PMC: 6945661. DOI: 10.1186/s13059-019-1894-x. View

3.
Diehl A, Ouyang N, Boyle A . Transposable elements contribute to cell and species-specific chromatin looping and gene regulation in mammalian genomes. Nat Commun. 2020; 11(1):1796. PMC: 7156512. DOI: 10.1038/s41467-020-15520-5. View

4.
Fudenberg G, Kelley D, Pollard K . Predicting 3D genome folding from DNA sequence with Akita. Nat Methods. 2020; 17(11):1111-1117. PMC: 8211359. DOI: 10.1038/s41592-020-0958-x. View

5.
Moore J, Purcaro M, Pratt H, Epstein C, Shoresh N, Adrian J . Expanded encyclopaedias of DNA elements in the human and mouse genomes. Nature. 2020; 583(7818):699-710. PMC: 7410828. DOI: 10.1038/s41586-020-2493-4. View