» Articles » PMID: 23221638

Integrative Annotation of Chromatin Elements from ENCODE Data

Abstract

The ENCODE Project has generated a wealth of experimental information mapping diverse chromatin properties in several human cell lines. Although each such data track is independently informative toward the annotation of regulatory elements, their interrelations contain much richer information for the systematic annotation of regulatory elements. To uncover these interrelations and to generate an interpretable summary of the massive datasets of the ENCODE Project, we apply unsupervised learning methodologies, converting dozens of chromatin datasets into discrete annotation maps of regulatory regions and other chromatin elements across the human genome. These methods rediscover and summarize diverse aspects of chromatin architecture, elucidate the interplay between chromatin activity and RNA transcription, and reveal that a large proportion of the genome lies in a quiescent state, even across multiple cell types. The resulting annotation of non-coding regulatory elements correlate strongly with mammalian evolutionary constraint, and provide an unbiased approach for evaluating metrics of evolutionary constraint in human. Lastly, we use the regulatory annotations to revisit previously uncharacterized disease-associated loci, resulting in focused, testable hypotheses through the lens of the chromatin landscape.

Citing Articles

Beyond Blacklists: A Critical Assessment of Exclusion Set Generation Strategies and Alternative Approaches.

Wall B, Ogata J, Nguyen M, McClay J, Harrell J, Dozmorov M bioRxiv. 2025; .

PMID: 39975128 PMC: 11839099. DOI: 10.1101/2025.02.06.636968.


Integrative pan-cancer analysis reveals a common architecture of dysregulated transcriptional networks characterized by loss of enhancer methylation.

Ankill J, Zhao Z, Tekpli X, Kure E, Kristensen V, Mathelier A PLoS Comput Biol. 2024; 20(11):e1012565.

PMID: 39556603 PMC: 11611269. DOI: 10.1371/journal.pcbi.1012565.


New MiniPromoter Ple389 (ADORA2A) drives selective expression in medium spiny neurons in mice and non-human primates.

de Moura Gomes A, L Petkau T, Korecki A, Fornes O, Galvan A, Lu G Sci Rep. 2024; 14(1):28194.

PMID: 39548191 PMC: 11568231. DOI: 10.1038/s41598-024-79004-y.


Cell tumbling enhances stem cell differentiation in hydrogels via nuclear mechanotransduction.

Ayushman M, Mikos G, Tong X, Sinha S, Lopez-Fuentes E, Jones S Nat Mater. 2024; 24(2):312-322.

PMID: 39487316 DOI: 10.1038/s41563-024-02038-0.


Entropic Graph-based Posterior Regularization.

Libbrecht M, Hoffman M, Bilmes J, Noble W Proc Int Conf Mach Learn. 2024; 37:1992-2001.

PMID: 39483441 PMC: 11526501.


References
1.
Zhang Y, Liu T, Meyer C, Eeckhoute J, Johnson D, Bernstein B . Model-based analysis of ChIP-Seq (MACS). Genome Biol. 2008; 9(9):R137. PMC: 2592715. DOI: 10.1186/gb-2008-9-9-r137. View

2.
Zhang S, Li Q, Liu J, Zhou X . A novel computational framework for simultaneous integration of multiple types of genomic data to identify microRNA-gene regulatory modules. Bioinformatics. 2011; 27(13):i401-9. PMC: 3117336. DOI: 10.1093/bioinformatics/btr206. View

3.
Garber M, Guttman M, Clamp M, Zody M, Friedman N, Xie X . Identifying novel constrained elements by exploiting biased substitution patterns. Bioinformatics. 2009; 25(12):i54-62. PMC: 2687944. DOI: 10.1093/bioinformatics/btp190. View

4.
Kim T, Hemberg M, Gray J, Costa A, Bear D, Wu J . Widespread transcription at neuronal activity-regulated enhancers. Nature. 2010; 465(7295):182-7. PMC: 3020079. DOI: 10.1038/nature09033. View

5.
Feingold E, Penny L, Nienhuis A, Forget B . An olfactory receptor gene is located in the extended human beta-globin gene cluster and is expressed in erythroid cells. Genomics. 1999; 61(1):15-23. DOI: 10.1006/geno.1999.5935. View