» Articles » PMID: 17571346

Identification and Analysis of Functional Elements in 1% of the Human Genome by the ENCODE Pilot Project

Overview
Journal Nature
Specialty Science
Date 2007 Jun 16
PMID 17571346
Citations 2714
Authors
Affiliations
Soon will be listed here.
Abstract

We report the generation and analysis of functional data from multiple, diverse experiments performed on a targeted 1% of the human genome as part of the pilot phase of the ENCODE Project. These data have been further integrated and augmented by a number of evolutionary and computational analyses. Together, our results advance the collective knowledge about human genome function in several major areas. First, our studies provide convincing evidence that the genome is pervasively transcribed, such that the majority of its bases can be found in primary transcripts, including non-protein-coding transcripts, and those that extensively overlap one another. Second, systematic examination of transcriptional regulation has yielded new understanding about transcription start sites, including their relationship to specific regulatory sequences and features of chromatin accessibility and histone modification. Third, a more sophisticated view of chromatin structure has emerged, including its inter-relationship with DNA replication and transcriptional regulation. Finally, integration of these new sources of information, in particular with respect to mammalian evolution based on inter- and intra-species sequence comparisons, has yielded new mechanistic and evolutionary insights concerning the functional landscape of the human genome. Together, these studies are defining a path for pursuit of a more comprehensive characterization of human genome function.

Citing Articles

cfDNA hydroxymethylcytosine profiling for detection metastasis and recurrence of Esophageal Squamous Cell Carcinoma.

Kuerban S, Chen H, Chen L, Zhang L, Li X, Zhen B World J Surg Oncol. 2025; 23(1):90.

PMID: 40089765 DOI: 10.1186/s12957-025-03747-9.


The crosstalk between non-coding RNAs and oxidative stress in cancer progression.

Sun Q, Lei X, Yang X Genes Dis. 2025; 12(3):101286.

PMID: 40028033 PMC: 11870203. DOI: 10.1016/j.gendis.2024.101286.


Bidirectional Interplay Among Non-Coding RNAs, the Microbiome, and the Host During Development and Diseases.

Nai S, Song J, Su W, Liu X Genes (Basel). 2025; 16(2).

PMID: 40004537 PMC: 11855195. DOI: 10.3390/genes16020208.


Identifying Essential Hub Genes and circRNA-Regulated ceRNA Networks in Hepatocellular Carcinoma.

Yu X, Xu H, Xing Y, Sun D, Li D, Shi J Int J Mol Sci. 2025; 26(4).

PMID: 40003874 PMC: 11855757. DOI: 10.3390/ijms26041408.


Long Intergenic Non-Coding RNAs and in Breast Cancer Pathogenesis: Neighboring Companions or Nemeses?.

Fadebi O, Miya T, Khanyile R, Dlamini Z, Marima R Noncoding RNA. 2025; 11(1).

PMID: 39997609 PMC: 11857994. DOI: 10.3390/ncrna11010009.


References
1.
Trinklein N, Karaoz U, Wu J, Halees A, Force Aldred S, Collins P . Integrated analysis of experimental data sets reveals many novel promoters in 1% of the human genome. Genome Res. 2007; 17(6):720-31. PMC: 1891333. DOI: 10.1101/gr.5716607. View

2.
Ren B, Robert F, Wyrick J, Aparicio O, Jennings E, Simon I . Genome-wide location and function of DNA binding proteins. Science. 2000; 290(5500):2306-9. DOI: 10.1126/science.290.5500.2306. View

3.
Karnani N, Taylor C, Malhotra A, Dutta A . Pan-S replication patterns and chromosomal domains defined by genome-tiling arrays of ENCODE genomic areas. Genome Res. 2007; 17(6):865-76. PMC: 1891345. DOI: 10.1101/gr.5427007. View

4.
Zheng D, Zhang Z, Harrison P, Karro J, Carriero N, Gerstein M . Integrated pseudogene annotation for human chromosome 22: evidence for transcription. J Mol Biol. 2005; 349(1):27-45. DOI: 10.1016/j.jmb.2005.02.072. View

5.
Yusufzai T, Tagami H, Nakatani Y, Felsenfeld G . CTCF tethers an insulator to subnuclear sites, suggesting shared insulator mechanisms across species. Mol Cell. 2004; 13(2):291-8. DOI: 10.1016/s1097-2765(04)00029-2. View