
Integrated Analysis of Experimental Data Sets Reveals Many Novel Promoters in 1% of the Human Genome

Overview
Journal: Genome Res
Specialty: Genetics
Date: 2007 Jun 15
PMID: 17567992
Citations: 19
Abstract

The regulation of transcriptional initiation in the human genome is a critical component of global gene regulation, but a complete catalog of human promoters currently does not exist. In order to identify regulatory regions, we developed four computational methods to integrate 129 sets of ENCODE-wide chromatin immunoprecipitation data. They collectively predicted 1393 regions. Roughly 47% of the regions were unique to one method, as each method makes different assumptions about the data. Overall, predicted regions tend to localize to highly conserved, DNase I hypersensitive, and actively transcribed regions in the genome. Interestingly, a significant portion of the regions overlaps with annotated 3'-UTRs, suggesting that some of them might regulate anti-sense transcription. The majority of the predicted regions are >2 kb away from the 5'-ends of previously annotated human cDNAs and hence are novel. These novel regions may regulate unannotated transcripts or may represent new alternative transcription start sites of known genes. We tested 163 such regions for promoter activity in four cell lines using transient transfection assays, and 25% of them showed transcriptional activity above background in at least one cell line. We also performed 5'-RACE experiments on 62 novel regions, and 76% of the regions were associated with the 5'-ends of at least two RACE products. Our results suggest that there are at least 35% more functional promoters in the human genome than currently annotated.
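The novelty criterion in the abstract (a predicted region counts as novel when it lies more than 2 kb from the 5'-end of any previously annotated cDNA) can be illustrated with a minimal sketch. This is not the authors' code: the function names, the `(chrom, start, end)` tuple layout, and the choice to measure distance from the region's nearest edge are all assumptions made for illustration, since the abstract does not specify how distance was computed.

```python
from bisect import bisect_left

# ">2 kb" threshold taken from the abstract; everything else here is hypothetical.
NOVELTY_DISTANCE_BP = 2000


def distance_to_nearest_tss(start, end, sorted_tss):
    """Distance in bp from the interval [start, end] to the nearest annotated
    5'-end in sorted_tss (ascending). Returns 0 if a 5'-end falls inside the
    interval and None if no 5'-ends are annotated."""
    if not sorted_tss:
        return None
    i = bisect_left(sorted_tss, start)  # index of first 5'-end >= start
    best = None
    if i < len(sorted_tss):
        tss = sorted_tss[i]
        best = 0 if tss <= end else tss - end
    if i > 0:
        left = start - sorted_tss[i - 1]  # nearest 5'-end upstream of start
        best = left if best is None else min(best, left)
    return best


def is_novel(region, tss_by_chrom, threshold=NOVELTY_DISTANCE_BP):
    """Assumed rule: novel if no annotated 5'-end lies within `threshold` bp."""
    chrom, start, end = region
    d = distance_to_nearest_tss(start, end, tss_by_chrom.get(chrom, []))
    return d is None or d > threshold


if __name__ == "__main__":
    # Toy annotation, not real data.
    annotated = {"chr1": [1000, 50000]}
    for region in [("chr1", 2500, 2900), ("chr1", 10000, 10500)]:
        print(region, "novel" if is_novel(region, annotated) else "known")
```

Sorting the annotated 5'-ends once per chromosome and using a binary search keeps each lookup logarithmic, which matters little at the scale of the ENCODE pilot regions but scales to genome-wide annotation sets.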

Citing Articles

Characterization of human pseudogene-derived non-coding RNAs for functional potential.

Guo X, Lin M, Rockowitz S, Lachman H, Zheng D. PLoS One. 2014; 9(4):e93972.

PMID: 24699680 PMC: 3974860. DOI: 10.1371/journal.pone.0093972.


Unravelling the hidden DNA structural/physical code provides novel insights on promoter location.

Duran E, Djebali S, Gonzalez S, Flores O, Mercader J, Guigo R. Nucleic Acids Res. 2013; 41(15):7220-30.

PMID: 23761436 PMC: 3753636. DOI: 10.1093/nar/gkt511.


Epigenetic regulation of human cis-natural antisense transcripts.

Conley A, Jordan I. Nucleic Acids Res. 2012; 40(4):1438-45.

PMID: 22371288 PMC: 3287164. DOI: 10.1093/nar/gkr1010.


A user's guide to the encyclopedia of DNA elements (ENCODE).

PLoS Biol. 2011; 9(4):e1001046.

PMID: 21526222 PMC: 3079585. DOI: 10.1371/journal.pbio.1001046.


Expression of distinct RNAs from 3' untranslated regions.

Mercer T, Wilhelm D, Dinger M, Solda G, Korbie D, Glazov E. Nucleic Acids Res. 2010; 39(6):2393-403.

PMID: 21075793 PMC: 3064787. DOI: 10.1093/nar/gkq1158.

