» Articles » PMID: 21486745

Predicting Promoter Activities of Primary Human DNA Sequences

Overview
Specialty Biochemistry
Date 2011 Apr 14
PMID 21486745
Citations 5
Authors
Affiliations
Soon will be listed here.
Abstract

We developed a computer program that can predict the intrinsic promoter activities of primary human DNA sequences. We observed promoter activity using a quantitative luciferase assay and generated a prediction model using multiple linear regression. Our program achieved a prediction accuracy correlation coefficient of 0.87 between the predicted and observed promoter activities. We evaluated the prediction accuracy of the program using massive sequencing analysis of transcriptional start sites in vivo. We found that it is still difficult to predict transcript levels in a strictly quantitative manner in vivo; however, it was possible to select active promoters in a given cell from the other silent promoters. Using this program, we analyzed the transcriptional landscape of the entire human genome. We demonstrate that many human genomic regions have potential promoter activity, and the expression of some previously uncharacterized putatively non-protein-coding transcripts can be explained by our prediction model. Furthermore, we found that nucleosomes occasionally formed open chromatin structures with RNA polymerase II recruitment where the program predicted significant promoter activities, although no transcripts were observed.

Citing Articles

A new computational method to predict transcriptional activity of a DNA sequence from diverse datasets of massively parallel reporter assays.

Liu Y, Irie T, Yada T, Suzuki Y Nucleic Acids Res. 2017; 45(13):e124.

PMID: 28531296 PMC: 5737609. DOI: 10.1093/nar/gkx396.


Prediction of fine-tuned promoter activity from DNA sequence.

Siwo G, Rider A, Tan A, Pinapati R, Emrich S, Chawla N F1000Res. 2016; 5:158.

PMID: 27347373 PMC: 4916984. DOI: 10.12688/f1000research.7485.1.


Contribution of Sequence Motif, Chromatin State, and DNA Structure Features to Predictive Models of Transcription Factor Binding in Yeast.

Tsai Z, Shiu S, Tsai H PLoS Comput Biol. 2015; 11(8):e1004418.

PMID: 26291518 PMC: 4546298. DOI: 10.1371/journal.pcbi.1004418.


Computational promoter modeling identifies the modes of transcriptional regulation in hematopoietic stem cells.

Park S, Umemoto T, Saito-Adachi M, Shiratsuchi Y, Yamato M, Nakai K PLoS One. 2014; 9(4):e93853.

PMID: 24710559 PMC: 3977923. DOI: 10.1371/journal.pone.0093853.


Inferring gene expression from ribosomal promoter sequences, a crowdsourcing approach.

Meyer P, Siwo G, Zeevi D, Sharon E, Norel R, Segal E Genome Res. 2013; 23(11):1928-37.

PMID: 23950146 PMC: 3814892. DOI: 10.1101/gr.157420.113.

References
1.
Lieberman-Aiden E, van Berkum N, Williams L, Imakaev M, Ragoczy T, Telling A . Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science. 2009; 326(5950):289-93. PMC: 2858594. DOI: 10.1126/science.1181369. View

2.
Gray N, Wickens M . Control of translation initiation in animals. Annu Rev Cell Dev Biol. 1999; 14:399-458. DOI: 10.1146/annurev.cellbio.14.1.399. View

3.
Park P . ChIP-seq: advantages and challenges of a maturing technology. Nat Rev Genet. 2009; 10(10):669-80. PMC: 3191340. DOI: 10.1038/nrg2641. View

4.
Carninci P, Kasukawa T, Katayama S, Gough J, Frith M, Maeda N . The transcriptional landscape of the mammalian genome. Science. 2005; 309(5740):1559-63. DOI: 10.1126/science.1112014. View

5.
Suzuki Y, Sugano S . Construction of a full-length enriched and a 5'-end enriched cDNA library using the oligo-capping method. Methods Mol Biol. 2003; 221:73-91. DOI: 10.1385/1-59259-359-3:73. View