» Articles » PMID: 18173853

Systematic Analysis of Transcribed Loci in ENCODE Regions Using RACE Sequencing Reveals Extensive Transcription in the Human Genome

Overview
Journal Genome Biol
Specialties Biology
Genetics
Date 2008 Jan 5
PMID 18173853
Citations 36
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Recent studies of the mammalian transcriptome have revealed a large number of additional transcribed regions and extraordinary complexity in transcript diversity. However, there is still much uncertainty regarding precisely what portion of the genome is transcribed, the exact structures of these novel transcripts, and the levels of the transcripts produced.

Results: We have interrogated the transcribed loci in 420 selected ENCyclopedia Of DNA Elements (ENCODE) regions using rapid amplification of cDNA ends (RACE) sequencing. We analyzed annotated known gene regions, but primarily we focused on novel transcriptionally active regions (TARs), which were previously identified by high-density oligonucleotide tiling arrays and on random regions that were not believed to be transcribed. We found RACE sequencing to be very sensitive and were able to detect low levels of transcripts in specific cell types that were not detectable by microarrays. We also observed many instances of sense-antisense transcripts; further analysis suggests that many of the antisense transcripts (but not all) may be artifacts generated from the reverse transcription reaction. Our results show that the majority of the novel TARs analyzed (60%) are connected to other novel TARs or known exons. Of previously unannotated random regions, 17% were shown to produce overlapping transcripts. Furthermore, it is estimated that 9% of the novel transcripts encode proteins.

Conclusion: We conclude that RACE sequencing is an efficient, sensitive, and highly accurate method for characterization of the transcriptome of specific cell/tissue types. Using this method, it appears that much of the genome is represented in polyA+ RNA. Moreover, a fraction of the novel RNAs can encode protein and are likely to be functional.

Citing Articles

Capturing the 'ome': the expanding molecular toolbox for RNA and DNA library construction.

Boone M, De Koker A, Callewaert N Nucleic Acids Res. 2018; 46(6):2701-2721.

PMID: 29514322 PMC: 5888575. DOI: 10.1093/nar/gky167.


Highly parallel direct RNA sequencing on an array of nanopores.

Garalde D, Snell E, Jachimowicz D, Sipos B, Lloyd J, Bruce M Nat Methods. 2018; 15(3):201-206.

PMID: 29334379 DOI: 10.1038/nmeth.4577.


Comparative Transcriptome Analysis Reveals Substantial Tissue Specificity in Human Aortic Valve.

Wang J, Wang Y, Gu W, Ni B, Sun H, Yu T Evol Bioinform Online. 2016; 12:175-84.

PMID: 27493474 PMC: 4968975. DOI: 10.4137/EBO.S37594.


Identification and analysis of the promoter region of the STGC3 gene.

Li S, Wang L, He X, Xie Y, Zhang Z Arch Med Sci. 2015; 11(5):1095-100.

PMID: 26528355 PMC: 4624735. DOI: 10.5114/aoms.2015.49213.


Building an RNA Sequencing Transcriptome of the Central Nervous System.

Dong X, You Y, Wu J Neuroscientist. 2015; 22(6):579-592.

PMID: 26463470 PMC: 4833695. DOI: 10.1177/1073858415610541.


References
1.
Kent W, Sugnet C, Furey T, Roskin K, Pringle T, Zahler A . The human genome browser at UCSC. Genome Res. 2002; 12(6):996-1006. PMC: 186604. DOI: 10.1101/gr.229102. View

2.
Gish W, States D . Identification of protein coding regions by database similarity search. Nat Genet. 1993; 3(3):266-72. DOI: 10.1038/ng0393-266. View

3.
Rinn J, Euskirchen G, Bertone P, Martone R, Luscombe N, Hartman S . The transcriptional activity of human Chromosome 22. Genes Dev. 2003; 17(4):529-40. PMC: 195998. DOI: 10.1101/gad.1055203. View

4.
Kao H, Porton B, Czernik A, Feng J, Yiu G, Haring M . A third member of the synapsin gene family. Proc Natl Acad Sci U S A. 1998; 95(8):4667-72. PMC: 22548. DOI: 10.1073/pnas.95.8.4667. View

5.
Vaquero C . Do natural antisense transcripts make sense in eukaryotes?. Gene. 1998; 211(1):1-9. DOI: 10.1016/s0378-1119(98)00093-6. View