Characterization of the Human ESC Transcriptome by Hybrid Sequencing

Overview
Specialty: Science
Date: 2013 Nov 28
PMID: 24282307
Citations: 174
Abstract

Although transcriptional and posttranscriptional events are detected in RNA-Seq data from second-generation sequencing, full-length mRNA isoforms are not captured. On the other hand, third-generation sequencing, which yields much longer reads, has current limitations of lower raw accuracy and throughput. Here, we combine second-generation sequencing and third-generation sequencing with a custom-designed method for isoform identification and quantification to generate a high-confidence isoform dataset for human embryonic stem cells (hESCs). We report 8,084 RefSeq-annotated isoforms detected as full-length and an additional 5,459 isoforms predicted through statistical inference. Over one-third of these are novel isoforms, including 273 RNAs from gene loci that have not previously been identified. Further characterization of the novel loci indicates that a subset is expressed in pluripotent cells but not in diverse fetal and adult tissues; moreover, their reduced expression perturbs the network of pluripotency-associated genes. Results suggest that gene identification, even in well-characterized human cell lines and tissues, is likely far from complete.

Citing Articles

Challenges in identifying mRNA transcript starts and ends from long-read sequencing data.

Calvo-Roitberg E, Daniels R, Pai A. Genome Res. 2024; 34(11):1719-1734.

PMID: 39567236 PMC: 11610588. DOI: 10.1101/gr.279559.124.


Understanding isoform expression by pairing long-read sequencing with single-cell and spatial transcriptomics.

Belchikov N, Hsu J, Li X, Jarroux J, Hu W, Joglekar A. Genome Res. 2024; 34(11):1735-1746.

PMID: 39567235 PMC: 11610585. DOI: 10.1101/gr.279640.124.


Long-read RNA sequencing: A transformative technology for exploring transcriptome complexity in human diseases.

Ament I, DeBruyne N, Wang F, Lin L. Mol Ther. 2024; 33(3):883-894.

PMID: 39563027 PMC: 11897757. DOI: 10.1016/j.ymthe.2024.11.025.


Contrasting and combining transcriptome complexity captured by short and long RNA sequencing reads.

Han S, Jewell S, Thomas-Tikhonenko A, Barash Y. Genome Res. 2024; 34(10):1624-1635.

PMID: 39322279 PMC: 11529863. DOI: 10.1101/gr.278659.123.


Integrative analyses of long and short-read RNA sequencing reveal the spliced isoform regulatory network of seedling growth dynamics in upland cotton.

Shahzad K, Zhang M, Mubeen I, Zhang X, Guo L, Qi T. Funct Integr Genomics. 2024; 24(5):156.

PMID: 39230785 DOI: 10.1007/s10142-024-01420-0.

