Characterization of the Human ESC Transcriptome by Hybrid Sequencing
Overview
Authors
Affiliations
Although transcriptional and posttranscriptional events are detected in RNA-Seq data from second-generation sequencing, full-length mRNA isoforms are not captured. On the other hand, third-generation sequencing, which yields much longer reads, has current limitations of lower raw accuracy and throughput. Here, we combine second-generation sequencing and third-generation sequencing with a custom-designed method for isoform identification and quantification to generate a high-confidence isoform dataset for human embryonic stem cells (hESCs). We report 8,084 RefSeq-annotated isoforms detected as full-length and an additional 5,459 isoforms predicted through statistical inference. Over one-third of these are novel isoforms, including 273 RNAs from gene loci that have not previously been identified. Further characterization of the novel loci indicates that a subset is expressed in pluripotent cells but not in diverse fetal and adult tissues; moreover, their reduced expression perturbs the network of pluripotency-associated genes. Results suggest that gene identification, even in well-characterized human cell lines and tissues, is likely far from complete.
Challenges in identifying mRNA transcript starts and ends from long-read sequencing data.
Calvo-Roitberg E, Daniels R, Pai A Genome Res. 2024; 34(11):1719-1734.
PMID: 39567236 PMC: 11610588. DOI: 10.1101/gr.279559.124.
Belchikov N, Hsu J, Li X, Jarroux J, Hu W, Joglekar A Genome Res. 2024; 34(11):1735-1746.
PMID: 39567235 PMC: 11610585. DOI: 10.1101/gr.279640.124.
Ament I, DeBruyne N, Wang F, Lin L Mol Ther. 2024; 33(3):883-894.
PMID: 39563027 PMC: 11897757. DOI: 10.1016/j.ymthe.2024.11.025.
Contrasting and combining transcriptome complexity captured by short and long RNA sequencing reads.
Han S, Jewell S, Thomas-Tikhonenko A, Barash Y Genome Res. 2024; 34(10):1624-1635.
PMID: 39322279 PMC: 11529863. DOI: 10.1101/gr.278659.123.
Shahzad K, Zhang M, Mubeen I, Zhang X, Guo L, Qi T Funct Integr Genomics. 2024; 24(5):156.
PMID: 39230785 DOI: 10.1007/s10142-024-01420-0.