» Articles » PMID: 32024661

The Full-length Transcriptome of Using Direct RNA Sequencing

Overview
Journal Genome Res
Specialty Genetics
Date 2020 Feb 7
PMID 32024661
Citations 56
Authors
Affiliations
Soon will be listed here.
Abstract

Current transcriptome annotations have largely relied on short read lengths intrinsic to the most widely used high-throughput cDNA sequencing technologies. For example, in the annotation of the transcriptome, more than half of the transcript isoforms lack full-length support and instead rely on inference from short reads that do not span the full length of the isoform. We applied nanopore-based direct RNA sequencing to characterize the developmental polyadenylated transcriptome of Taking advantage of long reads spanning the full length of mRNA transcripts, we provide support for 23,865 splice isoforms across 14,611 genes, without the need for computational reconstruction of gene models. Of the isoforms identified, 3452 are novel splice isoforms not present in the WormBase WS265 annotation. Furthermore, we identified 16,342 isoforms in the 3' untranslated region (3' UTR), 2640 of which are novel and do not fall within 10 bp of existing 3'-UTR data sets and annotations. Combining 3' UTRs and splice isoforms, we identified 28,858 full-length transcript isoforms. We also determined that poly(A) tail lengths of transcripts vary across development, as do the strengths of previously reported correlations between poly(A) tail length and expression level, and poly(A) tail length and 3'-UTR length. Finally, we have formatted this data as a publicly accessible track hub, enabling researchers to explore this data set easily in a genome browser.

Citing Articles

A systematic benchmark of Nanopore long-read RNA sequencing for transcript-level analysis in human cell lines.

Chen Y, Davidson N, Kei Wan Y, Yao F, Su Y, Gamaarachchi H Nat Methods. 2025; .

PMID: 40082608 DOI: 10.1038/s41592-025-02623-4.


Toward the use of nanopore RNA sequencing technologies in the clinic: challenges and opportunities.

Katopodi X, Begik O, Novoa E Nucleic Acids Res. 2025; 53(5).

PMID: 40057374 PMC: 11890063. DOI: 10.1093/nar/gkaf128.


CGC1, a new reference genome for .

Ichikawa K, Shoura M, Artiles K, Jeong D, Owa C, Kobayashi H bioRxiv. 2024; .

PMID: 39677790 PMC: 11643116. DOI: 10.1101/2024.12.04.626850.


GCRTcall: a transformer based basecaller for nanopore RNA sequencing enhanced by gated convolution and relative position embedding via joint loss training.

Li Q, Sun C, Wang D, Lou J Front Genet. 2024; 15:1443532.

PMID: 39649096 PMC: 11621211. DOI: 10.3389/fgene.2024.1443532.


Full-length direct RNA sequencing reveals extensive remodeling of RNA expression, processing and modification in aging Caenorhabditis elegans.

Schiksnis E, Nicastro I, Pasquinelli A Nucleic Acids Res. 2024; 52(22):13896-13913.

PMID: 39558169 PMC: 11662692. DOI: 10.1093/nar/gkae1064.


References
1.
Lim J, Lee M, Son A, Chang H, Kim V . mTAIL-seq reveals dynamic poly(A) tail regulation in oocyte-to-embryo development. Genes Dev. 2016; 30(14):1671-82. PMC: 4973296. DOI: 10.1101/gad.284802.116. View

2.
Lima S, Chipman L, Nicholson A, Chen Y, Yee B, Yeo G . Short poly(A) tails are a conserved feature of highly expressed genes. Nat Struct Mol Biol. 2017; 24(12):1057-1063. PMC: 5877826. DOI: 10.1038/nsmb.3499. View

3.
Trapnell C, Roberts A, Goff L, Pertea G, Kim D, Kelley D . Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat Protoc. 2012; 7(3):562-78. PMC: 3334321. DOI: 10.1038/nprot.2012.016. View

4.
Lee R, Feinbaum R, Ambros V . The C. elegans heterochronic gene lin-4 encodes small RNAs with antisense complementarity to lin-14. Cell. 1993; 75(5):843-54. DOI: 10.1016/0092-8674(93)90529-y. View

5.
Mangone M, Manoharan A, Thierry-Mieg D, Thierry-Mieg J, Han T, Mackowiak S . The landscape of C. elegans 3'UTRs. Science. 2010; 329(5990):432-5. PMC: 3142571. DOI: 10.1126/science.1191244. View