
Defining a Personal, Allele-specific, and Single-molecule Long-read Transcriptome

Overview
Specialty Science
Date 2014 Jun 26
PMID 24961374
Citations 140
Abstract

Personal transcriptomes in which all of an individual's genetic variants (e.g., single nucleotide variants) and transcript isoforms (transcription start sites, splice sites, and polyA sites) are defined and quantified for full-length transcripts are expected to be important for understanding individual biology and disease, but have not been described previously. To obtain such transcriptomes, we sequenced the lymphoblastoid transcriptomes of three family members (GM12878 and the parents GM12891 and GM12892) by using a Pacific Biosciences long-read approach complemented with Illumina 101-bp sequencing and made the following observations. First, we found that reads representing all splice sites of a transcript are evident for most sufficiently expressed genes ≤3 kb and often for genes longer than that. Second, we added and quantified previously unidentified splicing isoforms to an existing annotation, thus creating the first personalized annotation to our knowledge. Third, we determined SNVs in a de novo manner and connected them to RNA haplotypes, including HLA haplotypes, thereby assigning single full-length RNA molecules to their transcribed allele, and demonstrated Mendelian inheritance of RNA molecules. Fourth, we show how RNA molecules can be linked to personal variants on a one-by-one basis, which allows us to assess differential allelic expression (DAE) and differential allelic isoforms (DAI) from the phased full-length isoform reads. The DAI method is largely independent of the distance between exon and SNV, in contrast to fragmentation-based methods. Overall, in addition to improving eukaryotic transcriptome annotation, these results describe, to our knowledge, the first large-scale and full-length personal transcriptome.
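
The idea of linking single RNA molecules to personal variants and then testing for differential allelic expression can be illustrated with a minimal sketch. This is not the authors' pipeline: the function names, the input layout (one dictionary of observed bases per read), the majority-vote assignment rule, and the exact binomial test against a 50:50 expectation are all assumptions chosen for illustration.

```python
from math import comb


def assign_read(read_bases, phased_snvs):
    """Vote a read onto haplotype 'A' or 'B' from the bases it shows at phased SNVs.

    read_bases:  {position: base observed in the read}        (hypothetical layout)
    phased_snvs: {position: (base_on_hap_A, base_on_hap_B)}   (hypothetical layout)
    Returns 'A', 'B', or None when the read is uninformative or tied.
    """
    votes = {"A": 0, "B": 0}
    for pos, base in read_bases.items():
        if pos not in phased_snvs:
            continue  # position is not a known heterozygous SNV
        hap_a, hap_b = phased_snvs[pos]
        if base == hap_a:
            votes["A"] += 1
        elif base == hap_b:
            votes["B"] += 1
        # any other base is treated as a sequencing error and ignored
    if votes["A"] == votes["B"]:
        return None
    return "A" if votes["A"] > votes["B"] else "B"


def allelic_counts(reads, phased_snvs):
    """Count reads per haplotype for one gene, skipping unassignable reads."""
    counts = {"A": 0, "B": 0}
    for read in reads:
        hap = assign_read(read, phased_snvs)
        if hap is not None:
            counts[hap] += 1
    return counts


def binomial_two_sided_p(k, n, p=0.5):
    """Exact two-sided binomial p-value: total mass of outcomes no more likely than k."""
    pmf = [comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(n + 1)]
    threshold = pmf[k] * (1 + 1e-9)  # small tolerance for floating-point ties
    return min(1.0, sum(x for x in pmf if x <= threshold))


# Toy input: two heterozygous SNVs and five long reads from one gene.
phased_snvs = {100: ("G", "T"), 250: ("C", "A")}
reads = [
    {100: "G", 250: "C"},  # both SNVs support haplotype A
    {100: "G"},            # one SNV, supports A
    {250: "A"},            # one SNV, supports B
    {100: "T", 250: "C"},  # conflicting evidence, left unassigned
    {300: "A"},            # covers no phased SNV, left unassigned
]
counts = allelic_counts(reads, phased_snvs)
p_value = binomial_two_sided_p(counts["A"], counts["A"] + counts["B"])
```

Because each long read is assigned as a whole molecule, the same per-read haplotype labels could in principle be cross-tabulated against isoform labels to look at allele-specific isoform usage; that step is left out here.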

Citing Articles

Long-read RNA sequencing enables full-length chimeric transcript annotation of transposable elements in lung adenocarcinoma.

Li Y, Liu Y, Xie Y, Wang Y, Wang J, Wang H BMC Cancer. 2025; 25(1):482.

PMID: 40089719 DOI: 10.1186/s12885-025-13888-5.


Challenges in identifying mRNA transcript starts and ends from long-read sequencing data.

Calvo-Roitberg E, Daniels R, Pai A Genome Res. 2024; 34(11):1719-1734.

PMID: 39567236 PMC: 11610588. DOI: 10.1101/gr.279559.124.


Understanding isoform expression by pairing long-read sequencing with single-cell and spatial transcriptomics.

Belchikov N, Hsu J, Li X, Jarroux J, Hu W, Joglekar A Genome Res. 2024; 34(11):1735-1746.

PMID: 39567235 PMC: 11610585. DOI: 10.1101/gr.279640.124.


Genetic regulation of nascent RNA maturation revealed by direct RNA nanopore sequencing.

Choquet K, Chaumont L, Bache S, Baxter-Koenigs A, Churchman L bioRxiv. 2024.

PMID: 39257732 PMC: 11383983. DOI: 10.1101/2024.08.29.610338.


Targeted DNA-seq and RNA-seq of Reference Samples with Short-read and Long-read Sequencing.

Gong B, Li D, Labaj P, Pan B, Novoradovskaya N, Thierry-Mieg D Sci Data. 2024; 11(1):892.

PMID: 39152166 PMC: 11329654. DOI: 10.1038/s41597-024-03741-y.

