» Articles » PMID: 31305886

Visualization and Analysis of RNA-Seq Assembly Graphs

Overview
Specialty Biochemistry
Date 2019 Jul 16
PMID 31305886
Citations 1
Authors
Affiliations
Soon will be listed here.
Abstract

RNA-Seq is a powerful transcriptome profiling technology enabling transcript discovery and quantification. Whilst most commonly used for gene-level quantification, the data can be used for the analysis of transcript isoforms. However, when the underlying transcript assemblies are complex, current visualization approaches can be limiting, with splicing events a challenge to interpret. Here, we report on the development of a graph-based visualization method as a complementary approach to understanding transcript diversity from short-read RNA-Seq data. Following the mapping of reads to a reference genome, a read-to-read comparison is performed on all reads mapping to a given gene, producing a weighted similarity matrix between reads. This is used to produce an RNA assembly graph, where nodes represent reads and edges similarity scores between them. The resulting graphs are visualized in 3D space to better appreciate their sometimes large and complex topology, with other information being overlaid on to nodes, e.g. transcript models. Here we demonstrate the utility of this approach, including the unusual structure of these graphs and how they can be used to identify issues in assembly, repetitive sequences within transcripts and splice variants. We believe this approach has the potential to significantly improve our understanding of transcript complexity.

Citing Articles

Exploring RNA-Seq Data Analysis Through Visualization Techniques and Tools: A Systematic Review of Opportunities and Limitations for Clinical Applications.

Manzoor F, Tsurgeon C, Gupta V Bioengineering (Basel). 2025; 12(1).

PMID: 39851330 PMC: 11760846. DOI: 10.3390/bioengineering12010056.


Graphia: A platform for the graph-based visualisation and analysis of high dimensional data.

Freeman T, Horsewell S, Patir A, Harling-Lee J, Regan T, Shih B PLoS Comput Biol. 2022; 18(7):e1010310.

PMID: 35877685 PMC: 9352203. DOI: 10.1371/journal.pcbi.1010310.

References
1.
Han Y, Gao S, Muegge K, Zhang W, Zhou B . Advanced Applications of RNA Sequencing and Challenges. Bioinform Biol Insights. 2015; 9(Suppl 1):29-46. PMC: 4648566. DOI: 10.4137/BBI.S28991. View

2.
Bahrami-Samani E, Vo D, de Araujo P, Vogel C, Smith A, Penalva L . Computational challenges, tools, and resources for analyzing co- and post-transcriptional events in high throughput. Wiley Interdiscip Rev RNA. 2014; 6(3):291-310. PMC: 4397117. DOI: 10.1002/wrna.1274. View

3.
Hartley S, Mullikin J . Detection and visualization of differential splicing in RNA-Seq data with JunctionSeq. Nucleic Acids Res. 2016; 44(15):e127. PMC: 5009739. DOI: 10.1093/nar/gkw501. View

4.
PERRY S . Vertebrate tropomyosin: distribution, properties and function. J Muscle Res Cell Motil. 2001; 22(1):5-49. DOI: 10.1023/a:1010303732441. View

5.
Giotti B, Chen S, Barnett M, Regan T, Ly T, Wiemann S . Assembly of a parts list of the human mitotic cell cycle machinery. J Mol Cell Biol. 2018; 11(8):703-718. PMC: 6788831. DOI: 10.1093/jmcb/mjy063. View