» Articles » PMID: 29624415

A Beginner's Guide to Analysis of RNA Sequencing Data

Overview
Date 2018 Apr 7
PMID 29624415
Citations 64
Authors
Affiliations
Soon will be listed here.
Abstract

Since the first publications coining the term RNA-seq (RNA sequencing) appeared in 2008, the number of publications containing RNA-seq data has grown exponentially, hitting an all-time high of 2,808 publications in 2016 (PubMed). With this wealth of RNA-seq data being generated, it is a challenge to extract maximal meaning from these datasets, and without the appropriate skills and background, there is risk of misinterpretation of these data. However, a general understanding of the principles underlying each step of RNA-seq data analysis allows investigators without a background in programming and bioinformatics to critically analyze their own datasets as well as published data. Our goals in the present review are to break down the steps of a typical RNA-seq analysis and to highlight the pitfalls and checkpoints along the way that are vital for bench scientists and biomedical researchers performing experiments that use RNA-seq.

Citing Articles

Breast cancer prediction based on gene expression data using interpretable machine learning techniques.

Kallah-Dagadu G, Mohammed M, Nasejje J, Mchunu N, Twabi H, Batidzirai J Sci Rep. 2025; 15(1):7594.

PMID: 40038307 PMC: 11880515. DOI: 10.1038/s41598-025-85323-5.


Vortioxetine exhibits anti-glioblastoma activity via the PI3K-Akt signaling pathway.

Zhang H, Zhang D, Huang Z, Cheng J, Zhang C, Lin N Iran J Basic Med Sci. 2025; 28(4):401-408.

PMID: 39968089 PMC: 11831746. DOI: 10.22038/ijbms.2025.82513.17836.


Cold storage of human precision-cut lung slices in TiProtec preserves cellular composition and transcriptional responses and enables on-demand mechanistic studies.

Melo-Narvaez M, Golitz F, Jain E, Gote-Schniering J, Stoleriu M, Bertrams W Respir Res. 2025; 26(1):57.

PMID: 39962456 PMC: 11834602. DOI: 10.1186/s12931-025-03132-w.


A novel piperine derivative HJ-23 exhibits anti-colorectal cancer effects by activating the p53 pathway.

Zhang M, Liu R, Jiang W, Li H, Zhang S, Cheng W Naunyn Schmiedebergs Arch Pharmacol. 2024; .

PMID: 39718615 DOI: 10.1007/s00210-024-03707-2.


SRC kinase drives multidrug resistance induced by KRAS-G12C inhibition.

Song X, Zhou Z, Elmezayen A, Wu R, Yu C, Gao B Sci Adv. 2024; 10(50):eadq4274.

PMID: 39661665 PMC: 11633746. DOI: 10.1126/sciadv.adq4274.


References
1.
Anders S, Pyl P, Huber W . HTSeq--a Python framework to work with high-throughput sequencing data. Bioinformatics. 2014; 31(2):166-9. PMC: 4287950. DOI: 10.1093/bioinformatics/btu638. View

2.
Subramanian A, Tamayo P, Mootha V, Mukherjee S, Ebert B, Gillette M . Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A. 2005; 102(43):15545-50. PMC: 1239896. DOI: 10.1073/pnas.0506580102. View

3.
Robinson M, McCarthy D, Smyth G . edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2009; 26(1):139-40. PMC: 2796818. DOI: 10.1093/bioinformatics/btp616. View

4.
. Expansion of the Gene Ontology knowledgebase and resources. Nucleic Acids Res. 2016; 45(D1):D331-D338. PMC: 5210579. DOI: 10.1093/nar/gkw1108. View

5.
Winter D, Jung S, Amit I . Making the case for chromatin profiling: a new tool to investigate the immune-regulatory landscape. Nat Rev Immunol. 2015; 15(9):585-94. DOI: 10.1038/nri3884. View