» Articles » PMID: 27909575

A Step-by-step Workflow for Low-level Analysis of Single-cell RNA-seq Data with Bioconductor

Overview
Journal F1000Res
Date 2016 Dec 6
PMID 27909575
Citations 818
Authors
Affiliations
Soon will be listed here.
Abstract

Single-cell RNA sequencing (scRNA-seq) is widely used to profile the transcriptome of individual cells. This provides biological resolution that cannot be matched by bulk RNA sequencing, at the cost of increased technical noise and data complexity. The differences between scRNA-seq and bulk RNA-seq data mean that the analysis of the former cannot be performed by recycling bioinformatics pipelines for the latter. Rather, dedicated single-cell methods are required at various steps to exploit the cellular resolution while accounting for technical noise. This article describes a computational workflow for low-level analyses of scRNA-seq data, based primarily on software packages from the open-source Bioconductor project. It covers basic steps including quality control, data exploration and normalization, as well as more complex procedures such as cell cycle phase assignment, identification of highly variable and correlated genes, clustering into subpopulations and marker gene detection. Analyses were demonstrated on gene-level count data from several publicly available datasets involving haematopoietic stem cells, brain-derived cells, T-helper cells and mouse embryonic stem cells. This will provide a range of usage scenarios from which readers can construct their own analysis pipelines.

Citing Articles

Feature selection methods affect the performance of scRNA-seq data integration and querying.

Zappia L, Richter S, Ramirez-Suastegui C, Kfuri-Rubens R, Vornholz L, Wang W Nat Methods. 2025; .

PMID: 40082610 DOI: 10.1038/s41592-025-02624-3.


Cellular and molecular determinants mediating the dysregulated germinal center immune dynamics in systemic lupus erythematosus.

Georgakis S, Ioannidou K, Mora B, Orfanakis M, Brenna C, Muller Y Front Immunol. 2025; 16:1530327.

PMID: 40070830 PMC: 11894538. DOI: 10.3389/fimmu.2025.1530327.


Opportunities and challenges in the application of single-cell transcriptomics in plant tissue research.

Luo M, Cao Y, Hong J Physiol Mol Biol Plants. 2025; 31(2):199-209.

PMID: 40070535 PMC: 11890805. DOI: 10.1007/s12298-025-01558-6.


Geospatially informed representation of spatial genomics data with SpatialFeatureExperiment.

Moses L, Huseynov A, Rich J, Pachter L bioRxiv. 2025; .

PMID: 40060564 PMC: 11888365. DOI: 10.1101/2025.02.24.640007.


Longitudinal single cell profiling of epitope specific memory CD4+ T cell responses to recombinant zoster vaccine.

Wen X, Hu A, Presnell S, Ford E, Koelle D, Kwok W Nat Commun. 2025; 16(1):2332.

PMID: 40057520 PMC: 11890790. DOI: 10.1038/s41467-025-57562-7.


References
1.
Angerer P, Haghverdi L, Buttner M, Theis F, Marr C, Buettner F . destiny: diffusion maps for large-scale single-cell data in R. Bioinformatics. 2015; 32(8):1241-3. DOI: 10.1093/bioinformatics/btv715. View

2.
Phipson B, Smyth G . Permutation P-values should never be zero: calculating exact P-values when permutations are randomly drawn. Stat Appl Genet Mol Biol. 2010; 9:Article39. DOI: 10.2202/1544-6115.1585. View

3.
Pollen A, Nowakowski T, Shuga J, Wang X, Leyrat A, Lui J . Low-coverage single-cell mRNA sequencing reveals cellular heterogeneity and activated signaling pathways in developing cerebral cortex. Nat Biotechnol. 2014; 32(10):1053-8. PMC: 4191988. DOI: 10.1038/nbt.2967. View

4.
Leng N, Chu L, Barry C, Li Y, Choi J, Li X . Oscope identifies oscillatory genes in unsynchronized single-cell RNA-seq experiments. Nat Methods. 2015; 12(10):947-950. PMC: 4589503. DOI: 10.1038/nmeth.3549. View

5.
Law C, Chen Y, Shi W, Smyth G . voom: Precision weights unlock linear model analysis tools for RNA-seq read counts. Genome Biol. 2014; 15(2):R29. PMC: 4053721. DOI: 10.1186/gb-2014-15-2-r29. View