» Articles » PMID: 28088763

Scater: Pre-processing, Quality Control, Normalization and Visualization of Single-cell RNA-seq Data in R

Overview
Journal Bioinformatics
Specialty Biology
Date 2017 Jan 16
PMID 28088763
Citations 813
Authors
Affiliations
Soon will be listed here.
Abstract

Motivation: Single-cell RNA sequencing (scRNA-seq) is increasingly used to study gene expression at the level of individual cells. However, preparing raw sequence data for further analysis is not a straightforward process. Biases, artifacts and other sources of unwanted variation are present in the data, requiring substantial time and effort to be spent on pre-processing, quality control (QC) and normalization.

Results: We have developed the R/Bioconductor package scater to facilitate rigorous pre-processing, quality control, normalization and visualization of scRNA-seq data. The package provides a convenient, flexible workflow to process raw sequencing reads into a high-quality expression dataset ready for downstream analysis. scater provides a rich suite of plotting tools for single-cell data and a flexible data structure that is compatible with existing tools and can be used as infrastructure for future software development.

Availability And Implementation: The open-source code, along with installation instructions, vignettes and case studies, is available through Bioconductor at http://bioconductor.org/packages/scater .

Contact: davis@ebi.ac.uk.

Supplementary Information: Supplementary data are available at Bioinformatics online.

Citing Articles

Transient inhibition of type I interferon enhances CD8+ T cell stemness and vaccine protection.

Broomfield B, Tan C, Qin R, Abberger H, Duckworth B, Alvarado C J Exp Med. 2025; 222(5).

PMID: 40062995 PMC: 11893171. DOI: 10.1084/jem.20241148.


An model for cardiac organoid production: The combined role of geometrical confinement and substrate stiffness.

Santoro R, Piacentini L, Vavassori C, Benzoni P, Colombo G, Banfi C Mater Today Bio. 2025; 31:101566.

PMID: 40061214 PMC: 11889630. DOI: 10.1016/j.mtbio.2025.101566.


Geospatially informed representation of spatial genomics data with SpatialFeatureExperiment.

Moses L, Huseynov A, Rich J, Pachter L bioRxiv. 2025; .

PMID: 40060564 PMC: 11888365. DOI: 10.1101/2025.02.24.640007.


Single-cell transcriptomic atlas of the human testis across the reproductive lifespan.

Cui L, Nie X, Guo Y, Ren P, Guo Y, Wang X Nat Aging. 2025; .

PMID: 40033047 DOI: 10.1038/s43587-025-00824-2.


Transcriptomic characterization of human lateral septum neurons reveals conserved and divergent marker genes across species.

Phillips 3rd R, Oh S, Bach S, Du Y, Miller R, Kleinman J iScience. 2025; 28(2):111820.

PMID: 39967863 PMC: 11834073. DOI: 10.1016/j.isci.2025.111820.


References
1.
Liao Y, Smyth G, Shi W . featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics. 2013; 30(7):923-30. DOI: 10.1093/bioinformatics/btt656. View

2.
Amir E, Davis K, Tadmor M, Simonds E, Levine J, Bendall S . viSNE enables visualization of high dimensional single-cell data and reveals phenotypic heterogeneity of leukemia. Nat Biotechnol. 2013; 31(6):545-52. PMC: 4076922. DOI: 10.1038/nbt.2594. View

3.
Kim J, Kolodziejczyk A, Ilicic T, Illicic T, Teichmann S, Marioni J . Characterizing noise structure in single-cell RNA-seq distinguishes genuine from technical stochastic allelic expression. Nat Commun. 2015; 6:8687. PMC: 4627577. DOI: 10.1038/ncomms9687. View

4.
Chikina M, Zaslavsky E, Sealfon S . CellCODE: a robust latent variable approach to differential expression analysis for heterogeneous cell populations. Bioinformatics. 2015; 31(10):1584-91. PMC: 4426841. DOI: 10.1093/bioinformatics/btv015. View

5.
Leek J . svaseq: removing batch effects and other unwanted noise from sequencing data. Nucleic Acids Res. 2014; 42(21). PMC: 4245966. DOI: 10.1093/nar/gku864. View