» Articles » PMID: 29348443

A General and Flexible Method for Signal Extraction from Single-cell RNA-seq Data

Overview
Journal Nat Commun
Specialty Biology
Date 2018 Jan 20
PMID 29348443
Citations 304
Authors
Affiliations
Soon will be listed here.
Abstract

Single-cell RNA-sequencing (scRNA-seq) is a powerful high-throughput technique that enables researchers to measure genome-wide transcription levels at the resolution of single cells. Because of the low amount of RNA present in a single cell, some genes may fail to be detected even though they are expressed; these genes are usually referred to as dropouts. Here, we present a general and flexible zero-inflated negative binomial model (ZINB-WaVE), which leads to low-dimensional representations of the data that account for zero inflation (dropouts), over-dispersion, and the count nature of the data. We demonstrate, with simulated and real data, that the model and its associated estimation procedure are able to give a more stable and accurate low-dimensional representation of the data than principal component analysis (PCA) and zero-inflated factor analysis (ZIFA), without the need for a preliminary normalization step.

Citing Articles

Cellular interactions within the immune microenvironment underpins resistance to cell cycle inhibition in breast cancers.

Griffiths J, Cosgrove P, Medina E, Nath A, Chen J, Adler F Nat Commun. 2025; 16(1):2132.

PMID: 40032842 PMC: 11876604. DOI: 10.1038/s41467-025-56279-x.


Dissecting tumor cell programs through group biology estimation in clinical single-cell transcriptomics.

Johri S, Bi K, Titchen B, Fu J, Conway J, Crowdis J Nat Commun. 2025; 16(1):2090.

PMID: 40025015 PMC: 11873288. DOI: 10.1038/s41467-025-57377-6.


Interpretable single-cell factor decomposition using sciRED.

Pouyabahar D, Andrews T, Bader G Nat Commun. 2025; 16(1):1878.

PMID: 39987196 PMC: 11846867. DOI: 10.1038/s41467-025-57157-2.


AsaruSim: a single-cell and spatial RNA-Seq Nanopore long-reads simulation workflow.

Hamraoui A, Jourdren L, Thomas-Chollier M Bioinformatics. 2025; 41(3).

PMID: 39985444 PMC: 11897429. DOI: 10.1093/bioinformatics/btaf087.


scRDiT: Generating Single-cell RNA-seq Data by Diffusion Transformers and Accelerating Sampling.

Dong S, Cui Z, Liu D, Lei J Interdiscip Sci. 2025; .

PMID: 39982678 DOI: 10.1007/s12539-025-00688-5.


References
1.
Tseng G, Wong W . Tight clustering: a resampling-based approach for identifying stable and tight patterns in data. Biometrics. 2005; 61(1):10-6. DOI: 10.1111/j.0006-341X.2005.031032.x. View

2.
Johnson W, Li C, Rabinovic A . Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostatistics. 2006; 8(1):118-27. DOI: 10.1093/biostatistics/kxj037. View

3.
Leek J, Storey J . Capturing heterogeneity in gene expression studies by surrogate variable analysis. PLoS Genet. 2007; 3(9):1724-35. PMC: 1994707. DOI: 10.1371/journal.pgen.0030161. View

4.
Robinson M, McCarthy D, Smyth G . edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2009; 26(1):139-40. PMC: 2796818. DOI: 10.1093/bioinformatics/btp616. View

5.
Bullard J, Purdom E, Hansen K, Dudoit S . Evaluation of statistical methods for normalization and differential expression in mRNA-Seq experiments. BMC Bioinformatics. 2010; 11:94. PMC: 2838869. DOI: 10.1186/1471-2105-11-94. View