» Articles » PMID: 35885218

Differential Expression Analysis of Single-Cell RNA-Seq Data: Current Statistical Approaches and Outstanding Challenges

Overview
Journal Entropy (Basel)
Publisher MDPI
Date 2022 Jul 27
PMID 35885218
Authors
Affiliations
Soon will be listed here.
Abstract

With the advent of single-cell RNA-sequencing (scRNA-seq), it is possible to measure the expression dynamics of genes at the single-cell level. Through scRNA-seq, a huge amount of expression data for several thousand(s) of genes over million(s) of cells are generated in a single experiment. Differential expression analysis is the primary downstream analysis of such data to identify gene markers for cell type detection and also provide inputs to other secondary analyses. Many statistical approaches for differential expression analysis have been reported in the literature. Therefore, we critically discuss the underlying statistical principles of the approaches and distinctly divide them into six major classes, i.e., generalized linear, generalized additive, Hurdle, mixture models, two-class parametric, and non-parametric approaches. We also succinctly discuss the limitations that are specific to each class of approaches, and how they are addressed by other subsequent classes of approach. A number of challenges are identified in this study that must be addressed to develop the next class of innovative approaches. Furthermore, we also emphasize the methodological challenges involved in differential expression analysis of scRNA-seq data that researchers must address to draw maximum benefit from this recent single-cell technology. This study will serve as a guide to genome researchers and experimental biologists to objectively select options for their analysis.

Citing Articles

Pathway metrics accurately stratify T cells to their cells states.

Livne D, Efroni S BioData Min. 2024; 17(1):60.

PMID: 39716187 PMC: 11668091. DOI: 10.1186/s13040-024-00416-7.


Statistically principled feature selection for single cell transcriptomics.

Dollinger E, Silkwood K, Atwood S, Nie Q, Lander A bioRxiv. 2024; .

PMID: 39463971 PMC: 11507810. DOI: 10.1101/2024.10.11.617709.


Theoretical framework for the difference of two negative binomial distributions and its application in comparative analysis of sequencing data.

Petrany A, Chen R, Zhang S, Chen Y Genome Res. 2024; 34(10):1636-1650.

PMID: 39406498 PMC: 11529838. DOI: 10.1101/gr.278843.123.


Leveraging gene correlations in single cell transcriptomic data.

Silkwood K, Dollinger E, Gervin J, Atwood S, Nie Q, Lander A BMC Bioinformatics. 2024; 25(1):305.

PMID: 39294560 PMC: 11411778. DOI: 10.1186/s12859-024-05926-z.


Kernel-based testing for single-cell differential analysis.

Ozier-Lafontaine A, Fourneaux C, Durif G, Arsenteva P, Vallot C, Gandrillon O Genome Biol. 2024; 25(1):114.

PMID: 38702740 PMC: 11069218. DOI: 10.1186/s13059-024-03255-1.


References
1.
Birtwistle M, Rauch J, Kiyatkin A, Aksamitiene E, Dobrzynski M, Hoek J . Emergence of bimodal cell population responses from the interplay between analog single-cell signaling and protein expression noise. BMC Syst Biol. 2012; 6:109. PMC: 3484110. DOI: 10.1186/1752-0509-6-109. View

2.
Trapnell C, Cacchiarelli D, Grimsby J, Pokharel P, Li S, Morse M . The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells. Nat Biotechnol. 2014; 32(4):381-386. PMC: 4122333. DOI: 10.1038/nbt.2859. View

3.
Cui X, Churchill G . Statistical tests for differential expression in cDNA microarray experiments. Genome Biol. 2003; 4(4):210. PMC: 154570. DOI: 10.1186/gb-2003-4-4-210. View

4.
Satija R, Farrell J, Gennert D, Schier A, Regev A . Spatial reconstruction of single-cell gene expression data. Nat Biotechnol. 2015; 33(5):495-502. PMC: 4430369. DOI: 10.1038/nbt.3192. View

5.
Cui C, Shu W, Li P . Fluorescence In situ Hybridization: Cell-Based Genetic Diagnostic and Research Applications. Front Cell Dev Biol. 2016; 4:89. PMC: 5011256. DOI: 10.3389/fcell.2016.00089. View