» Articles » PMID: 37192164

An Empirical Bayes Method for Differential Expression Analysis of Single Cells with Deep Generative Models

Overview
Specialty Science
Date 2023 May 16
PMID 37192164
Authors
Affiliations
Soon will be listed here.
Abstract

Detecting differentially expressed genes is important for characterizing subpopulations of cells. In scRNA-seq data, however, nuisance variation due to technical factors like sequencing depth and RNA capture efficiency obscures the underlying biological signal. Deep generative models have been extensively applied to scRNA-seq data, with a special focus on embedding cells into a low-dimensional latent space and correcting for batch effects. However, little attention has been paid to the problem of utilizing the uncertainty from the deep generative model for differential expression (DE). Furthermore, the existing approaches do not allow for controlling for effect size or the false discovery rate (FDR). Here, we present lvm-DE, a generic Bayesian approach for performing DE predictions from a fitted deep generative model, while controlling the FDR. We apply the lvm-DE framework to scVI and scSphere, two deep generative models. The resulting approaches outperform state-of-the-art methods at estimating the log fold change in gene expression levels as well as detecting differentially expressed genes between subpopulations of cells.

Citing Articles

Cross-species imputation and comparison of single-cell transcriptomic profiles.

Zhang R, Yang M, Schreiber J, ODay D, Turner J, Shendure J Genome Biol. 2025; 26(1):40.

PMID: 40012008 PMC: 11863430. DOI: 10.1186/s13059-025-03493-x.


Reduced circulating sphingolipids and activity are linked to T2D risk and impaired insulin secretion.

Khan S, Ye W, Van J, Singh I, Rabiee Y, Rodricks K Sci Adv. 2025; 11(2):eadr1725.

PMID: 39792658 PMC: 11790001. DOI: 10.1126/sciadv.adr1725.


Deep profiling deconstructs features associated with memory CD8 T cell tissue residence.

Scott M, Steier Z, Pierson M, Stolley J, OFlanagan S, Soerens A Immunity. 2024; 58(1):162-181.e10.

PMID: 39708817 PMC: 11852946. DOI: 10.1016/j.immuni.2024.11.007.


Considerations for building and using integrated single-cell atlases.

Hrovatin K, Sikkema L, Shitov V, Heimberg G, Shulman M, Oliver A Nat Methods. 2024; 22(1):41-57.

PMID: 39672979 DOI: 10.1038/s41592-024-02532-y.


VI-VS: calibrated identification of feature dependencies in single-cell multiomics.

Boyeau P, Bates S, Ergen C, Jordan M, Yosef N Genome Biol. 2024; 25(1):294.

PMID: 39548591 PMC: 11566124. DOI: 10.1186/s13059-024-03419-z.


References
1.
Lee J, Hyeon D, Hwang D . Single-cell multiomics: technologies and data analysis methods. Exp Mol Med. 2020; 52(9):1428-1442. PMC: 8080692. DOI: 10.1038/s12276-020-0420-2. View

2.
Wagner A, Regev A, Yosef N . Revealing the vectors of cellular identity with single-cell genomics. Nat Biotechnol. 2016; 34(11):1145-1160. PMC: 5465644. DOI: 10.1038/nbt.3711. View

3.
Squair J, Gautier M, Kathe C, Anderson M, James N, Hutson T . Confronting false discoveries in single-cell differential expression. Nat Commun. 2021; 12(1):5692. PMC: 8479118. DOI: 10.1038/s41467-021-25960-2. View

4.
Korthauer K, Chu L, Newton M, Li Y, Thomson J, Stewart R . A statistical approach for identifying differential distributions in single-cell RNA-seq experiments. Genome Biol. 2016; 17(1):222. PMC: 5080738. DOI: 10.1186/s13059-016-1077-y. View

5.
Luecken M, Buttner M, Chaichoompu K, Danese A, Interlandi M, Mueller M . Benchmarking atlas-level data integration in single-cell genomics. Nat Methods. 2021; 19(1):41-50. PMC: 8748196. DOI: 10.1038/s41592-021-01336-8. View