A Unified Model for Interpretable Latent Embedding of Multi-sample, Multi-condition Single-cell Data

Overview
Journal Nat Commun
Specialty Biology
Date 2024 Aug 3
PMID 39097589
Abstract

Single-cell analysis across multiple samples and conditions requires quantitative modeling of the interplay between the continuum of cell states and the technical and biological sources of sample-to-sample variability. We introduce GEDI, a generative model that identifies latent space variations in multi-sample, multi-condition single-cell datasets and attributes them to sample-level covariates. GEDI enables cross-sample cell state mapping on par with state-of-the-art integration methods, cluster-free differential gene expression analysis along the continuum of cell states, and machine learning-based prediction of sample characteristics from single-cell data. GEDI can also incorporate gene-level prior knowledge to infer pathway and regulatory network activities in single cells. Finally, GEDI extends all these concepts to previously unexplored modalities that require joint consideration of dual measurements, such as the joint analysis of exon inclusion/exclusion reads to model alternative cassette exon splicing, or spliced/unspliced reads to model the mRNA stability landscapes of single cells.
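
The "dual measurements" mentioned above (exon inclusion/exclusion reads, or spliced/unspliced reads) are pairs of counts per gene or exon per cell. One generic way to summarize such a pair is a pseudocount-smoothed log-ratio. The sketch below illustrates only that idea: the function name, the pseudocount of 0.5, and the read counts are hypothetical, and it should not be read as GEDI's actual model, which the abstract does not specify.

```python
import math


def paired_log_ratio(numerator_reads, denominator_reads, pseudocount=0.5):
    """Smoothed log-ratio of two paired read counts.

    The pseudocount (an assumed value) keeps the ratio finite when
    either count is zero.
    """
    return math.log(
        (numerator_reads + pseudocount) / (denominator_reads + pseudocount)
    )


# Hypothetical (inclusion, exclusion) read counts for one cassette exon
# in three cells; these numbers are made up for illustration.
cells = [(12, 4), (0, 9), (5, 5)]

# Positive values favor inclusion, negative values favor exclusion,
# and equal counts give exactly zero.
log_ratios = [paired_log_ratio(inc, exc) for inc, exc in cells]
```

The same function applies unchanged to spliced/unspliced counts; only the interpretation of the sign changes.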
