» Articles » PMID: 40069486

Infusing Structural Assumptions into Dimensionality Reduction for Single-cell RNA Sequencing Data to Identify Small Gene Sets

Overview
Journal Commun Biol
Specialty Biology
Date 2025 Mar 12
PMID 40069486
Authors
Affiliations
Soon will be listed here.
Abstract

Dimensionality reduction greatly facilitates the exploration of cellular heterogeneity in single-cell RNA sequencing data. While most of such approaches are data-driven, it can be useful to incorporate biologically plausible assumptions about the underlying structure or the experimental design. We propose the boosting autoencoder (BAE) approach, which combines the advantages of unsupervised deep learning for dimensionality reduction and boosting for formalizing assumptions. Specifically, our approach selects small sets of genes that explain latent dimensions. As illustrative applications, we explore the diversity of neural cell identities and temporal patterns of embryonic development.

References
1.
Binder H, Schumacher M . Allowing for mandatory covariates in boosting estimation of sparse high-dimensional survival models. BMC Bioinformatics. 2008; 9:14. PMC: 2245904. DOI: 10.1186/1471-2105-9-14. View

2.
Tasic B, Menon V, Nguyen T, Kim T, Jarsky T, Yao Z . Adult mouse cortical cell taxonomy revealed by single cell transcriptomics. Nat Neurosci. 2016; 19(2):335-46. PMC: 4985242. DOI: 10.1038/nn.4216. View

3.
Treppner M, Binder H, Hess M . Interpretable generative deep learning: an illustration with single cell gene expression data. Hum Genet. 2022; 141(9):1481-1498. PMC: 9360114. DOI: 10.1007/s00439-021-02417-6. View

4.
Lu C, Dong L, Zhou H, Li Q, Huang G, Bai S . G-Protein-Coupled Receptor Gpr17 Regulates Oligodendrocyte Differentiation in Response to Lysolecithin-Induced Demyelination. Sci Rep. 2018; 8(1):4502. PMC: 5852120. DOI: 10.1038/s41598-018-22452-0. View

5.
Ashuach T, Gabitto M, Koodli R, Saldi G, Jordan M, Yosef N . MultiVI: deep generative model for the integration of multimodal data. Nat Methods. 2023; 20(8):1222-1231. PMC: 10406609. DOI: 10.1038/s41592-023-01909-9. View