ScGen Predicts Single-cell Perturbation Responses
Overview
Pathology
Affiliations
Accurately modeling cellular response to perturbations is a central goal of computational biology. While such modeling has been based on statistical, mechanistic and machine learning models in specific settings, no generalization of predictions to phenomena absent from training data (out-of-sample) has yet been demonstrated. Here, we present scGen (https://github.com/theislab/scgen), a model combining variational autoencoders and latent space vector arithmetics for high-dimensional single-cell gene expression data. We show that scGen accurately models perturbation and infection response of cells across cell types, studies and species. In particular, we demonstrate that scGen learns cell-type and species-specific responses implying that it captures features that distinguish responding from non-responding genes and cells. With the upcoming availability of large-scale atlases of organs in a healthy state, we envision scGen to become a tool for experimental design through in silico screening of perturbation response in the context of disease and drug treatment.
Consequences of training data composition for deep learning models in single-cell biology.
Nadig A, Thoutam A, Hughes M, Gupta A, Navia A, Fusi N bioRxiv. 2025; .
PMID: 40060416 PMC: 11888162. DOI: 10.1101/2025.02.19.639127.
Wei S, Lu Y, Wang P, Li Q, Shuai J, Zhao Q J Transl Med. 2025; 23(1):264.
PMID: 40038714 PMC: 11877821. DOI: 10.1186/s12967-025-06263-2.
Cross-species imputation and comparison of single-cell transcriptomic profiles.
Zhang R, Yang M, Schreiber J, ODay D, Turner J, Shendure J Genome Biol. 2025; 26(1):40.
PMID: 40012008 PMC: 11863430. DOI: 10.1186/s13059-025-03493-x.
Rodov A, Baniadam H, Zeiser R, Amit I, Yosef N, Wertheimer T Eur J Immunol. 2025; 55(2):e202451234.
PMID: 39964048 PMC: 11834372. DOI: 10.1002/eji.202451234.
Leveraging prior knowledge to infer gene regulatory networks from single-cell RNA-sequencing data.
Stock M, Losert C, Zambon M, Popp N, Lubatti G, Hormanseder E Mol Syst Biol. 2025; 21(3):214-230.
PMID: 39939367 PMC: 11876610. DOI: 10.1038/s44320-025-00088-3.