» Articles » PMID: 39243755

Imputing Abundance of over 2,500 Surface Proteins from Single-cell Transcriptomes with Context-agnostic Zero-shot Deep Ensembles

Overview
Journal Cell Syst
Publisher Cell Press
Date 2024 Sep 7
PMID 39243755
Authors
Affiliations
Soon will be listed here.
Abstract

Cell surface proteins serve as primary drug targets and cell identity markers. Techniques such as CITE-seq (cellular indexing of transcriptomes and epitopes by sequencing) have enabled the simultaneous quantification of surface protein abundance and transcript expression within individual cells. The published data have been utilized to train machine learning models for predicting surface protein abundance solely from transcript expression. However, the small scale of proteins predicted and the poor generalization ability of these computational approaches across diverse contexts (e.g., different tissues/disease states) impede their widespread adoption. Here, we propose SPIDER (surface protein prediction using deep ensembles from single-cell RNA sequencing), a context-agnostic zero-shot deep ensemble model, which enables large-scale protein abundance prediction and generalizes better to various contexts. Comprehensive benchmarking shows that SPIDER outperforms other state-of-the-art methods. Using the predicted surface abundance of >2,500 proteins from single-cell transcriptomes, we demonstrate the broad applications of SPIDER, including cell type annotation, biomarker/target identification, and cell-cell interaction analysis in hepatocellular carcinoma and colorectal cancer. A record of this paper's transparent peer review process is included in the supplemental information.

References
1.
Ma X, Somasundaram A, Qi Z, Hartman D, Singh H, Osmanbeyoglu H . SPaRTAN, a computational framework for linking cell-surface receptors to transcriptional regulators. Nucleic Acids Res. 2021; 49(17):9633-9647. PMC: 8464045. DOI: 10.1093/nar/gkab745. View

2.
Ahearn J, Fearon D . Structure and function of the complement receptors, CR1 (CD35) and CR2 (CD21). Adv Immunol. 1989; 46:183-219. DOI: 10.1016/s0065-2776(08)60654-9. View

3.
Vistain L, Tay S . Single-Cell Proteomics. Trends Biochem Sci. 2021; 46(8):661-672. PMC: 11697639. DOI: 10.1016/j.tibs.2021.01.013. View

4.
Inoue S, Leitner W, Golding B, Scott D . Inhibitory effects of B cells on antitumor immunity. Cancer Res. 2006; 66(15):7741-7. DOI: 10.1158/0008-5472.CAN-05-3766. View

5.
Zhao Y, Kilian C, Turner J, Bosurgi L, Roedl K, Bartsch P . Clonal expansion and activation of tissue-resident memory-like Th17 cells expressing GM-CSF in the lungs of severe COVID-19 patients. Sci Immunol. 2021; 6(56). PMC: 8128299. DOI: 10.1126/sciimmunol.abf6692. View