Self-supervised Deep Learning Encodes High-resolution Features of Protein Subcellular Localization

Overview
Journal: Nat Methods
Date: 2022 Jul 25
PMID: 35879608
Abstract

Explaining the diversity and complexity of protein localization is essential to fully understand cellular architecture. Here we present cytoself, a deep-learning approach for fully self-supervised protein localization profiling and clustering. Cytoself leverages a self-supervised training scheme that does not require preexisting knowledge, categories or annotations. Training cytoself on images of 1,311 endogenously labeled proteins from the OpenCell database reveals a highly resolved protein localization atlas that recapitulates major scales of cellular organization, from coarse classes, such as nuclear and cytoplasmic, to the subtle localization signatures of individual protein complexes. We quantitatively validate cytoself's ability to cluster proteins into organelles and protein complexes, showing that cytoself outperforms previous self-supervised approaches. Moreover, to better understand the inner workings of our model, we dissect the emergent features from which our clustering is derived, interpret them in the context of the fluorescence images, and analyze the performance contributions of each component of our approach.

Citing Articles

Evaluating feature extraction in ovarian cancer cell line co-cultures using deep neural networks.

Sharma O, Gudoityte G, Minozada R, Kallioniemi O, Turkki R, Paavolainen L. Commun Biol. 2025; 8(1):303.

PMID: 40000764 PMC: 11862010. DOI: 10.1038/s42003-025-07766-w.


Self-supervision advances morphological profiling by unlocking powerful image representations.

Kim V, Adaloglou N, Osterland M, Morelli F, Halawa M, Konig T. Sci Rep. 2025; 15(1):4876.

PMID: 39929956 PMC: 11811211. DOI: 10.1038/s41598-025-88825-4.


Addressing scalability and managing sparsity and dropout events in single-cell representation identification with ZIGACL.

Shi M, Li X. Brief Bioinform. 2025; 26(1).

PMID: 39775477 PMC: 11705091. DOI: 10.1093/bib/bbae703.


A highly efficient, scalable pipeline for fixed feature extraction from large-scale high-content imaging screens.

Comolet G, Bose N, Winchell J, Duren-Lubanski A, Rusielewicz T, Goldberg J. iScience. 2024; 27(12):111434.

PMID: 39720532 PMC: 11667173. DOI: 10.1016/j.isci.2024.111434.


AI: A transformative opportunity in cell biology.

Carr A, Cool J, Karaletsos T, Li D, Lowe A, Otte S. Mol Biol Cell. 2024; 35(12):pe4.

PMID: 39621362 PMC: 11656480. DOI: 10.1091/mbc.E24-09-0415.