PIFiA: Self-supervised Approach for Protein Functional Annotation from Single-cell Imaging Data
Overview
Authors
Affiliations
Fluorescence microscopy data describe protein localization patterns at single-cell resolution and have the potential to reveal whole-proteome functional information with remarkable precision. Yet, extracting biologically meaningful representations from cell micrographs remains a major challenge. Existing approaches often fail to learn robust and noise-invariant features or rely on supervised labels for accurate annotations. We developed PIFiA (Protein Image-based Functional Annotation), a self-supervised approach for protein functional annotation from single-cell imaging data. We imaged the global yeast ORF-GFP collection and applied PIFiA to generate protein feature profiles from single-cell images of fluorescently tagged proteins. We show that PIFiA outperforms existing approaches for molecular representation learning and describe a range of downstream analysis tasks to explore the information content of the feature profiles. Specifically, we cluster extracted features into a hierarchy of functional organization, study cell population heterogeneity, and develop techniques to distinguish multi-localizing proteins and identify functional modules. Finally, we confirm new PIFiA predictions using a colocalization assay, suggesting previously unappreciated biological roles for several proteins. Paired with a fully interactive website ( https://thecellvision.org/pifia/ ), PIFiA is a resource for the quantitative analysis of protein organization within the cell.
van Dijk R, Arevalo J, Babadi M, Carpenter A, Singh S PLoS Comput Biol. 2024; 20(11):e1012547.
PMID: 39527652 PMC: 11611260. DOI: 10.1371/journal.pcbi.1012547.
van Dijk R, Arevalo J, Babadi M, Carpenter A, Singh S bioRxiv. 2024; .
PMID: 39131344 PMC: 11312468. DOI: 10.1101/2023.11.14.567038.
Visual interpretability of bioimaging deep learning models.
Rotem O, Zaritsky A Nat Methods. 2024; 21(8):1394-1397.
PMID: 39122948 DOI: 10.1038/s41592-024-02322-6.
Anomaly detection for high-content image-based phenotypic cell profiling.
Shpigler A, Kolet N, Golan S, Weisbart E, Zaritsky A bioRxiv. 2024; .
PMID: 38895267 PMC: 11185510. DOI: 10.1101/2024.06.01.595856.
Masinas M, Litsios A, Razdaibiedina A, Usaj M, Boone C, Andrews B Genetics. 2024; 227(1).
PMID: 38518223 PMC: 11075560. DOI: 10.1093/genetics/iyae044.