» Articles » PMID: 38472305

PIFiA: Self-supervised Approach for Protein Functional Annotation from Single-cell Imaging Data

Overview
Journal Mol Syst Biol
Specialty Molecular Biology
Date 2024 Mar 13
PMID 38472305
Authors
Affiliations
Soon will be listed here.
Abstract

Fluorescence microscopy data describe protein localization patterns at single-cell resolution and have the potential to reveal whole-proteome functional information with remarkable precision. Yet, extracting biologically meaningful representations from cell micrographs remains a major challenge. Existing approaches often fail to learn robust and noise-invariant features or rely on supervised labels for accurate annotations. We developed PIFiA (Protein Image-based Functional Annotation), a self-supervised approach for protein functional annotation from single-cell imaging data. We imaged the global yeast ORF-GFP collection and applied PIFiA to generate protein feature profiles from single-cell images of fluorescently tagged proteins. We show that PIFiA outperforms existing approaches for molecular representation learning and describe a range of downstream analysis tasks to explore the information content of the feature profiles. Specifically, we cluster extracted features into a hierarchy of functional organization, study cell population heterogeneity, and develop techniques to distinguish multi-localizing proteins and identify functional modules. Finally, we confirm new PIFiA predictions using a colocalization assay, suggesting previously unappreciated biological roles for several proteins. Paired with a fully interactive website ( https://thecellvision.org/pifia/ ), PIFiA is a resource for the quantitative analysis of protein organization within the cell.

Citing Articles

Capturing cell heterogeneity in representations of cell populations for image-based profiling using contrastive learning.

van Dijk R, Arevalo J, Babadi M, Carpenter A, Singh S PLoS Comput Biol. 2024; 20(11):e1012547.

PMID: 39527652 PMC: 11611260. DOI: 10.1371/journal.pcbi.1012547.


Capturing cell heterogeneity in representations of cell populations for image-based profiling using contrastive learning.

van Dijk R, Arevalo J, Babadi M, Carpenter A, Singh S bioRxiv. 2024; .

PMID: 39131344 PMC: 11312468. DOI: 10.1101/2023.11.14.567038.


Visual interpretability of bioimaging deep learning models.

Rotem O, Zaritsky A Nat Methods. 2024; 21(8):1394-1397.

PMID: 39122948 DOI: 10.1038/s41592-024-02322-6.


Anomaly detection for high-content image-based phenotypic cell profiling.

Shpigler A, Kolet N, Golan S, Weisbart E, Zaritsky A bioRxiv. 2024; .

PMID: 38895267 PMC: 11185510. DOI: 10.1101/2024.06.01.595856.


Expanding TheCellVision.org: a central repository for visualizing and mining high-content cell imaging projects.

Masinas M, Litsios A, Razdaibiedina A, Usaj M, Boone C, Andrews B Genetics. 2024; 227(1).

PMID: 38518223 PMC: 11075560. DOI: 10.1093/genetics/iyae044.

References
1.
Tkach J, Yimit A, Lee A, Riffle M, Costanzo M, Jaschob D . Dissecting DNA damage response pathways by analysing protein localization and abundance changes during DNA replication stress. Nat Cell Biol. 2012; 14(9):966-76. PMC: 3434236. DOI: 10.1038/ncb2549. View

2.
Haase S, Wittenberg C . Topology and control of the cell-cycle-regulated transcriptional circuitry. Genetics. 2014; 196(1):65-90. PMC: 3872199. DOI: 10.1534/genetics.113.152595. View

3.
Albert S, Schaffer M, Beck F, Mosalaganti S, Asano S, Thomas H . Proteasomes tether to two distinct sites at the nuclear pore complex. Proc Natl Acad Sci U S A. 2017; 114(52):13726-13731. PMC: 5748218. DOI: 10.1073/pnas.1716305114. View

4.
Harris M, Clark J, Ireland A, Lomax J, Ashburner M, Foulger R . The Gene Ontology (GO) database and informatics resource. Nucleic Acids Res. 2003; 32(Database issue):D258-61. PMC: 308770. DOI: 10.1093/nar/gkh036. View

5.
Thul P, Akesson L, Wiking M, Mahdessian D, Geladaki A, Blal H . A subcellular map of the human proteome. Science. 2017; 356(6340). DOI: 10.1126/science.aal3321. View