» Articles » PMID: 38659952

Protein Codes Promote Selective Subcellular Compartmentalization

Abstract

Cells have evolved mechanisms to distribute ~10 billion protein molecules to subcellular compartments where diverse proteins involved in shared functions must efficiently assemble. Here, we demonstrate that proteins with shared functions share amino acid sequence codes that guide them to compartment destinations. A protein language model, ProtGPS, was developed that predicts with high performance the compartment localization of human proteins excluded from the training set. ProtGPS successfully guided generation of novel protein sequences that selectively assemble in targeted subcellular compartments. ProtGPS also identified pathological mutations that change this code and lead to altered subcellular localization of proteins. Our results indicate that protein sequences contain not only a folding code, but also a previously unrecognized code governing their distribution in specific cellular compartments.

References
1.
Watson J, Juergens D, Bennett N, Trippe B, Yim J, Eisenach H . De novo design of protein structure and function with RFdiffusion. Nature. 2023; 620(7976):1089-1100. PMC: 10468394. DOI: 10.1038/s41586-023-06415-8. View

2.
Shin J, Riesselman A, Kollasch A, McMahon C, Simon E, Sander C . Protein design and variant prediction using autoregressive generative models. Nat Commun. 2021; 12(1):2403. PMC: 8065141. DOI: 10.1038/s41467-021-22732-w. View

3.
Ilik I, Malszycki M, Lubke A, Schade C, Meierhofer D, Aktas T . SON and SRRM2 are essential for nuclear speckle formation. Elife. 2020; 9. PMC: 7671692. DOI: 10.7554/eLife.60579. View

4.
Sabari B, DallAgnese A, Boija A, Klein I, Coffey E, Shrinivas K . Coactivator condensation at super-enhancers links phase separation and gene control. Science. 2018; 361(6400). PMC: 6092193. DOI: 10.1126/science.aar3958. View

5.
Banani S, Lee H, Hyman A, Rosen M . Biomolecular condensates: organizers of cellular biochemistry. Nat Rev Mol Cell Biol. 2017; 18(5):285-298. PMC: 7434221. DOI: 10.1038/nrm.2017.7. View