» Articles » PMID: 39651244

HyperMPNN-A General Strategy to Design Thermostable Proteins Learned from Hyperthermophiles

Overview
Journal bioRxiv
Date 2024 Dec 9
PMID 39651244
Authors
Affiliations
Soon will be listed here.
Abstract

Stability is a key factor to enable the use of recombinant proteins in therapeutic or biotechnological applications. Deep learning protein design approaches like ProteinMPNN have shown strong performance both in creating novel proteins or stabilizing existing ones. However, it is unlikely that the stability of the designs will significantly exceed that of the natural proteins in the training set, which are biophysically only marginally stable. Therefore, we collected predicted protein structures from hyperthermophiles, which differ substantially in their amino acid composition from mesophiles. Notably, ProteinMPNN fails to recover their unique amino acid composition. Here we show that a retrained network on predicted proteins from hyperthermophiles, termed HyperMPNN, not only recovers this unique amino acid composition but can also be applied to proteins from non-hyperthermophiles. Using this novel approach on a protein nanoparticle with a melting temperature of 65°C resulted in designs remaining stable at 95°C. In conclusion, we created a new way to design highly thermostable proteins through self-supervised learning on data from hyperthermophiles.

References
1.
Tsuboyama K, Dauparas J, Chen J, Laine E, Mohseni Behbahani Y, Weinstein J . Mega-scale experimental analysis of protein folding stability in biology and design. Nature. 2023; 620(7973):434-444. PMC: 10412457. DOI: 10.1038/s41586-023-06328-6. View

2.
Blaabjerg L, Kassem M, Good L, Jonsson N, Cagiada M, Johansson K . Rapid protein stability prediction using deep learning representations. Elife. 2023; 12. PMC: 10266766. DOI: 10.7554/eLife.82593. View

3.
. UniProt: the Universal Protein Knowledgebase in 2023. Nucleic Acids Res. 2022; 51(D1):D523-D531. PMC: 9825514. DOI: 10.1093/nar/gkac1052. View

4.
Chaudhury S, Lyskov S, Gray J . PyRosetta: a script-based interface for implementing molecular modeling algorithms using Rosetta. Bioinformatics. 2010; 26(5):689-91. PMC: 2828115. DOI: 10.1093/bioinformatics/btq007. View

5.
Jarzab A, Kurzawa N, Hopf T, Moerch M, Zecha J, Leijten N . Meltome atlas-thermal proteome stability across the tree of life. Nat Methods. 2020; 17(5):495-503. DOI: 10.1038/s41592-020-0801-4. View