» Articles » PMID: 39446959

Scalable Protein Design Using Optimization in a Relaxed Sequence Space

Overview
Journal Science
Specialty Science
Date 2024 Oct 24
PMID 39446959
Authors
Affiliations
Soon will be listed here.
Abstract

Machine learning (ML)-based design approaches have advanced the field of de novo protein design, with diffusion-based generative methods increasingly dominating protein design pipelines. Here, we report a "hallucination"-based protein design approach that functions in relaxed sequence space, enabling the efficient design of high-quality protein backbones over multiple scales and with broad scope of application without the need for any form of retraining. We experimentally produced and characterized more than 100 proteins. Three high-resolution crystal structures and two cryo-electron microscopy density maps of designed single-chain proteins comprising up to 1000 amino acids validate the accuracy of the method. Our pipeline can also be used to design synthetic protein-protein interactions, as validated experimentally by a set of protein heterodimers. Relaxed sequence optimization offers attractive performance with respect to designability, scope of applicability for different design problems, and scalability across protein sizes.

Citing Articles

AlphaFold as a Prior: Experimental Structure Determination Conditioned on a Pretrained Neural Network.

Fadini A, Li M, McCoy A, Terwilliger T, Read R, Hekstra D bioRxiv. 2025; .

PMID: 40027838 PMC: 11870471. DOI: 10.1101/2025.02.18.638828.


SFP6 fluorescent probes for imaging SAM dynamics in living cells.

Zhang S, Li J, Cao G Mikrochim Acta. 2025; 192(3):180.

PMID: 39982573 DOI: 10.1007/s00604-025-07039-7.


De novo design of transmembrane fluorescence-activating proteins.

Zhu J, Liang M, Sun K, Wei Y, Guo R, Zhang L Nature. 2025; .

PMID: 39972138 DOI: 10.1038/s41586-025-08598-8.


SFCalculator: connecting deep generative models and crystallography.

Li M, Dalton K, Hekstra D bioRxiv. 2025; .

PMID: 39868231 PMC: 11760793. DOI: 10.1101/2025.01.12.632630.

References
1.
Watson J, Juergens D, Bennett N, Trippe B, Yim J, Eisenach H . De novo design of protein structure and function with RFdiffusion. Nature. 2023; 620(7976):1089-1100. PMC: 10468394. DOI: 10.1038/s41586-023-06415-8. View

2.
Roney J, Ovchinnikov S . State-of-the-Art Estimation of Protein Model Accuracy Using AlphaFold. Phys Rev Lett. 2022; 129(23):238101. DOI: 10.1103/PhysRevLett.129.238101. View

3.
Zhang Y, Skolnick J . Scoring function for automated assessment of protein structure template quality. Proteins. 2004; 57(4):702-10. DOI: 10.1002/prot.20264. View

4.
van Kempen M, Kim S, Tumescheit C, Mirdita M, Lee J, Gilchrist C . Fast and accurate protein structure search with Foldseek. Nat Biotechnol. 2023; 42(2):243-246. PMC: 10869269. DOI: 10.1038/s41587-023-01773-0. View

5.
Vazquez Torres S, Leung P, Venkatesh P, Lutz I, Hink F, Huynh H . De novo design of high-affinity binders of bioactive helical peptides. Nature. 2023; 626(7998):435-442. PMC: 10849960. DOI: 10.1038/s41586-023-06953-1. View