» Articles » PMID: 39833164

Accelerated Enzyme Engineering by Machine-learning Guided Cell-free Expression

Overview
Journal Nat Commun
Date 2025 Jan 20
PMID 39833164
Authors
Affiliations
Soon will be listed here.
Abstract

Enzyme engineering is limited by the challenge of rapidly generating and using large datasets of sequence-function relationships for predictive design. To address this challenge, we develop a machine learning (ML)-guided platform that integrates cell-free DNA assembly, cell-free gene expression, and functional assays to rapidly map fitness landscapes across protein sequence space and optimize enzymes for multiple, distinct chemical reactions. We apply this platform to engineer amide synthetases by evaluating substrate preference for 1217 enzyme variants in 10,953 unique reactions. We use these data to build augmented ridge regression ML models for predicting amide synthetase variants capable of making 9 small molecule pharmaceuticals. Over these nine compounds, ML-predicted enzyme variants demonstrate 1.6- to 42-fold improved activity relative to the parent. Our ML-guided, cell-free framework promises to accelerate enzyme engineering by enabling iterative exploration of protein sequence space to build specialized biocatalysts in parallel.

Citing Articles

Single-Walled Carbon Nanotube Probes for Protease Characterization Directly in Cell-Free Expression Reactions.

Hejazi S, Godin R, Jurasic V, Reuel N bioRxiv. 2025; .

PMID: 39868320 PMC: 11760254. DOI: 10.1101/2025.01.11.632549.


Accelerated enzyme engineering by machine-learning guided cell-free expression.

Landwehr G, Bogart J, Magalhaes C, Hammarlund E, Karim A, Jewett M Nat Commun. 2025; 16(1):865.

PMID: 39833164 PMC: 11747319. DOI: 10.1038/s41467-024-55399-0.


Active learning-assisted directed evolution.

Yang J, Lal R, Bowden J, Astudillo R, Hameedi M, Kaur S Nat Commun. 2025; 16(1):714.

PMID: 39821082 PMC: 11739421. DOI: 10.1038/s41467-025-55987-8.

References
1.
Silverman A, Karim A, Jewett M . Cell-free gene expression: an expanded repertoire of applications. Nat Rev Genet. 2019; 21(3):151-170. DOI: 10.1038/s41576-019-0186-3. View

2.
Chu A, Lu T, Huang P . Sparks of function by de novo protein design. Nat Biotechnol. 2024; 42(2):203-215. PMC: 11366440. DOI: 10.1038/s41587-024-02133-2. View

3.
Schwander T, Schada von Borzyskowski L, Burgener S, Cortina N, Erb T . A synthetic pathway for the fixation of carbon dioxide in vitro. Science. 2016; 354(6314):900-904. PMC: 5892708. DOI: 10.1126/science.aah5237. View

4.
Hopf T, Ingraham J, Poelwijk F, Scharfe C, Springer M, Sander C . Mutation effects predicted from sequence co-variation. Nat Biotechnol. 2017; 35(2):128-135. PMC: 5383098. DOI: 10.1038/nbt.3769. View

5.
Biswas S, Khimulya G, Alley E, Esvelt K, Church G . Low-N protein engineering with data-efficient deep learning. Nat Methods. 2021; 18(4):389-396. DOI: 10.1038/s41592-021-01100-y. View