Machine-learning-guided Directed Evolution for Protein Engineering
Overview
Pathology
Authors
Affiliations
Protein engineering through machine-learning-guided directed evolution enables the optimization of protein functions. Machine-learning approaches predict how sequence maps to function in a data-driven manner without requiring a detailed model of the underlying physics or biological pathways. Such methods accelerate directed evolution by learning from the properties of characterized variants and using that information to select sequences that are likely to exhibit improved properties. Here we introduce the steps required to build machine-learning sequence-function models and to use those models to guide engineering, making recommendations at each stage. This review covers basic concepts relevant to the use of machine learning for protein engineering, as well as the current literature and applications of this engineering paradigm. We illustrate the process with two case studies. Finally, we look to future opportunities for machine learning to enable the discovery of unknown protein functions and uncover the relationship between protein sequence and function.
Kohout P, Vasina M, Majerova M, Novakova V, Damborsky J, Bednar D JACS Au. 2025; 5(2):838-850.
PMID: 40017771 PMC: 11862945. DOI: 10.1021/jacsau.4c01101.
Yee B, Ali N, Mohd-Naim N, Ahmed M Chem Bio Eng. 2025; 1(4):330-339.
PMID: 39974464 PMC: 11835143. DOI: 10.1021/cbe.3c00112.
Integrating protein language models and automatic biofoundry for enhanced protein evolution.
Zhang Q, Chen W, Qin M, Wang Y, Pu Z, Ding K Nat Commun. 2025; 16(1):1553.
PMID: 39934638 PMC: 11814318. DOI: 10.1038/s41467-025-56751-8.
Artificial Intelligence-Powered Materials Science.
Bai X, Zhang X Nanomicro Lett. 2025; 17(1):135.
PMID: 39912967 PMC: 11803041. DOI: 10.1007/s40820-024-01634-8.
Functional diversity and metabolic engineering of plant-specialized metabolites.
Zhou S, Ma Y, Shang Y, Qi X, Huang S, Li J Life Metab. 2025; 1(2):109-121.
PMID: 39872355 PMC: 11749740. DOI: 10.1093/lifemeta/loac019.