» Articles » PMID: 39580482

An Automatic End-to-end Chemical Synthesis Development Platform Powered by Large Language Models

Overview
Journal Nat Commun
Specialty Biology
Date 2024 Nov 23
PMID 39580482
Authors
Affiliations
Soon will be listed here.
Abstract

The rapid emergence of large language model (LLM) technology presents promising opportunities to facilitate the development of synthetic reactions. In this work, we leveraged the power of GPT-4 to build an LLM-based reaction development framework (LLM-RDF) to handle fundamental tasks involved throughout the chemical synthesis development. LLM-RDF comprises six specialized LLM-based agents, including Literature Scouter, Experiment Designer, Hardware Executor, Spectrum Analyzer, Separation Instructor, and Result Interpreter, which are pre-prompted to accomplish the designated tasks. A web application with LLM-RDF as the backend was built to allow chemist users to interact with automated experimental platforms and analyze results via natural language, thus, eliminating the need for coding skills and ensuring accessibility for all chemists. We demonstrated the capabilities of LLM-RDF in guiding the end-to-end synthesis development process for the copper/TEMPO catalyzed aerobic alcohol oxidation to aldehyde reaction, including literature search and information extraction, substrate scope and condition screening, reaction kinetics study, reaction condition optimization, reaction scale-up and product purification. Furthermore, LLM-RDF's broader applicability and versability was validated on various synthesis tasks of three distinct reactions (SAr reaction, photoredox C-C cross-coupling reaction, and heterogeneous photoelectrochemical reaction).

Citing Articles

A review of large language models and autonomous agents in chemistry.

Ramos M, Collison C, White A Chem Sci. 2025; 16(6):2514-2572.

PMID: 39829984 PMC: 11739813. DOI: 10.1039/d4sc03921a.


Integrating Machine Learning and Large Language Models to Advance Exploration of Electrochemical Reactions.

Zheng Z, Florit F, Jin B, Wu H, Li S, Nandiwale K Angew Chem Int Ed Engl. 2024; 64(6):e202418074.

PMID: 39625837 PMC: 11795713. DOI: 10.1002/anie.202418074.

References
1.
DiMasi J, Grabowski H, Hansen R . Innovation in the pharmaceutical industry: New estimates of R&D costs. J Health Econ. 2016; 47:20-33. DOI: 10.1016/j.jhealeco.2016.01.012. View

2.
Feng F, Lai L, Pei J . Computational Chemical Synthesis Analysis and Pathway Design. Front Chem. 2018; 6:199. PMC: 5994992. DOI: 10.3389/fchem.2018.00199. View

3.
Molga K, Szymkuc S, Grzybowski B . Chemist Ex Machina: Advanced Synthesis Planning by Computers. Acc Chem Res. 2021; 54(5):1094-1106. DOI: 10.1021/acs.accounts.0c00714. View

4.
Andersson S, Armstrong A, Bjore A, Bowker S, Chapman S, Davies R . Making medicinal chemistry more effective--application of Lean Sigma to improve processes, speed and quality. Drug Discov Today. 2009; 14(11-12):598-604. DOI: 10.1016/j.drudis.2009.03.005. View

5.
Struble T, Alvarez J, Brown S, Chytil M, Cisar J, DesJarlais R . Current and Future Roles of Artificial Intelligence in Medicinal Chemistry Synthesis. J Med Chem. 2020; 63(16):8667-8682. PMC: 7457232. DOI: 10.1021/acs.jmedchem.9b02120. View