» Articles » PMID: 37816837

The Future Landscape of Large Language Models in Medicine

Abstract

Large language models (LLMs) are artificial intelligence (AI) tools specifically trained to process and generate text. LLMs attracted substantial public attention after OpenAI's ChatGPT was made publicly available in November 2022. LLMs can often answer questions, summarize, paraphrase and translate text on a level that is nearly indistinguishable from human capabilities. The possibility to actively interact with models like ChatGPT makes LLMs attractive tools in various fields, including medicine. While these models have the potential to democratize medical knowledge and facilitate access to healthcare, they could equally distribute misinformation and exacerbate scientific misconduct due to a lack of accountability and transparency. In this article, we provide a systematic and comprehensive overview of the potentials and limitations of LLMs in clinical practice, medical research and medical education.

Citing Articles

[Focus: artificial intelligence in medicine-Legal aspects of using large language models in clinical practice].

Weicken E, Mittermaier M, Hoeren T, Kliesch J, Wiegand T, Witzenrath M Inn Med (Heidelb). 2025; .

PMID: 40085197 DOI: 10.1007/s00108-025-01861-0.


Assessing large language models for Lugano classification of malignant lymphoma in Japanese FDG-PET reports.

Ito R, Kato K, Nanataki K, Abe Y, Ogawa H, Minamimoto R EJNMMI Rep. 2025; 9(1):8.

PMID: 40059276 PMC: 11891112. DOI: 10.1186/s41824-025-00246-8.


Agents for Change: Artificial Intelligent Workflows for Quantitative Clinical Pharmacology and Translational Sciences.

Shahin M, Goswami S, Lobentanzer S, Corrigan B Clin Transl Sci. 2025; 18(3):e70188.

PMID: 40055986 PMC: 11889410. DOI: 10.1111/cts.70188.


Unregulated large language models produce medical device-like output.

Weissman G, Mankowitz T, Kanter G NPJ Digit Med. 2025; 8(1):148.

PMID: 40055537 PMC: 11889144. DOI: 10.1038/s41746-025-01544-y.


Red teaming ChatGPT in medicine to yield real-world insights on model behavior.

Chang C, Farah H, Gui H, Rezaei S, Bou-Khalil C, Park Y NPJ Digit Med. 2025; 8(1):149.

PMID: 40055532 PMC: 11889229. DOI: 10.1038/s41746-025-01542-0.


References
1.
Tang L, Sun Z, Idnay B, Nestor J, Soroush A, Elias P . Evaluating large language models on medical evidence summarization. NPJ Digit Med. 2023; 6(1):158. PMC: 10449915. DOI: 10.1038/s41746-023-00896-7. View

2.
Sallam M . ChatGPT Utility in Healthcare Education, Research, and Practice: Systematic Review on the Promising Perspectives and Valid Concerns. Healthcare (Basel). 2023; 11(6). PMC: 10048148. DOI: 10.3390/healthcare11060887. View

3.
Borve A, Gyllencreutz J, Terstappen K, Johansson Backman E, Aldenbratt A, Danielsson M . Smartphone teledermoscopy referrals: a novel process for improved triage of skin cancer patients. Acta Derm Venereol. 2014; 95(2):186-90. DOI: 10.2340/00015555-1906. View

4.
Sanderson K . GPT-4 is here: what scientists think. Nature. 2023; 615(7954):773. DOI: 10.1038/d41586-023-00816-5. View

5.
Stokel-Walker C, Van Noorden R . What ChatGPT and generative AI mean for science. Nature. 2023; 614(7947):214-216. DOI: 10.1038/d41586-023-00340-6. View