ChatGPT in Glioma Adjuvant Therapy Decision Making: Ready to Assume the Role of a Doctor in the Tumour Board?

Overview

Journal BMJ Health Care Inform

Specialties Medical Informatics
Public Health

Date 2023 Jul 3

PMID 37399360

Authors

Julien Haemmerli

Lukas Sveikata

Aria Nouri

Adrien May

Kristof Egervari

Christian Freyschlag

Johannes A Lobrinus

Denis Migliorini

Shahan Momjian

Nicolae Sanda

Karl Schaller

Sebastien Tran

Jacky Yeung

Philippe Bijlenga

Affiliations

Soon will be listed here.

Abstract

Objective: To evaluate ChatGPT's performance in brain glioma adjuvant therapy decision-making.

Methods: We randomly selected 10 patients with brain gliomas discussed at our institution's central nervous system tumour board (CNS TB). Patients' clinical status, surgical outcome, textual imaging information and immuno-pathology results were provided to ChatGPT V.3.5 and seven CNS tumour experts. The chatbot was asked to give the adjuvant treatment choice, and the regimen while considering the patient's functional status. The experts rated the artificial intelligence-based recommendations from 0 (complete disagreement) to 10 (complete agreement). An intraclass correlation coefficient agreement (ICC) was used to measure the inter-rater agreement.

Results: Eight patients (80%) met the criteria for glioblastoma and two (20%) were low-grade gliomas. The experts rated the quality of ChatGPT recommendations as poor for diagnosis (median 3, IQR 1-7.8, ICC 0.9, 95% CI 0.7 to 1.0), good for treatment recommendation (7, IQR 6-8, ICC 0.8, 95% CI 0.4 to 0.9), good for therapy regimen (7, IQR 4-8, ICC 0.8, 95% CI 0.5 to 0.9), moderate for functional status consideration (6, IQR 1-7, ICC 0.7, 95% CI 0.3 to 0.9) and moderate for overall agreement with the recommendations (5, IQR 3-7, ICC 0.7, 95% CI 0.3 to 0.9). No differences were observed between the glioblastomas and low-grade glioma ratings.

Conclusions: ChatGPT performed poorly in classifying glioma types but was good for adjuvant treatment recommendations as evaluated by CNS TB experts. Even though the ChatGPT lacks the precision to replace expert opinion, it may serve as a promising supplemental tool within a human-in-the-loop approach.

Citing Articles

Large Language Models for Chatbot Health Advice Studies: A Systematic Review.

Huo B, Boyle A, Marfo N, Tangamornsuksan W, Steen J, McKechnie T JAMA Netw Open. 2025; 8(2):e2457879.

PMID: 39903463 PMC: 11795331. DOI: 10.1001/jamanetworkopen.2024.57879.

ChatGPT vs Expert-Guided Care Pathways for Postesophagectomy Symptom Management.

Abou Chaar M, Grigsby-Rocca G, Huang M, Blackmon S Ann Thorac Surg Short Rep. 2025; 2(4):674-679.

PMID: 39790627 PMC: 11708366. DOI: 10.1016/j.atssr.2024.06.007.

Adaptive Treatment of Metastatic Prostate Cancer Using Generative Artificial Intelligence.

Derbal Y Clin Med Insights Oncol. 2025; 19():11795549241311408.

PMID: 39776668 PMC: 11701910. DOI: 10.1177/11795549241311408.

Healthcare professionals and the public sentiment analysis of ChatGPT in clinical practice.

Lu L, Zhu Y, Yang J, Yang Y, Ye J, Ai S Sci Rep. 2025; 15(1):1223.

PMID: 39774168 PMC: 11707298. DOI: 10.1038/s41598-024-84512-y.

Analyzing evaluation methods for large language models in the medical field: a scoping review.

Lee J, Park S, Shin J, Cho B BMC Med Inform Decis Mak. 2024; 24(1):366.

PMID: 39614219 PMC: 11606129. DOI: 10.1186/s12911-024-02709-7.

References

Else H . Abstracts written by ChatGPT fool scientists. Nature. 2023; 613(7944):423. DOI: 10.1038/d41586-023-00056-7. View

Connor C . Artificial Intelligence and Machine Learning in Anesthesiology. Anesthesiology. 2019; 131(6):1346-1359. PMC: 6778496. DOI: 10.1097/ALN.0000000000002694. View

Thorp H . ChatGPT is fun, but not an author. Science. 2023; 379(6630):313. DOI: 10.1126/science.adg7879. View

Biswas S . ChatGPT and the Future of Medical Writing. Radiology. 2023; 307(2):e223312. DOI: 10.1148/radiol.223312. View

Lee J, Yoon W, Kim S, Kim D, Kim S, So C . BioBERT: a pre-trained biomedical language representation model for biomedical text mining. Bioinformatics. 2019; 36(4):1234-1240. PMC: 7703786. DOI: 10.1093/bioinformatics/btz682. View

Ryken T, Parney I, Buatti J, Kalkanis S, Olson J . The role of radiotherapy in the management of patients with diffuse low grade glioma: A systematic review and evidence-based clinical practice guideline. J Neurooncol. 2015; 125(3):551-83. DOI: 10.1007/s11060-015-1948-1. View

Ameratunga M, Miller D, Ng W, Wada M, Gonzalvo A, Cher L . A single-institution prospective evaluation of a neuro-oncology multidisciplinary team meeting. J Clin Neurosci. 2018; 56:127-130. DOI: 10.1016/j.jocn.2018.06.032. View

Stupp R, Taillibert S, Kanner A, Read W, Steinberg D, Lhermitte B . Effect of Tumor-Treating Fields Plus Maintenance Temozolomide vs Maintenance Temozolomide Alone on Survival in Patients With Glioblastoma: A Randomized Clinical Trial. JAMA. 2017; 318(23):2306-2316. PMC: 5820703. DOI: 10.1001/jama.2017.18718. View

Snyder J, Schultz L, Walbert T . The role of tumor board conferences in neuro-oncology: a nationwide provider survey. J Neurooncol. 2017; 133(1):1-7. DOI: 10.1007/s11060-017-2416-x. View

10.

Barbaro M, Fine H, Magge R . Foundations of Neuro-Oncology: A Multidisciplinary Approach. World Neurosurg. 2021; 151:392-401. DOI: 10.1016/j.wneu.2021.02.059. View

11.

Kitamura F . ChatGPT Is Shaping the Future of Medical Writing But Still Requires Human Judgment. Radiology. 2023; 307(2):e230171. DOI: 10.1148/radiol.230171. View

12.

Doshi R, Bajaj S, Krumholz H . ChatGPT: Temptations of Progress. Am J Bioeth. 2023; 23(4):6-8. DOI: 10.1080/15265161.2023.2180110. View

13.

Berardi R, Morgese F, Rinaldi S, Torniai M, Mentrasti G, Scortichini L . Benefits and Limitations of a Multidisciplinary Approach in Cancer Patient Management. Cancer Manag Res. 2020; 12:9363-9374. PMC: 7533227. DOI: 10.2147/CMAR.S220976. View

14.

Bagley S, Kothari S, Rahman R, Lee E, Dunn G, Galanis E . Glioblastoma Clinical Trials: Current Landscape and Opportunities for Improvement. Clin Cancer Res. 2021; 28(4):594-602. PMC: 9044253. DOI: 10.1158/1078-0432.CCR-21-2750. View

15.

The Lancet Digital Health . ChatGPT: friend or foe?. Lancet Digit Health. 2023; 5(3):e102. DOI: 10.1016/S2589-7500(23)00023-7. View

16.

Kulkarni S, Seneviratne N, Baig M, Khan A . Artificial Intelligence in Medicine: Where Are We Now?. Acad Radiol. 2019; 27(1):62-70. DOI: 10.1016/j.acra.2019.10.001. View

17.

Koo T, Li M . A Guideline of Selecting and Reporting Intraclass Correlation Coefficients for Reliability Research. J Chiropr Med. 2016; 15(2):155-63. PMC: 4913118. DOI: 10.1016/j.jcm.2016.02.012. View

18.

Malmstrom A, Gronberg B, Marosi C, Stupp R, Frappaz D, Schultz H . Temozolomide versus standard 6-week radiotherapy versus hypofractionated radiotherapy in patients older than 60 years with glioblastoma: the Nordic randomised, phase 3 trial. Lancet Oncol. 2012; 13(9):916-26. DOI: 10.1016/S1470-2045(12)70265-6. View

19.

van Dis E, Bollen J, Zuidema W, van Rooij R, Bockting C . ChatGPT: five priorities for research. Nature. 2023; 614(7947):224-226. DOI: 10.1038/d41586-023-00288-7. View

20.

Liu Z, Roberts R, Lal-Nag M, Chen X, Huang R, Tong W . AI-based language models powering drug discovery and development. Drug Discov Today. 2021; 26(11):2593-2607. PMC: 8604259. DOI: 10.1016/j.drudis.2021.06.009. View