» Articles » PMID: 38212802

ChatGPT Sits the DFPH Exam: Large Language Model Performance and Potential to Support Public Health Learning

Overview
Journal BMC Med Educ
Publisher Biomed Central
Specialty Medical Education
Date 2024 Jan 11
PMID 38212802
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Artificial intelligence-based large language models, like ChatGPT, have been rapidly assessed for both risks and potential in health-related assessment and learning. However, their applications in public health professional exams have not yet been studied. We evaluated the performance of ChatGPT in part of the Faculty of Public Health's Diplomat exam (DFPH).

Methods: ChatGPT was provided with a bank of 119 publicly available DFPH question parts from past papers. Its performance was assessed by two active DFPH examiners. The degree of insight and level of understanding apparently displayed by ChatGPT was also assessed.

Results: ChatGPT passed 3 of 4 papers, surpassing the current pass rate. It performed best on questions relating to research methods. Its answers had a high floor. Examiners identified ChatGPT answers with 73.6% accuracy and human answers with 28.6% accuracy. ChatGPT provided a mean of 3.6 unique insights per question and appeared to demonstrate a required level of learning on 71.4% of occasions.

Conclusions: Large language models have rapidly increasing potential as a learning tool in public health education. However, their factual fallibility and the difficulty of distinguishing their responses from that of humans pose potential threats to teaching and learning.

Citing Articles

Generative AI Decision-Making Attributes in Complex Health Services: A Rapid Review.

Doreswamy N, Horstmanshof L Cureus. 2025; 17(1):e78257.

PMID: 40026934 PMC: 11871968. DOI: 10.7759/cureus.78257.


Using ChatGPT for medical education: the technical perspective.

Chan K, Yuen T, Co M BMC Med Educ. 2025; 25(1):201.

PMID: 39920711 PMC: 11806775. DOI: 10.1186/s12909-025-06785-9.


eHealth Assistant AI Chatbot Using a Large Language Model to Provide Personalized Answers through Secure Decentralized Communication.

Pap I, Oniga S Sensors (Basel). 2024; 24(18).

PMID: 39338885 PMC: 11436070. DOI: 10.3390/s24186140.


Opportunities, challenges, and future directions of large language models, including ChatGPT in medical education: a systematic scoping review.

Xu X, Chen Y, Miao J J Educ Eval Health Prof. 2024; 21:6.

PMID: 38486402 PMC: 11035906. DOI: 10.3352/jeehp.2024.21.6.


ChatGPT's Accuracy on Magnetic Resonance Imaging Basics: Characteristics and Limitations Depending on the Question Type.

Lee K, Lee R Diagnostics (Basel). 2024; 14(2).

PMID: 38248048 PMC: 10814518. DOI: 10.3390/diagnostics14020171.

References
1.
Wang Y, Shen H, Chen T . Performance of ChatGPT on the pharmacist licensing examination in Taiwan. J Chin Med Assoc. 2023; 86(7):653-658. DOI: 10.1097/JCMA.0000000000000942. View

2.
Oh N, Choi G, Lee W . ChatGPT goes to the operating room: evaluating GPT-4 performance and its potential in surgical education and training in the era of large language models. Ann Surg Treat Res. 2023; 104(5):269-273. PMC: 10172028. DOI: 10.4174/astr.2023.104.5.269. View

3.
Ayers J, Zhu Z, Poliak A, Leas E, Dredze M, Hogarth M . Evaluating Artificial Intelligence Responses to Public Health Questions. JAMA Netw Open. 2023; 6(6):e2317517. PMC: 10248742. DOI: 10.1001/jamanetworkopen.2023.17517. View

4.
Holzinger A, Keiblinger K, Holub P, Zatloukal K, Muller H . AI for life: Trends in artificial intelligence for biotechnology. N Biotechnol. 2023; 74:16-24. DOI: 10.1016/j.nbt.2023.02.001. View

5.
Tsang R . Practical Applications of ChatGPT in Undergraduate Medical Education. J Med Educ Curric Dev. 2023; 10:23821205231178449. PMC: 10226299. DOI: 10.1177/23821205231178449. View