
Evaluation of ChatGPT-4o's Answers to Questions About Hip Arthroscopy from the Patient Perspective

Overview
Date 2024 Dec 25
PMID 39719917
Abstract

Objectives: This study aimed to evaluate the responses provided by ChatGPT-4o to the questions most frequently asked by patients about hip arthroscopy.

Materials and Methods: In this cross-sectional survey study, a new Google account with no search history was created to identify the 20 most frequently asked questions about hip arthroscopy on Google. These questions were posed to a new ChatGPT-4o account on June 1, 2024, and the responses were recorded. Ten orthopedic surgeons specializing in sports surgery rated each response for relevance, accuracy, clarity, and completeness on a scale from 1 (worst) to 5 (best). Interrater reliability was assessed using the intraclass correlation coefficient (ICC).

Results: The lowest score given by the surgeons for any response was 4/5 in each subcategory. The highest mean scores were in accuracy and clarity, followed by relevance, with completeness receiving the lowest scores. The overall mean score was 4.49±0.16. Interrater reliability showed insufficient overall agreement (ICC=0.004, p=0.383), with the highest agreement in clarity (ICC=0.039, p=0.131) and the lowest in accuracy (ICC=-0.019, p=0.688).
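For readers unfamiliar with the ICC, the following is a minimal sketch of how such a coefficient can be computed from an items-by-raters table. The abstract does not state which ICC form the authors used, so the two-way random-effects, absolute-agreement, single-rater form, often written ICC(2,1), is an assumption here. The function name `icc_2_1` and both data sets are illustrative and are not the study's data.

```python
def icc_2_1(ratings):
    """ICC(2,1): two-way random effects, absolute agreement, single rater.

    `ratings` is a list of rows, one row per rated item, one column per rater.
    Choosing this ICC form is an assumption; the abstract does not specify it.
    """
    n = len(ratings)         # number of rated items (rows)
    k = len(ratings[0])      # number of raters (columns)
    grand = sum(sum(row) for row in ratings) / (n * k)

    row_means = [sum(row) / k for row in ratings]
    col_means = [sum(ratings[i][j] for i in range(n)) / n for j in range(k)]

    # Sums of squares from a two-way ANOVA without replication.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in ratings for x in row)
    ss_err = ss_total - ss_rows - ss_cols

    msr = ss_rows / (n - 1)               # between-items mean square
    msc = ss_cols / (k - 1)               # between-raters mean square
    mse = ss_err / ((n - 1) * (k - 1))    # residual mean square

    return (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)


# Widely used textbook example (Shrout and Fleiss, 1979): 6 items, 4 raters.
textbook = [
    [9, 2, 5, 8],
    [6, 1, 3, 2],
    [8, 4, 6, 8],
    [7, 1, 2, 6],
    [10, 5, 6, 9],
    [6, 2, 4, 7],
]
print(round(icc_2_1(textbook), 2))  # 0.29

# Hypothetical ceiling case: every score is 4 or 5 and every item has the
# same mean, so there is no between-item variance for raters to agree on.
ceiling = [
    [5, 4, 5],
    [4, 5, 5],
    [5, 5, 4],
    [4, 5, 5],
]
print(icc_2_1(ceiling) < 0)  # True
```

The second data set illustrates a general property of the ICC: because it compares between-item variance with residual variance, it can be near zero or negative when all scores cluster in a narrow range, even if raters never differ by more than one point. Whether this explains the values reported here cannot be determined from the abstract alone.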

Conclusion: The study confirms our hypothesis that ChatGPT-4o provides above-average quality responses to frequently asked questions about hip arthroscopy, as evidenced by the high scores in relevance, accuracy, clarity, and completeness. However, it is still advisable to consult orthopedic specialists on the subject, incorporating ChatGPT's suggestions during the final decision-making process.

Citing Articles

Can ChatGPT pass the Turkish Orthopedics and Traumatology Board Examination? Turkish orthopedic surgeons versus artificial intelligence.

Pamuk C, Uyanik A, Kuyucu E, Ugurlar M. Ulus Travma Acil Cerrahi Derg. 2025; 31(3):310-315.

PMID: 40052322 PMC: 11894241. DOI: 10.14744/tjtes.2025.07724.
