Evaluation of ChatGPT-4o's Answers to Questions About Hip Arthroscopy from the Patient Perspective

Overview

Journal Jt Dis Relat Surg

Date 2024 Dec 25

PMID 39719917

Authors

Gokhan Ayik

Niyazi Ercan

Yunus Demirtas

Tugrul Yildirim

Gokhan Cakmak

Affiliations

Soon will be listed here.

Abstract

Objectives: This study aimed to evaluate the responses provided by ChatGPT-4o to the most frequently asked questions by patients regarding hip arthroscopy.

Materials And Methods: In this cross-sectional survey study, a new Google account without a search history was created to determine the 20 most frequently asked questions about hip arthroscopy via Google. These questions were asked to a new ChatGPT-4o account on June 1, 2024, and the responses were recorded. Ten orthopedic surgeons specializing in sports surgery rated the responses using a rating scale to assess relevance, accuracy, clarity, and completeness. The responses were scored on a scale from 1 to 5, with 1 being the worst and 5 being the best. The interrater reliability assessed via the intraclass correlation coefficient (ICC).

Results: The lowest score given by the surgeons for any response was 4/5 in each subcategory. The highest mean scores were in accuracy and clarity, followed by relevance, with completeness receiving the lowest scores. The overall mean score was 4.49±0.16. Interrater reliability showed insufficient overall agreement (ICC=0.004, p=0.383), with the highest agreement in clarity (ICC=0.039, p=0.131) and the lowest in accuracy (ICC=-0.019, p=0.688).

Conclusion: The study confirms our hypothesis that ChatGPT-4o provides above-average quality responses to frequently asked questions about hip arthroscopy, as evidenced by the high scores in relevance, accuracy, clarity, and completeness. However, it is still advisable to consult orthopedic specialists on the subject, incorporating ChatGPT's suggestions during the final decision-making process.

Citing Articles

Can ChatGPT pass the Turkish Orthopedics and Traumatology Board Examination? Turkish orthopedic surgeons versus artificial intelligence.

Pamuk C, Uyanik A, Kuyucu E, Ugurlar M Ulus Travma Acil Cerrahi Derg. 2025; 31(3):310-315.

PMID: 40052322 PMC: 11894241. DOI: 10.14744/tjtes.2025.07724.

References

AlShehri Y, McConkey M, Lodhia P . ChatGPT Provides Satisfactory but Occasionally Inaccurate Answers to Common Patient Hip Arthroscopy Questions. Arthroscopy. 2024; . DOI: 10.1016/j.arthro.2024.06.017. View

Clarke M, Arora A, Villar R . Hip arthroscopy: complications in 1054 cases. Clin Orthop Relat Res. 2003; (406):84-8. DOI: 10.1097/01.blo.0000043048.84315.af. View

Johns W, Martinazzi B, Miltenberg B, Nam H, Hammoud S . ChatGPT Provides Unsatisfactory Responses to Frequently Asked Questions Regarding Anterior Cruciate Ligament Reconstruction. Arthroscopy. 2024; 40(7):2067-2079.e1. DOI: 10.1016/j.arthro.2024.01.017. View

Divecha H, Rajpura A, Board T . Hip arthroscopy: a focus on the future. Hip Int. 2015; 25(4):323-9. DOI: 10.5301/hipint.5000271. View

Yapar D, Demir Avci Y, Tokur Sonuvar E, Egerci O, Yapar A . ChatGPT's potential to support home care for patients in the early period after orthopedic interventions and enhance public health. Jt Dis Relat Surg. 2023; 35(1):169-176. PMC: 10746912. DOI: 10.52312/jdrs.2023.1402. View

Alkaissi H, McFarlane S . Artificial Hallucinations in ChatGPT: Implications in Scientific Writing. Cureus. 2023; 15(2):e35179. PMC: 9939079. DOI: 10.7759/cureus.35179. View

Sparks C, Fasulo S, Windsor J, Bankauskas V, Contrada E, Kraeutler M . ChatGPT Is Moderately Accurate in Providing a General Overview of Orthopaedic Conditions. JB JS Open Access. 2024; 9(2). PMC: 11191019. DOI: 10.2106/JBJS.OA.23.00129. View

Gilson A, Safranek C, Huang T, Socrates V, Chi L, Taylor R . How Does ChatGPT Perform on the United States Medical Licensing Examination (USMLE)? The Implications of Large Language Models for Medical Education and Knowledge Assessment. JMIR Med Educ. 2023; 9:e45312. PMC: 9947764. DOI: 10.2196/45312. View

Van Riel N, Auwerx K, Debbaut P, Van Hees S, Schoenmakers B . The effect of Dr Google on doctor-patient encounters in primary care: a quantitative, observational, cross-sectional study. BJGP Open. 2018; 1(2):bjgpopen17X100833. PMC: 6169945. DOI: 10.3399/bjgpopen17X100833. View

10.

Kunze K, Orr M, Krebs V, Bhandari M, Piuzzi N . Potential benefits, unintended consequences, and future roles of artificial intelligence in orthopaedic surgery research : a call to emphasize data quality and indications. Bone Jt Open. 2022; 3(1):93-97. PMC: 9047073. DOI: 10.1302/2633-1462.31.BJO-2021-0123.R1. View

11.

Atik O . Artificial intelligence: Who must have autonomy the machine or the human?. Jt Dis Relat Surg. 2023; 35(1):1-2. PMC: 10746914. DOI: 10.52312/jdrs.2023.57918. View

12.

Ciceklidag M, Ayanoglu T, Kaptan A, Vural A, Kalaycioglu O, Ozer M . Effect of the presence of cysts in the hip joint on hip arthroscopy. Jt Dis Relat Surg. 2024; 35(3):645-653. PMC: 11411879. DOI: 10.52312/jdrs.2024.1657. View

13.

Magruder M, Rodriguez A, Wong J, Erez O, Piuzzi N, Scuderi G . Assessing Ability for ChatGPT to Answer Total Knee Arthroplasty-Related Questions. J Arthroplasty. 2024; 39(8):2022-2027. DOI: 10.1016/j.arth.2024.02.023. View

14.

Hurley E, Crook B, Lorentz S, Danilkowicz R, Lau B, Taylor D . Evaluation High-Quality of Information from ChatGPT (Artificial Intelligence-Large Language Model) Artificial Intelligence on Shoulder Stabilization Surgery. Arthroscopy. 2023; 40(3):726-731.e6. DOI: 10.1016/j.arthro.2023.07.048. View

15.

Harris J, McCormick F, Abrams G, Gupta A, Ellis T, Bach Jr B . Complications and reoperations during and after hip arthroscopy: a systematic review of 92 studies and more than 6,000 patients. Arthroscopy. 2013; 29(3):589-95. DOI: 10.1016/j.arthro.2012.11.003. View

16.

Ganz R, Parvizi J, Beck M, Leunig M, Notzli H, Siebenrock K . Femoroacetabular impingement: a cause for osteoarthritis of the hip. Clin Orthop Relat Res. 2003; (417):112-20. DOI: 10.1097/01.blo.0000096804.78689.c2. View

17.

Cocco A, Zordan R, Taylor D, Weiland T, Dilley S, Kant J . Dr Google in the ED: searching for online health information by adult emergency department patients. Med J Aust. 2018; 209(8):342-347. DOI: 10.5694/mja17.00889. View

18.

Jamil M, Dandachli W, Noordin S, Witt J . Hip arthroscopy: Indications, outcomes and complications. Int J Surg. 2017; 54(Pt B):341-344. DOI: 10.1016/j.ijsu.2017.08.557. View

19.

Ozbek E, Ertan M, Kindan P, Karaca M, Gursoy S, Chahla J . ChatGPT Can Offer At Least Satisfactory Responses to Common Patient Questions Regarding Hip Arthroscopy. Arthroscopy. 2024; . DOI: 10.1016/j.arthro.2024.08.036. View

20.

Densen P . Challenges and opportunities facing medical education. Trans Am Clin Climatol Assoc. 2011; 122:48-58. PMC: 3116346. View