دورية أكاديمية

Evaluating the accuracy and relevance of ChatGPT responses to frequently asked questions regarding total knee replacement

التفاصيل البيبلوغرافية
العنوان: Evaluating the accuracy and relevance of ChatGPT responses to frequently asked questions regarding total knee replacement
المؤلفون: Siyuan Zhang, Zi Qiang Glen Liau, Kian Loong Melvin Tan, Wei Liang Chua
المصدر: Knee Surgery & Related Research, Vol 36, Iss 1, Pp 1-8 (2024)
بيانات النشر: BMC, 2024.
سنة النشر: 2024
المجموعة: LCC:Orthopedic surgery
مصطلحات موضوعية: ChatGPT, Artificial intelligence, Chatbot, Large language model, Total knee replacement, Total knee arthroplasty, Orthopedic surgery, RD701-811
الوصف: Abstract Background Chat Generative Pretrained Transformer (ChatGPT), a generative artificial intelligence chatbot, may have broad applications in healthcare delivery and patient education due to its ability to provide human-like responses to a wide range of patient queries. However, there is limited evidence regarding its ability to provide reliable and useful information on orthopaedic procedures. This study seeks to evaluate the accuracy and relevance of responses provided by ChatGPT to frequently asked questions (FAQs) regarding total knee replacement (TKR). Methods A list of 50 clinically-relevant FAQs regarding TKR was collated. Each question was individually entered as a prompt to ChatGPT (version 3.5), and the first response generated was recorded. Responses were then reviewed by two independent orthopaedic surgeons and graded on a Likert scale for their factual accuracy and relevance. These responses were then classified into accurate versus inaccurate and relevant versus irrelevant responses using preset thresholds on the Likert scale. Results Most responses were accurate, while all responses were relevant. Of the 50 FAQs, 44/50 (88%) of ChatGPT responses were classified as accurate, achieving a mean Likert grade of 4.6/5 for factual accuracy. On the other hand, 50/50 (100%) of responses were classified as relevant, achieving a mean Likert grade of 4.9/5 for relevance. Conclusion ChatGPT performed well in providing accurate and relevant responses to FAQs regarding TKR, demonstrating great potential as a tool for patient education. However, it is not infallible and can occasionally provide inaccurate medical information. Patients and clinicians intending to utilize this technology should be mindful of its limitations and ensure adequate supervision and verification of information provided.
نوع الوثيقة: article
وصف الملف: electronic resource
اللغة: English
تدمد: 2234-2451
Relation: https://doaj.org/toc/2234-2451
DOI: 10.1186/s43019-024-00218-5
URL الوصول: https://doaj.org/article/c8e83cea3662483da140e91cec7a5aa2
رقم الأكسشن: edsdoj.8e83cea3662483da140e91cec7a5aa2
قاعدة البيانات: Directory of Open Access Journals
الوصف
تدمد:22342451
DOI:10.1186/s43019-024-00218-5