دورية أكاديمية

Evaluation of ChatGPT as a Counselling Tool for Italian-Speaking MASLD Patients: Assessment of Accuracy, Completeness and Comprehensibility

التفاصيل البيبلوغرافية
العنوان: Evaluation of ChatGPT as a Counselling Tool for Italian-Speaking MASLD Patients: Assessment of Accuracy, Completeness and Comprehensibility
المؤلفون: Nicola Pugliese, Davide Polverini, Rosa Lombardi, Grazia Pennisi, Federico Ravaioli, Angelo Armandi, Elena Buzzetti, Andrea Dalbeni, Antonio Liguori, Alessandro Mantovani, Rosanna Villani, Ivan Gardini, Cesare Hassan, Luca Valenti, Luca Miele, Salvatore Petta, Giada Sebastiani, Alessio Aghemo, NAFLD Expert Chatbot Working Group
المصدر: Journal of Personalized Medicine, Vol 14, Iss 6, p 568 (2024)
بيانات النشر: MDPI AG, 2024.
سنة النشر: 2024
المجموعة: LCC:Medicine
مصطلحات موضوعية: MASLD, artificial intelligence, counselling, diet, physical activity, steatosis, Medicine
الوصف: Background: Artificial intelligence (AI)-based chatbots have shown promise in providing counseling to patients with metabolic dysfunction-associated steatotic liver disease (MASLD). While ChatGPT3.5 has demonstrated the ability to comprehensively answer MASLD-related questions in English, its accuracy remains suboptimal. Whether language influences these results is unclear. This study aims to assess ChatGPT’s performance as a counseling tool for Italian MASLD patients. Methods: Thirteen Italian experts rated the accuracy, completeness and comprehensibility of ChatGPT3.5 in answering 15 MASLD-related questions in Italian using a six-point accuracy, three-point completeness and three-point comprehensibility Likert’s scale. Results: Mean scores for accuracy, completeness and comprehensibility were 4.57 ± 0.42, 2.14 ± 0.31 and 2.91 ± 0.07, respectively. The physical activity domain achieved the highest mean scores for accuracy and completeness, whereas the specialist referral domain achieved the lowest. Overall, Fleiss’s coefficient of concordance for accuracy, completeness and comprehensibility across all 15 questions was 0.016, 0.075 and −0.010, respectively. Age and academic role of the evaluators did not influence the scores. The results were not significantly different from our previous study focusing on English. Conclusion: Language does not appear to affect ChatGPT’s ability to provide comprehensible and complete counseling to MASLD patients, but accuracy remains suboptimal in certain domains.
نوع الوثيقة: article
وصف الملف: electronic resource
اللغة: English
تدمد: 2075-4426
Relation: https://www.mdpi.com/2075-4426/14/6/568; https://doaj.org/toc/2075-4426
DOI: 10.3390/jpm14060568
URL الوصول: https://doaj.org/article/6649a0b3d71240c393e06f624952f4ae
رقم الأكسشن: edsdoj.6649a0b3d71240c393e06f624952f4ae
قاعدة البيانات: Directory of Open Access Journals
الوصف
تدمد:20754426
DOI:10.3390/jpm14060568