TY - JOUR T1 - Artificial intelligence exercise recommendations in knee osteoarthritis rehabilitation: ChatGPT-4o and Gemini Advanced example TT - Diz osteoartriti rehabilitasyonunda yapay zeka egzersiz önerileri: ChatGPT-4o ve Gemini Advanced örneği AU - Gürses, Ömer Alperen AU - Özüdoğru, Anıl AU - Tuncay, Figen AU - Karartı, Caner PY - 2025 DA - June Y2 - 2025 DO - 10.54005/geneltip.1634118 JF - Genel Tıp Dergisi JO - Genel Tıp Derg PB - Selçuk Üniversitesi WT - DergiPark SN - 2602-3741 SP - 487 EP - 492 VL - 35 IS - 3 LA - en AB - AbstractAim: This study aimed to comparatively evaluate the propensity of the large language models ChatGPT-4o and Gemini Advanced to recommend personalised exercise based on patients' assessment data in knee osteoarthritis rehabilitation.Methods: This observational study included 40 patients diagnosed with knee OA according to the American College of Rheumatology criteria. Demographic data, pain levels, range of motion, muscle strength, functional status, and balance were assessed using standardized clinical tests. ChatGPT-4o and Gemini Advanced generated three-phase rehabilitation programs based on these assessments. Exercise recommendations were analyzed across 12 parameters, and statistical comparisons were conducted using the Mann-Whitney U test and Spearman’s correlation (p KW - Artificial intelligence KW - ChatGPT KW - Gemini KW - large language models KW - physiotherapy KW - knee osteoarthritis KW - knee osteoartrithis KW - rehabilitation program N2 - Amaç: Bu çalışma, büyük dil modelleri ChatGPT-4o ve Gemini Advanced'in diz osteoartriti rehabilitasyonunda hastaların değerlendirme verilerine dayanarak kişiselleştirilmiş egzersiz önerme eğilimini karşılaştırmalı olarak değerlendirmeyi amaçlamıştır.Yöntem: Gözlemsel nitelikteki bu çalışmaya, Amerikan Romatoloji Koleji kriterlerine göre diz osteoartriti tanısı almış 40 hasta dahil edilmiştir. Demografik veriler, ağrı düzeyi, eklem hareket açıklığı, kas kuvveti ve fonksiyonel durum ve denge standart klinik testlerle değerlendirilmiştir. ChatGPT-4o ve Gemini Advanced, bu değerlendirmelere dayanarak üç fazdan oluşan rehabilitasyon programları oluşturmuştur. Egzersiz önerileri 12 parametre üzerinden analiz edilmiş, istatistiksel karşılaştırmalar Mann-Whitney U testi ve Spearman korelasyonu ile yapılmıştır (p CR - 1. Bedi S, Liu Y, Orr-Ewing L, et al. Testing and evaluation of health care applications of large language models: a systematic review. JAMA. 2024. CR - 2. Goldberg CB, Adams L, Blumenthal D, et al. To do no harm—and the most good—with AI in health care. NejmAi. 2024. p. AIp2400036. CR - 3. Kohane IS. Injecting artificial intelligence into medicine. NejmAi. 2024. p. AIe2300197. CR - 4. Rao A, Pang M, Kim J, et al. Assessing the utility of ChatGPT throughout the entire clinical workflow: development and usability study. JMIR. 2023;25:e48659. CR - 5. Stafie CS, Sufaru I-G, Ghiciuc CM, et al. Exploring the intersection of artificial intelligence and clinical healthcare: a multidisciplinary review. Diagnostics. 2023;13:1995. CR - 6. Wachter RM, Brynjolfsson E. Will generative artificial intelligence deliver on its promise in health care? JAMA. 2024;331:65-9. CR - 7. Nazi ZA, Peng W, editors. Large language models in healthcare and medical domain: A review. Informatics; 2024: MDPI. CR - 8. Duran A, Cortuk O, Ok B. Future Perspective of Risk Prediction in Aesthetic Surgery: Is Artificial Intelligence Reliable? Aesthet Surg J. 2024;44:NP839-NP49. CR - 9. Güneş YC, Cesur T, Çamur E. Comparative Analysis of Large Language Models in Simplifying Turkish Ultrasound Reports to Enhance Patient Understanding. EurJTher. 2024;30:714-23. CR - 10. Cao M, Wang Q, Zhang X, et al. Large language models’ performances regarding common patient questions about osteoarthritis: A comparative analysis of ChatGPT-3.5, ChatGPT-4.0, and perplexity. J Sport Health Sci. 2024:101016. CR - 11. Bilika P, Stefanouli V, Strimpakos N, Kapreli EV. Clinical reasoning using ChatGPT: Is it beyond credibility for physiotherapists to use? Physiother Theory Pract. 2024;40:2943-62. CR - 12. Zhang L, Tashiro S, Mukaino M, Yamada S. Use of artificial intelligence large language models as a clinical tool in rehabilitation medicine: a comparative test case. J Rehabil Med. 2023;55. CR - 13. Nazir T, Ahmad U, Mal M, et al. Microsoft Bing vs Google Bard in Neurology: A comparative study of AI-generated patient education material. medRxiv. 2023:2023.08. 25.23294641. CR - 14. Dobson F, Hinman RS, Roos EM, et al. OARSI recommended performance-based tests to assess physical function in people diagnosed with hip or knee osteoarthritis. Osteoarthritis cartilage. 2013;21:1042-52. CR - 15. Fransen M, McConnell S, Harmer AR, et al. Exercise for osteoarthritis of the knee. Cochrane database of systematic reviews. 2015. CR - 16. McAlindon TE, Bannuru RR, Sullivan M, et al. OARSI guidelines for the non-surgical management of knee osteoarthritis. Osteoarthritis cartilage. 2014;22:363-88. CR - 17. Arbel Y, Gimmon Y, Shmueli L. Evaluating the Potential of Large Language Models for Vestibular Rehabilitation Education: A Comparison of ChatGPT, Google Gemini, and Clinicians. medRxiv. 2024:2024.01. 24.24301737. CR - 18. Chen X, You M, Wang L, et al. Evaluating and Enhancing Large Language Models Performance in Domain-specific Medicine: Osteoarthritis Management with DocOA. arXiv preprint arXiv:240112998. 2024. CR - 19. Gomez-Cabello CA, Borna S, Pressman SM, Haider SA, Forte AJ. Large Language Models for Intraoperative Decision Support in Plastic Surgery: A Comparison between ChatGPT-4 and Gemini. Medicina. 2024;60:957. CR - 20. Pirkle S, Yang J, Blumberg TJ. Do ChatGPT and Gemini Provide Appropriate Recommendations for Pediatric Orthopaedic Conditions? J Pediatr Orthop. 2025;45:e66-e71. CR - 21. Lau J. Gemini vs. ChatGPT: What's the difference? [2025]. In: Zapier, editor. 2024. UR - https://doi.org/10.54005/geneltip.1634118 L1 - https://dergipark.org.tr/tr/download/article-file/4583729 ER -