Evaluating AI Chatbots for Pediatric Contact Lenses: A Study on Accuracy, Readability, and Reliability
Öz
This study evaluated the accuracy, readability, and comprehensiveness of patient-facing responses generated by LLM-based chatbot platforms to pediatric contact lens (CL)–related questions, using expert grading and readability benchmarking. Five platforms (ChatGPT-4o, Gemini 1.5, Perplexity, Copilot, and Claude 3.5 Sonnet) were assessed using 28 curated questions. Two pediatric ophthalmologists graded anonymized outputs using DISCERN and PEMAT-P, 5-point Likert scales for accuracy and comprehensiveness, and multiple automated readability indices. Expert-written responses were included only for readability benchmarking. ChatGPT-4o produced the longest responses (p<0.0001). Accuracy and comprehensiveness differed across platforms (p=0.0216 and p=0.0067), with ChatGPT-4o scoring higher than Perplexity in post-hoc comparisons (p=0.0173 and p=0.0087). Expert responses were shorter but showed higher complexity on readability indices. Accuracy-based reproducibility was high for general pediatric CL queries but lower for aphakic CL–related questions (p=0.041), and factual inaccuracies were more frequent in aphakic topics. While LLMs may support patient education, variability in correctness and completeness underscores the need for expert oversight; these tools should complement, not replace, clinical expertise in pediatric CL usage.
Anahtar Kelimeler
Destekleyen Kurum
Etik Beyan
Teşekkür
Kaynakça
- 1. Korngiebel DM, Mooney SD. Considering the possibilities and pitfalls of Generative Pre-trained Transformer 3 (GPT-3) in healthcare delivery. NPJ Digit Med. Jun 3 2021;4(1):93. doi:10.1038/s41746-021-00464-x
- 2. Wang L, Wan Z, Ni C, et al. A Systematic Review of ChatGPT and Other Conversational Large Language Models in Healthcare. medRxiv. Apr 27 2024;doi:10.1101/2024.04.26.24306390
- 3. Alowais SA, Alghamdi SS, Alsuhebany N, et al. Revolutionizing healthcare: the role of artificial intelligence in clinical practice. BMC Medical Education. 2023/09/22 2023;23(1):689. doi:10.1186/s12909-023-04698-z
- 4. Sengor T, Gencaga Atakan T. Management of Contact Lenses and Visual Development in Pediatric Aphakia. Turk J Ophthalmol. Apr 19 2024;54(2):90-102. doi:10.4274/tjo.galenos.2023.56252
- 5. Tomiyama ES, Kobia-Acquah E, Ansari SM, et al. Scoping review: Reporting characteristics for the safety of contact lenses in the pediatric population. Optom Vis Sci. Jul 16 2024;doi:10.1097/OPX.0000000000002156
- 6. Bullimore MA, Richdale K. Incidence of Corneal Adverse Events in Children Wearing Soft Contact Lenses. Eye Contact Lens. May 1 2023;49(5):204-211. doi:10.1097/ICL.0000000000000976
- 7. Ezinne NE, Bhattarai D, Ekemiri KK, et al. Demographic profiles of contact lens wearers and their association with lens wear characteristics in Trinidad and Tobago: A retrospective study. PLoS One. 2022;17(7):e0264659. doi:10.1371/journal.pone.0264659
- 8. Bullimore MA. The Safety of Soft Contact Lenses in Children. Optom Vis Sci. Jun 2017;94(6):638-646. doi:10.1097/OPX.0000000000001078
Ayrıntılar
Birincil Dil
İngilizce
Konular
Göz Hastalıkları
Bölüm
Araştırma Makalesi
Yazarlar
Meral Yıldız
0000-0002-8503-5637
Türkiye
Sevde İşleker
0000-0002-7352-7044
Türkiye
Esin Söğütlü Sarı
0000-0003-3729-6178
Türkiye
Ahmet Özmen
0000-0002-1261-5120
Türkiye
Mehmet Baykara
0000-0002-5555-1649
Türkiye
Yayımlanma Tarihi
16 Mart 2026
Gönderilme Tarihi
23 Eylül 2025
Kabul Tarihi
26 Şubat 2026
Yayımlandığı Sayı
Yıl 2026 Cilt: 52
