Research Article

COMPARISON OF PERFORMANCES OF OPEN ACCESS NATURAL LANGUAGE PROCESSING BASED CHATBOT APPLICATIONS IN TRIAGE DECISIONS

Volume: 25 Issue: 3, 26 December 2023

Abstract

Objective: Because they are publicly available, easy to use, and continuously evolving, next-generation chatbots have the potential to be used in triage, one of the most critical functions of an Emergency Department. The aim of this study was to assess the performance of Generative Pre-trained Transformer 4 (GPT-4), Bard, and Claude in decision-making for Emergency Department triage. Material and Methods: This was a preliminary cross-sectional study conducted with 50 case scenarios. Emergency Medicine specialists determined the reference Emergency Severity Index (ESI) triage category of each scenario. Each case scenario was then submitted to the three chatbots. Classifications that were inconsistent between a chatbot and the reference were defined as over-triage (false positive) or under-triage (false negative). The primary outcome was the predictive performance of the chatbots, and the secondary outcome was the difference between them in predicting high-acuity triage. Results: The F1 scores of GPT-4, Bard, and Claude for predicting ESI 1 and 2 were 0.899, 0.791, and 0.865, respectively. The receiver operating characteristic (ROC) curve of GPT-4 for high-acuity predictions showed an area under the curve (AUC) of 0.911 (95% CI: 0.814-1; p<0.001), while Bard showed an AUC of 0.819 (95% CI: 0.692-0.945; p<0.001) and Claude an AUC of 0.881 (95% CI: 0.768-0.994; p<0.001). Conclusion: GPT-4, in its current form, was able to detect high-acuity ESI categories in our case set and was in close agreement with Emergency Medicine specialists, followed by Claude, while Bard's agreement was relatively lower. GPT-4 and Claude provided better case management recommendations than Bard. We believe that studies evaluating the effectiveness and limitations of chatbots in triage are important because of their future potential.
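The high-acuity metric described in the abstract can be illustrated with a minimal Python sketch. It assumes that ESI levels are collapsed into a binary label (ESI 1 or 2 versus ESI 3 to 5), so that a false positive corresponds to over-triage across that boundary and a false negative to under-triage. The function names and the example case lists are hypothetical and are not taken from the study data; the authors' actual statistical procedure is not described on this page.

```python
from typing import Dict, Sequence

# Assumption: ESI 1 and 2 form the "high acuity" class, as in the abstract.
HIGH_ACUITY_LEVELS = {1, 2}


def is_high_acuity(esi_level: int) -> bool:
    """Return True when an ESI level counts as high acuity."""
    return esi_level in HIGH_ACUITY_LEVELS


def high_acuity_counts(reference: Sequence[int],
                       predicted: Sequence[int]) -> Dict[str, int]:
    """Count binary outcomes of chatbot predictions against the reference."""
    if len(reference) != len(predicted):
        raise ValueError("reference and predicted must have the same length")
    counts = {"tp": 0, "fp": 0, "fn": 0, "tn": 0}
    for ref, pred in zip(reference, predicted):
        ref_high, pred_high = is_high_acuity(ref), is_high_acuity(pred)
        if ref_high and pred_high:
            counts["tp"] += 1
        elif pred_high:
            counts["fp"] += 1  # over-triage: predicted high, reference low
        elif ref_high:
            counts["fn"] += 1  # under-triage: predicted low, reference high
        else:
            counts["tn"] += 1
    return counts


def f1_score(tp: int, fp: int, fn: int) -> float:
    """F1 = 2*TP / (2*TP + FP + FN); defined as 0.0 when the denominator is 0."""
    denominator = 2 * tp + fp + fn
    return 2 * tp / denominator if denominator else 0.0


# Hypothetical ESI categories for eight illustrative cases (not study data).
reference_esi = [1, 2, 3, 2, 4, 5, 1, 3]
chatbot_esi = [1, 2, 2, 3, 4, 5, 2, 3]

counts = high_acuity_counts(reference_esi, chatbot_esi)
f1 = f1_score(counts["tp"], counts["fp"], counts["fn"])
print(counts)        # {'tp': 3, 'fp': 1, 'fn': 1, 'tn': 3}
print(round(f1, 3))  # 0.75
```

In this binary view, a disagreement that stays on one side of the boundary (for example ESI 1 versus ESI 2) does not count as an error, which is why the sketch only approximates the category-level over-triage and under-triage definitions given in the abstract.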

Keywords

Ethics Statement

Institutional review board approval was obtained for this study on 06.04.2023 (Kocaeli University Non-Interventional Clinical Research Ethics Committee - GOKAEK-2023/07.10).

Acknowledgments

The authors would like to thank Prof. Elif Yaka for her valuable insights.

Details

Primary Language

English

Subjects

Health Services and Systems (Other)

Section

Research Article

Publication Date

26 December 2023

Submission Date

1 October 2023

Acceptance Date

12 October 2023

Published in Issue

Year 2023 Volume: 25 Issue: 3

Cite

APA
Sarbay, İ., Bozdereli Berikol, G., Özturan, İ. U., & Grimes, K. (2023). COMPARISON OF PERFORMANCES OF OPEN ACCESS NATURAL LANGUAGE PROCESSING BASED CHATBOT APPLICATIONS IN TRIAGE DECISIONS. The Journal of Kırıkkale University Faculty of Medicine, 25(3), 482-521. https://doi.org/10.24938/kutfd.1369468

This journal is a publication of Kırıkkale University Faculty of Medicine.