Research Article

COMPARISON OF PERFORMANCES OF OPEN ACCESS NATURAL LANGUAGE PROCESSING BASED CHATBOT APPLICATIONS IN TRIAGE DECISIONS

Volume: 25 Number: 3 December 26, 2023

Abstract

Objective: Because they are publicly available, easy to use, and continuously evolving, next-generation chatbots have the potential to be used in triage, one of the most critical functions of an Emergency Department. The aim of this study was to assess the performance of Generative Pre-trained Transformer 4 (GPT-4), Bard, and Claude in decision-making for Emergency Department triage.

Material and Methods: This was a preliminary cross-sectional study conducted with 50 case scenarios. Emergency Medicine specialists determined the reference Emergency Severity Index triage category of each scenario. Subsequently, each case scenario was submitted to the three chatbots. Classifications that disagreed with the reference were defined as over-triage (false positive) or under-triage (false negative). The primary outcome was the predictive performance of the chatbots; the secondary outcome was the difference between them in predicting high-acuity triage.

Results: F1 scores for GPT-4, Bard, and Claude for predicting Emergency Severity Index 1 and 2 were 0.899, 0.791, and 0.865, respectively. The ROC curve of GPT-4 for high-acuity predictions showed an area under the curve (AUC) of 0.911 (95% CI: 0.814-1.000; p<0.001), while Bard showed an AUC of 0.819 (95% CI: 0.692-0.945; p<0.001) and Claude an AUC of 0.881 (95% CI: 0.768-0.994; p<0.001).

Conclusion: GPT-4, in its current form, was able to detect high-acuity Emergency Severity Index scores in our case set and had close agreement with Emergency Medicine specialists, followed by Claude, while Bard's agreement was relatively lower. GPT-4 and Claude provided better results than Bard in case management recommendations. We believe that studies evaluating the effectiveness and limitations of chatbots in triage are important because of their future potential.
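The over-triage and under-triage definitions and the F1 score reported above can be illustrated with a minimal Python sketch. It assumes that Emergency Severity Index levels 1 and 2 form the positive (high-acuity) class, as the Results suggest. The function name `triage_metrics` and the eight example cases are hypothetical and are not taken from the study data.

```python
from typing import Dict, Sequence

# Assumption: ESI levels 1 and 2 are treated as the high-acuity (positive) class.
HIGH_ACUITY = {1, 2}


def triage_metrics(reference: Sequence[int], predicted: Sequence[int]) -> Dict[str, float]:
    """Count over-/under-triage and compute F1 for high-acuity prediction.

    reference: ESI levels assigned by the specialists (1-5).
    predicted: ESI levels assigned by a chatbot (1-5).
    """
    if len(reference) != len(predicted):
        raise ValueError("reference and predicted must have the same length")

    tp = fp = fn = tn = 0
    for ref, pred in zip(reference, predicted):
        ref_high = ref in HIGH_ACUITY
        pred_high = pred in HIGH_ACUITY
        if ref_high and pred_high:
            tp += 1
        elif pred_high:
            fp += 1  # over-triage: predicted high acuity, reference was not
        elif ref_high:
            fn += 1  # under-triage: high-acuity case missed
        else:
            tn += 1

    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)) if (precision + recall) else 0.0
    return {
        "over_triage": fp,
        "under_triage": fn,
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }


# Hypothetical toy cases, NOT the study's 50 scenarios.
example_reference = [1, 2, 2, 3, 4, 5, 3, 2]
example_predicted = [1, 2, 3, 2, 4, 5, 3, 1]
example_result = triage_metrics(example_reference, example_predicted)
# One over-triage, one under-triage, F1 = 0.75 for this toy set.
```

Note that a mismatch within the same acuity group (for example, reference 2 and prediction 1) does not count as an error under this binary framing; a five-level comparison would need a different measure.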

Ethical Statement

Institutional review board approval was obtained for this study on 06.04.2023 (Kocaeli University Non-Interventional Clinical Research Ethics Committee - GOKAEK-2023/07.10).

Acknowledgments

The authors would like to thank Prof. Elif Yaka for her valuable insights.

Details

Primary Language

English

Subjects

Health Services and Systems (Other)

Journal Section

Research Article

Publication Date

December 26, 2023

Submission Date

October 1, 2023

Acceptance Date

October 12, 2023

Published in Issue

Year 2023 Volume: 25 Number: 3

APA
Sarbay, İ., Bozdereli Berikol, G., Özturan, İ. U., & Grimes, K. (2023). COMPARISON OF PERFORMANCES OF OPEN ACCESS NATURAL LANGUAGE PROCESSING BASED CHATBOT APPLICATIONS IN TRIAGE DECISIONS. The Journal of Kırıkkale University Faculty of Medicine, 25(3), 482-521. https://doi.org/10.24938/kutfd.1369468
AMA
1. Sarbay İ, Bozdereli Berikol G, Özturan İU, Grimes K. COMPARISON OF PERFORMANCES OF OPEN ACCESS NATURAL LANGUAGE PROCESSING BASED CHATBOT APPLICATIONS IN TRIAGE DECISIONS. Kırıkkale Uni Med J. 2023;25(3):482-521. doi:10.24938/kutfd.1369468
Chicago
Sarbay, İbrahim, Göksu Bozdereli Berikol, İbrahim Ulaş Özturan, and Keith Grimes. 2023. “COMPARISON OF PERFORMANCES OF OPEN ACCESS NATURAL LANGUAGE PROCESSING BASED CHATBOT APPLICATIONS IN TRIAGE DECISIONS”. The Journal of Kırıkkale University Faculty of Medicine 25 (3): 482-521. https://doi.org/10.24938/kutfd.1369468.
EndNote
Sarbay İ, Bozdereli Berikol G, Özturan İU, Grimes K (December 1, 2023) COMPARISON OF PERFORMANCES OF OPEN ACCESS NATURAL LANGUAGE PROCESSING BASED CHATBOT APPLICATIONS IN TRIAGE DECISIONS. The Journal of Kırıkkale University Faculty of Medicine 25 3 482–521.
IEEE
[1] İ. Sarbay, G. Bozdereli Berikol, İ. U. Özturan, and K. Grimes, “COMPARISON OF PERFORMANCES OF OPEN ACCESS NATURAL LANGUAGE PROCESSING BASED CHATBOT APPLICATIONS IN TRIAGE DECISIONS”, Kırıkkale Uni Med J, vol. 25, no. 3, pp. 482–521, Dec. 2023, doi: 10.24938/kutfd.1369468.
ISNAD
Sarbay, İbrahim - Bozdereli Berikol, Göksu - Özturan, İbrahim Ulaş - Grimes, Keith. “COMPARISON OF PERFORMANCES OF OPEN ACCESS NATURAL LANGUAGE PROCESSING BASED CHATBOT APPLICATIONS IN TRIAGE DECISIONS”. The Journal of Kırıkkale University Faculty of Medicine 25/3 (December 1, 2023): 482-521. https://doi.org/10.24938/kutfd.1369468.
JAMA
1. Sarbay İ, Bozdereli Berikol G, Özturan İU, Grimes K. COMPARISON OF PERFORMANCES OF OPEN ACCESS NATURAL LANGUAGE PROCESSING BASED CHATBOT APPLICATIONS IN TRIAGE DECISIONS. Kırıkkale Uni Med J. 2023;25:482–521.
MLA
Sarbay, İbrahim, et al. “COMPARISON OF PERFORMANCES OF OPEN ACCESS NATURAL LANGUAGE PROCESSING BASED CHATBOT APPLICATIONS IN TRIAGE DECISIONS”. The Journal of Kırıkkale University Faculty of Medicine, vol. 25, no. 3, Dec. 2023, pp. 482-521, doi:10.24938/kutfd.1369468.
Vancouver
1. İbrahim Sarbay, Göksu Bozdereli Berikol, İbrahim Ulaş Özturan, Keith Grimes. COMPARISON OF PERFORMANCES OF OPEN ACCESS NATURAL LANGUAGE PROCESSING BASED CHATBOT APPLICATIONS IN TRIAGE DECISIONS. Kırıkkale Uni Med J. 2023 Dec. 1;25(3):482-521. doi:10.24938/kutfd.1369468

This Journal is a Publication of Kırıkkale University Faculty of Medicine.