Araştırma Makalesi

Comparison of the accuracy performances of the Gemini Advanced, the GPT-4, the Copilot, and the GPT-3.5 models in medical imaging systems: A Zero-shot prompting analysis

Cilt: 13 Sayı: 4 15 Ekim 2024
PDF İndir
TR EN

Comparison of the accuracy performances of the Gemini Advanced, the GPT-4, the Copilot, and the GPT-3.5 models in medical imaging systems: A Zero-shot prompting analysis

Öz

Large Language Models (LLMs) have gained popularity across healthcare and attracted the attention of researchers of various medical specialties. Determining which model performs well in which circumstances is essential for accurate results. This study aims to compare the accuracy of recently developed LLMs for medical imaging systems and to evaluate the reliability of LLMs in terms of correct responses. A total of 400 questions were divided into four categories: X-ray, ultrasound, magnetic resonance imaging, and nuclear medicine. LLMs’ responses were evaluated with a zero-prompting approach by measuring the percentage of correct answers. McNemar tests were used to evaluate the significance of differences between models, and Cohen kappa statistics were used to determine the reliability of the models. Gemini Advanced, GPT-4, Copilot, and GPT-3.5 resulted in accuracy rates of 86.25%, 84.25%, 77.5%, and 59.75%, respectively. There was a strong correlation between Gemini Advanced and the GPT-4 compared with other models, К=0.762. This study is the first that analyzes the accuracy of responses of recently developed LLMs: Gemini Advanced, GPT-4, Copilot, and GPT-3.5 on questions related to medical imaging systems. And a comprehensive dataset with three question types was created within medical imaging systems, which was evenly distributed from various sources.

Anahtar Kelimeler

Kaynakça

  1. S. R. Bowman, Eight things to know about large language models, arXiv preprint arXiv:2304.00612, 2023. https://doi.org/10.48550/arXiv.2304.01964
  2. ChatGPT. https://chat.openai.com/ Accessed 27 Feb. 2024.
  3. GPT-4. https://openai.com/research/gpt-4, Accessed 27 Feb. 2024.
  4. Bing Chat: how to use Microsoft’s own version of ChatGPT Digital Trends. https://www.digitaltrends .com/computing/how-to-use-microsoft-chatgpt-bing-edge/, Accessed 27 Feb. 2024.
  5. Gemini - Google DeepMind. https://deepmind.google /technologies/gemini/#gemini-1.0, Accessed 28 Feb. 2024.
  6. A. J. Thirunavukarasu, D. S. J. Ting, K. Elangovan, L. Gutierrez, T. F. Tan, and D. S. W. Ting, Large language models in medicine, Nature medicine, vol. 29, no. 8, pp. 1930–1940, 2023. https://doi.org/10.1038/s41591-023-02448-8
  7. A. Rao, J. Kim, M. Kamineni, M. Pang, W. Lie, K. J. Dreyer, M. D. Succi, Evaluating GPT as an adjunct for radiologic decision making: GPT-4 versus GPT-3.5 in a breast ımaging pilot, Journal of the American College of Radiology, vol. 20, no. 10, pp. 990–997, 2023. https://doi.org/10.1016/j.jacr.2023.05. 003
  8. H. Nori, N. King, S. M. McKinney, D. Carignan, and E. Horvitz, Capabilities of gpt-4 on medical challenge problems, arXiv preprint arXiv:2303.13375, 2023. https://doi.org/10.48550/arXiv.2303.13375

Ayrıntılar

Birincil Dil

İngilizce

Konular

Doğal Dil İşleme, Planlama ve Karar Verme, Biyomedikal Bilimler ve Teknolojiler

Bölüm

Araştırma Makalesi

Erken Görünüm Tarihi

11 Eylül 2024

Yayımlanma Tarihi

15 Ekim 2024

Gönderilme Tarihi

29 Mayıs 2024

Kabul Tarihi

30 Temmuz 2024

Yayımlandığı Sayı

Yıl 2024 Cilt: 13 Sayı: 4

Kaynak Göster

APA
Koç, A., & Öztiryaki, A. B. (2024). Comparison of the accuracy performances of the Gemini Advanced, the GPT-4, the Copilot, and the GPT-3.5 models in medical imaging systems: A Zero-shot prompting analysis. Niğde Ömer Halisdemir Üniversitesi Mühendislik Bilimleri Dergisi, 13(4), 1216-1223. https://doi.org/10.28948/ngumuh.1492129
AMA
1.Koç A, Öztiryaki AB. Comparison of the accuracy performances of the Gemini Advanced, the GPT-4, the Copilot, and the GPT-3.5 models in medical imaging systems: A Zero-shot prompting analysis. NÖHÜ Müh. Bilim. Derg. 2024;13(4):1216-1223. doi:10.28948/ngumuh.1492129
Chicago
Koç, Alpaslan, ve Ayşe Betül Öztiryaki. 2024. “Comparison of the accuracy performances of the Gemini Advanced, the GPT-4, the Copilot, and the GPT-3.5 models in medical imaging systems: A Zero-shot prompting analysis”. Niğde Ömer Halisdemir Üniversitesi Mühendislik Bilimleri Dergisi 13 (4): 1216-23. https://doi.org/10.28948/ngumuh.1492129.
EndNote
Koç A, Öztiryaki AB (01 Ekim 2024) Comparison of the accuracy performances of the Gemini Advanced, the GPT-4, the Copilot, and the GPT-3.5 models in medical imaging systems: A Zero-shot prompting analysis. Niğde Ömer Halisdemir Üniversitesi Mühendislik Bilimleri Dergisi 13 4 1216–1223.
IEEE
[1]A. Koç ve A. B. Öztiryaki, “Comparison of the accuracy performances of the Gemini Advanced, the GPT-4, the Copilot, and the GPT-3.5 models in medical imaging systems: A Zero-shot prompting analysis”, NÖHÜ Müh. Bilim. Derg., c. 13, sy 4, ss. 1216–1223, Eki. 2024, doi: 10.28948/ngumuh.1492129.
ISNAD
Koç, Alpaslan - Öztiryaki, Ayşe Betül. “Comparison of the accuracy performances of the Gemini Advanced, the GPT-4, the Copilot, and the GPT-3.5 models in medical imaging systems: A Zero-shot prompting analysis”. Niğde Ömer Halisdemir Üniversitesi Mühendislik Bilimleri Dergisi 13/4 (01 Ekim 2024): 1216-1223. https://doi.org/10.28948/ngumuh.1492129.
JAMA
1.Koç A, Öztiryaki AB. Comparison of the accuracy performances of the Gemini Advanced, the GPT-4, the Copilot, and the GPT-3.5 models in medical imaging systems: A Zero-shot prompting analysis. NÖHÜ Müh. Bilim. Derg. 2024;13:1216–1223.
MLA
Koç, Alpaslan, ve Ayşe Betül Öztiryaki. “Comparison of the accuracy performances of the Gemini Advanced, the GPT-4, the Copilot, and the GPT-3.5 models in medical imaging systems: A Zero-shot prompting analysis”. Niğde Ömer Halisdemir Üniversitesi Mühendislik Bilimleri Dergisi, c. 13, sy 4, Ekim 2024, ss. 1216-23, doi:10.28948/ngumuh.1492129.
Vancouver
1.Alpaslan Koç, Ayşe Betül Öztiryaki. Comparison of the accuracy performances of the Gemini Advanced, the GPT-4, the Copilot, and the GPT-3.5 models in medical imaging systems: A Zero-shot prompting analysis. NÖHÜ Müh. Bilim. Derg. 01 Ekim 2024;13(4):1216-23. doi:10.28948/ngumuh.1492129

Cited By