Research Article

A Comparative Analysis of GPT-3.5, GPT-4 and GPT-4.o in Heart Failure

Volume: 50 Number: 3 January 12, 2025
EN TR

A Comparative Analysis of GPT-3.5, GPT-4 and GPT-4.o in Heart Failure

Abstract

Digitalization have increasingly penetrated in healthcare. Generative artificial intelligence (AI) is a type of AI technology that can generate new content. Patients can use AI-powered chatbots to get medical information. Heart failure is a syndrome with high morbidity and mortality. Patients search about heart failure in many web sites commonly. This study aimed to assess Large Language Models (LLMs) -ChatGPT 3.5, GPT-4 and GPT-4.o- in terms of their accuracy in answering the questions about heart failure (HF). Thirteen questions regarding to the definition, causes, signs and symptoms, complications, treatment and lifestyle recommendations of the HF were evaluated. These questions to assess the knowledge and awareness of medical students about heart failure were taken from a previous study in literature. Of the students who participated in this study, 158 (58.7%) were first-year students, while 111 (41.3%) were sixth-year students and were taking their cardiology internship in their fourth year. The questions were entered in Turkish language and 2 cardiologists with over ten years of experience evaluated the responses generated by different models including GPT-3.5, GPT-4 and GPT-4.o. ChatGPT-3.5 yielded “correct” responses to 8/13 (61.5%) of the questions whereas, GPT-4 yielded “correct” responses to 11/13 (84.6%) of the questions. All of the responses of GPT-4.o were accurate and complete. Performance of medical students did not include 100% correct answers for any question. This study revealed that performance of GPT-4.o was superior to GPT-3.5, but similar with GPT-4

Keywords

Ethical Statement

Bursa Uludağ Üniversitesi Tıp Fakültesi Dergisi’ne gönderdiğimiz “ A Comparative Analysis of GPT-3.5, GPT-4, GPT-4.o and Human Performance in Heart Failure” başlıklı makale, yapay zeka modellerine sorular sorularak yürütülmüştür. İnsan katılımcı yoktur. Literatürde yer alan benzer çalışmalarda olduğu gibi bu araştırmada da etik kurul onayı gerekmemektedir.

References

  1. 1-Braunwald E., Heart Failure, Journal of the American Collegeof Cardiology: Heart Failure, (2013). 1(1): 1-20.
  2. 2-Wagner S & Cohn K. Heart failure. A proposed definition andclassification. Arch Intern Med. 1977; 137: 675-678.
  3. 3-Biykem B. et al. Universal Definition and Classification ofHeart Failure, Journal of Cardiac Failure, (2021) 27 (4), 387-413.
  4. 4-Khan, M.S., Shahid, I., Bennis, A. et al. Global epidemiologyof heart failure. Nat Rev Cardiol (2024). https://doi.org/10.1038/s41569-024-01046-6
  5. 5-GBD 2017 Disease and Injury Incidence and PrevalenceCollaborators. Global, regional, and national incidence,prevalence, and years lived with disability for 354 diseases andinjuries for 195 countries and territories, 1990-2017: asystematic analysis for the Global Burden of Disease Study2017. Lancet 2018; 392: 1789– 1858.
  6. 6-Lloyd-Jones DM, Larson MG, Leip EP, et al. Lifetime risk fordeveloping congestive heart failure: the Framingham HeartStudy. Circulation. 2002;106(24):3068-3072.
  7. 7-Johansson S, Wallander M.A., Ruigomez A., Garcia RodriguezL.A. Incidence of newly diagnosed heart failure in UK generalpractice. Eur J Heart Fail. 2001; 3 (2): 225–231.
  8. 8-ITU releases 2015 ICT figures. Statistics confirm ICTrevolution of the past 15years. http://www.itu.int/net/pressoffice/press_releases/2015/17.aspx#.

Details

Primary Language

English

Subjects

Cardiovascular Medicine and Haematology (Other)

Journal Section

Research Article

Publication Date

January 12, 2025

Submission Date

September 4, 2024

Acceptance Date

November 18, 2024

Published in Issue

Year 2024 Volume: 50 Number: 3

APA
Günay-polatkan, Ş., & Sığırlı, D. (2025). A Comparative Analysis of GPT-3.5, GPT-4 and GPT-4.o in Heart Failure. Journal of Uludağ University Medical Faculty, 50(3), 443-447. https://doi.org/10.32708/uutfd.1543370
AMA
1.Günay-polatkan Ş, Sığırlı D. A Comparative Analysis of GPT-3.5, GPT-4 and GPT-4.o in Heart Failure. Journal of Uludağ University Medical Faculty. 2025;50(3):443-447. doi:10.32708/uutfd.1543370
Chicago
Günay-polatkan, Şeyda, and Deniz Sığırlı. 2025. “A Comparative Analysis of GPT-3.5, GPT-4 and GPT-4.O in Heart Failure”. Journal of Uludağ University Medical Faculty 50 (3): 443-47. https://doi.org/10.32708/uutfd.1543370.
EndNote
Günay-polatkan Ş, Sığırlı D (January 1, 2025) A Comparative Analysis of GPT-3.5, GPT-4 and GPT-4.o in Heart Failure. Journal of Uludağ University Medical Faculty 50 3 443–447.
IEEE
[1]Ş. Günay-polatkan and D. Sığırlı, “A Comparative Analysis of GPT-3.5, GPT-4 and GPT-4.o in Heart Failure”, Journal of Uludağ University Medical Faculty, vol. 50, no. 3, pp. 443–447, Jan. 2025, doi: 10.32708/uutfd.1543370.
ISNAD
Günay-polatkan, Şeyda - Sığırlı, Deniz. “A Comparative Analysis of GPT-3.5, GPT-4 and GPT-4.O in Heart Failure”. Journal of Uludağ University Medical Faculty 50/3 (January 1, 2025): 443-447. https://doi.org/10.32708/uutfd.1543370.
JAMA
1.Günay-polatkan Ş, Sığırlı D. A Comparative Analysis of GPT-3.5, GPT-4 and GPT-4.o in Heart Failure. Journal of Uludağ University Medical Faculty. 2025;50:443–447.
MLA
Günay-polatkan, Şeyda, and Deniz Sığırlı. “A Comparative Analysis of GPT-3.5, GPT-4 and GPT-4.O in Heart Failure”. Journal of Uludağ University Medical Faculty, vol. 50, no. 3, Jan. 2025, pp. 443-7, doi:10.32708/uutfd.1543370.
Vancouver
1.Şeyda Günay-polatkan, Deniz Sığırlı. A Comparative Analysis of GPT-3.5, GPT-4 and GPT-4.o in Heart Failure. Journal of Uludağ University Medical Faculty. 2025 Jan. 1;50(3):443-7. doi:10.32708/uutfd.1543370

Cited By

ISSN: 1300-414X, e-ISSN: 2645-9027

Creative Commons License
Journal of Uludag University Medical Faculty is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
2023