Theoretical Article

The Role of Machine Learning Models in Early Diabetes Diagnosis: A Dataset Based Analysis

Volume: 10 Number: 1 June 1, 2025
EN TR

The Role of Machine Learning Models in Early Diabetes Diagnosis: A Dataset Based Analysis

Abstract

Diabetes is a chronic metabolic disease in which the level of glucose in the blood rises above normal. The main reason for this is that the pancreas cannot produce enough insulin or the insulin produced cannot be used effectively. For diabetes to be managed and complications to be avoided, early diagnosis is essential. Advanced technologies such as machine learning contribute to both individual health management and public health systems by providing high accuracy rates in early diagnosis. In this study, it is aimed to examine the role of machine learning methods in the early diagnosis of diabetes. For this purpose, the methods were analysed on two different datasets. Support Vector Machines, Decision Trees, and Artificial Neural Networks were among the machine learning classifiers that were employed. In both datasets, the performance of the models in terms of metrics such as accuracy, sensitivity, and specificity were evaluated and compared. According to the results, the Bagged Trees algorithm showed the best performance with 96.2% in the first dataset we used, BIT Mesra Dataset. In the Pima Indian dataset, the SVM algorithm achieved an accuracy rate of 77.2%. The study provides a method for early diagnosis of diabetes, and emphasises the importance of data diversity in this field.

Keywords

References

  1. Arıkoğlu H, Kaya DE. (2015). Tip 2 diyabetin moleküler genetik temeli; Son gelişmeler. Genel Tıp Dergisi, 25(4), 147-159.
  2. Başer BÖ, Yangın M, Sarıdaş ES. (2021). Makine öğrenmesi teknikleriyle diyabet hastalığının sınıflandırılması. Süleyman Demirel Üniversitesi Fen Bilimleri Enstitüsü Dergisi, 25(1), 112-120.
  3. Buscema M. (2002). A brief overview and introduction to artificial neural networks. Substance use & misuse, 37(8-10), 1093-1148.
  4. Cervantes J, Garcia-Lamont F, Rodríguez-Mazahua L, Lopez A. (2020). A comprehensive survey on support vector machine classification: Applications, challenges, and trends. Neurocomputing, 408, 189-215.
  5. Modak, S. K. S., & Jha, V. K. (2024). Diabetes prediction model using machine learning techniques. Multimedia Tools and Applications, 83(13), 38523-38549.
  6. Önsüz MF, Topuzoğlu A. (2018). İstanbul İlinde Üç Hastanede Ayaktan İzlenen Tip II Diyabetik Hastalarda Glisemik Kontrolün Maliyet Etkinliğinin Değerlendirilmesi. ESTÜDAM Halk Sağlığı Dergisi, 3(2), 1-14.
  7. Patro VM, Patra MR. (2014). Augmenting weighted average with confusion matrix to enhance classification accuracy. Transactions on Machine Learning and Artificial Intelligence, 2(4), 77-91.
  8. Pima Indians Diabetes dataset (May 2008). Pima Indians Diabetes dataset. Available from: http://archive.ics.uci.edu/ml/machine-learning-databases/pima-indians-diabetes/pima-indians-diabetes data. Accessed 15 Dec 2024.

Details

Primary Language

English

Subjects

Machine Learning (Other)

Journal Section

Theoretical Article

Publication Date

June 1, 2025

Submission Date

February 3, 2025

Acceptance Date

February 20, 2025

Published in Issue

Year 2025 Volume: 10 Number: 1

APA
Çengel, E., Cengil, E., & Yıldırım, M. (2025). The Role of Machine Learning Models in Early Diabetes Diagnosis: A Dataset Based Analysis. Computer Science, 10(1), 33-42. https://doi.org/10.53070/bbd.1632130
AMA
1.Çengel E, Cengil E, Yıldırım M. The Role of Machine Learning Models in Early Diabetes Diagnosis: A Dataset Based Analysis. JCS. 2025;10(1):33-42. doi:10.53070/bbd.1632130
Chicago
Çengel, Ekemen, Emine Cengil, and Muhammed Yıldırım. 2025. “The Role of Machine Learning Models in Early Diabetes Diagnosis: A Dataset Based Analysis”. Computer Science 10 (1): 33-42. https://doi.org/10.53070/bbd.1632130.
EndNote
Çengel E, Cengil E, Yıldırım M (June 1, 2025) The Role of Machine Learning Models in Early Diabetes Diagnosis: A Dataset Based Analysis. Computer Science 10 1 33–42.
IEEE
[1]E. Çengel, E. Cengil, and M. Yıldırım, “The Role of Machine Learning Models in Early Diabetes Diagnosis: A Dataset Based Analysis”, JCS, vol. 10, no. 1, pp. 33–42, June 2025, doi: 10.53070/bbd.1632130.
ISNAD
Çengel, Ekemen - Cengil, Emine - Yıldırım, Muhammed. “The Role of Machine Learning Models in Early Diabetes Diagnosis: A Dataset Based Analysis”. Computer Science 10/1 (June 1, 2025): 33-42. https://doi.org/10.53070/bbd.1632130.
JAMA
1.Çengel E, Cengil E, Yıldırım M. The Role of Machine Learning Models in Early Diabetes Diagnosis: A Dataset Based Analysis. JCS. 2025;10:33–42.
MLA
Çengel, Ekemen, et al. “The Role of Machine Learning Models in Early Diabetes Diagnosis: A Dataset Based Analysis”. Computer Science, vol. 10, no. 1, June 2025, pp. 33-42, doi:10.53070/bbd.1632130.
Vancouver
1.Ekemen Çengel, Emine Cengil, Muhammed Yıldırım. The Role of Machine Learning Models in Early Diabetes Diagnosis: A Dataset Based Analysis. JCS. 2025 Jun. 1;10(1):33-42. doi:10.53070/bbd.1632130

The Creative Commons Attribution 4.0 International License 88x31.png is applied to all research papers published by JCS and

A Digital Object Identifier (DOI) Logo_TM.png is assigned for each published paper