Araştırma Makalesi

Leveraging SHAP for Interpretable Diabetes Prediction: A Study of Machine Learning Models on the Pima Indians Diabetes Dataset

Cilt: 13 Sayı: 2 30 Haziran 2025
PDF İndir
EN

Leveraging SHAP for Interpretable Diabetes Prediction: A Study of Machine Learning Models on the Pima Indians Diabetes Dataset

Öz

This paper investigates the application of machine learning (ML) models for predicting diabetes using the Pima Indians Diabetes Database, with a focus on enhancing model interpretability through the use of SHapley Additive exPlanations (SHAP). The study evaluates eight ML models, including Adaptive Boosting (AdaBoost), k-Nearest Neighbors (k-NN), Logistic Regression (LR), Multi-layer Perceptron (MLP), Naive Bayes (NB), Random Forest (RF), Support Vector Machine (SVM), and eXtreme Gradient Boosting (XGBoost), utilizing both test/train split and 10-fold cross-validation methods. The RF model demonstrated superior performance, achieving an accuracy of 82% and an F1-score of 0.83 in the test/train split, and an accuracy of 83% and an F1-score of 0.84 in the 10-fold cross-validation. SHAP analysis was employed to identify the most influential predictors, revealing that glucose, BMI, pregnancies, and insulin levels are the key factors in diabetes prediction, aligning with established clinical markers. Additionally, the use of the Synthetic Minority Over-sampling TEchnique (SMOTE) for class balancing and data scaling contributes to robust model performance. The study emphasizes the necessity for interpretable ML in healthcare, proposing SHAP as a valuable tool for bridging predictive accuracy and clinical transparency in diabetes diagnostics.

Anahtar Kelimeler

Kaynakça

  1. [1] P. David, S. Singh, R. Ankar, “A comprehensive overview of skin complications in diabetes and their prevention,” Cureus, vol. 15, no. 5, p. e38961, 2023.
  2. [2] A. F. Walker et al., “Interventions to address global inequity in diabetes: international progress,” Lancet, vol. 402, no. 10397, 2023, pp. 250-264.
  3. [3] M. Zakir et al., “Cardiovascular complications of diabetes: From microvascular to macrovascular pathways,” Cureus, vol. 15, no. 9, p. e45835, 2023.
  4. [4] A. Avogaro, M. Rigato, E. di Brino, D. Bianco, I. Gianotto, G. Brusaporco, “The socio-environmental determinants of diabetes and their consequences,” Acta Diabetol., vol. 61, no. 10, 2024, pp. 1205-1210.
  5. [5] S. Gowthami, R. Venkata Siva Reddy, M. R. Ahmed, “Exploring the effectiveness of machine learning algorithms for early detection of Type-2 Diabetes Mellitus,” Measur. Sens., vol. 31, no. 100983, p. 100983, 2024.
  6. [6] A. A. L. Ahmad, A. A. Mohamed, “Artificial intelligence and machine learning techniques in the diagnosis of type I diabetes: Case studies,” in Studies in Computational Intelligence, Singapore: Springer Nature Singapore, 2024, pp. 289-302.
  7. [7] T. Althobaiti, S. Althobaiti, M. M. Selim, “An optimized diabetes mellitus detection model for improved prediction of accuracy and clinical decision-making,” Alex. Eng. J., vol. 94, 2024, pp. 311-324.
  8. [8] R. F. Albadri, S. M. Awad, A. S. Hameed, T. H. Mandeel, R. A. Jabbar, “A diabetes prediction model using hybrid machine learning algorithm,” Math. Model. Eng. Probl., vol. 11, no. 8, 2024, pp. 2119-2126.

Ayrıntılar

Birincil Dil

İngilizce

Konular

Bilgisayar Yazılımı

Bölüm

Araştırma Makalesi

Erken Görünüm Tarihi

11 Temmuz 2025

Yayımlanma Tarihi

30 Haziran 2025

Gönderilme Tarihi

1 Kasım 2024

Kabul Tarihi

27 Aralık 2024

Yayımlandığı Sayı

Yıl 2025 Cilt: 13 Sayı: 2

Kaynak Göster

APA
Kırbaş, İ., & Çifci, A. (2025). Leveraging SHAP for Interpretable Diabetes Prediction: A Study of Machine Learning Models on the Pima Indians Diabetes Dataset. Balkan Journal of Electrical and Computer Engineering, 13(2), 128-139. https://doi.org/10.17694/bajece.1577929
AMA
1.Kırbaş İ, Çifci A. Leveraging SHAP for Interpretable Diabetes Prediction: A Study of Machine Learning Models on the Pima Indians Diabetes Dataset. Balkan Journal of Electrical and Computer Engineering. 2025;13(2):128-139. doi:10.17694/bajece.1577929
Chicago
Kırbaş, İsmail, ve Ahmet Çifci. 2025. “Leveraging SHAP for Interpretable Diabetes Prediction: A Study of Machine Learning Models on the Pima Indians Diabetes Dataset”. Balkan Journal of Electrical and Computer Engineering 13 (2): 128-39. https://doi.org/10.17694/bajece.1577929.
EndNote
Kırbaş İ, Çifci A (01 Haziran 2025) Leveraging SHAP for Interpretable Diabetes Prediction: A Study of Machine Learning Models on the Pima Indians Diabetes Dataset. Balkan Journal of Electrical and Computer Engineering 13 2 128–139.
IEEE
[1]İ. Kırbaş ve A. Çifci, “Leveraging SHAP for Interpretable Diabetes Prediction: A Study of Machine Learning Models on the Pima Indians Diabetes Dataset”, Balkan Journal of Electrical and Computer Engineering, c. 13, sy 2, ss. 128–139, Haz. 2025, doi: 10.17694/bajece.1577929.
ISNAD
Kırbaş, İsmail - Çifci, Ahmet. “Leveraging SHAP for Interpretable Diabetes Prediction: A Study of Machine Learning Models on the Pima Indians Diabetes Dataset”. Balkan Journal of Electrical and Computer Engineering 13/2 (01 Haziran 2025): 128-139. https://doi.org/10.17694/bajece.1577929.
JAMA
1.Kırbaş İ, Çifci A. Leveraging SHAP for Interpretable Diabetes Prediction: A Study of Machine Learning Models on the Pima Indians Diabetes Dataset. Balkan Journal of Electrical and Computer Engineering. 2025;13:128–139.
MLA
Kırbaş, İsmail, ve Ahmet Çifci. “Leveraging SHAP for Interpretable Diabetes Prediction: A Study of Machine Learning Models on the Pima Indians Diabetes Dataset”. Balkan Journal of Electrical and Computer Engineering, c. 13, sy 2, Haziran 2025, ss. 128-39, doi:10.17694/bajece.1577929.
Vancouver
1.İsmail Kırbaş, Ahmet Çifci. Leveraging SHAP for Interpretable Diabetes Prediction: A Study of Machine Learning Models on the Pima Indians Diabetes Dataset. Balkan Journal of Electrical and Computer Engineering. 01 Haziran 2025;13(2):128-39. doi:10.17694/bajece.1577929

Cited By

All articles published by BAJECE are licensed under the Creative Commons Attribution 4.0 International License. This permits anyone to copy, redistribute, remix, transmit and adapt the work provided the original work and source is appropriately cited.Creative Commons Lisans