Araştırma Makalesi

A Balanced Machine Learning Approach to Obesity Risk Classification: Comparative Analysis and Feature Importance

Cilt: 9 Sayı: 2 31 Aralık 2025
PDF İndir
EN

A Balanced Machine Learning Approach to Obesity Risk Classification: Comparative Analysis and Feature Importance

Abstract

Obesity is a growing public health concern, particularly among university students who are exposed to lifestyle changes, disordered eating habits, and reduced physical activity. The aim of this study is to classify obesity risk levels among university students using machine learning classification methods and to identify the most influential factors associated with this risk. The study sample consisted of data collected from 445 students studying at Çankırı Karatekin University. In this context, eight machine learning algorithms—Logistic Regression, Random Forest, Extra Trees, Support Vector Machines, K-Nearest Neighbor, Quadratic Discriminant Analysis, Naive Bayes, and Multilayer Perceptron—were compared to classify obesity risk. Class imbalance in the dataset was addressed using the Adaptive Synthetic Sampling (ADASYN) method applied exclusively to the training set. The models were evaluated using standard performance metrics, and the highest accuracy rate (96.26%) was achieved by the Random Forest model, followed by Logistic Regression with 94.77% accuracy. Variable importance analysis indicated that age, internet use scale score, and fast-food consumption frequency were the most influential factors in classification, while the low correlation between variables (|r| < 0.2) suggested that model performance was driven by the combined contribution of multiple features. Overall, the findings demonstrate that the balanced machine learning approach, particularly ensemble-based methods, can classify obesity risk with high accuracy and provide valuable insights for targeted prevention strategies among university students.

Keywords

Etik Beyan

Çalışma için etik onay [T.C. ÇANKIRI KARATEKIN ÜNİVERSİTESİ Fen, Matematik ve Sosyal Bilimler Etik Kurulu]'ndan alınmıştır (Onay No: [44], Tarih: [23-08-2024]). Beyan edilecek herhangi bir çıkar çatışması yoktur.

Kaynakça

  1. 1. Akın, P. (2023). A new hybrid approach based on genetic algorithm and support vector machine methods for hyperparameter optimization in synthetic minority over-sampling technique (SMOTE). AIMS Mathematics, 8(6), 9400–9415.
  2. 2. Alzahrani, S. H., Saeedi, A. A., Baamer, M. K., Shalabi, A. F., & Alzahrani, A. M. (2020). Eating habits among medical students at king abdulaziz university, Jeddah, Saudi Arabia. International journal of general medicine, 77-88.
  3. 3. Bikku, T. (2020). Multi-layered deep learning perceptron approach for health risk prediction. Journal of Big Data, 7(1), 50.
  4. 4. Bishop, C. M., & Nasrabadi, N. M. (2006). Pattern recognition and machine learning (Vol. 4, No. 4, p. 738). New York: springer.
  5. 5. Breiman, L. (2001). Random forests. Machine learning, 45(1), 5-32.
  6. 6. Brownlee, J. (2020). Imbalanced classification with Python: better metrics, balance skewed classes, cost-sensitive learning. Machine Learning Mastery.
  7. 7. Chatterjee, A., Gerdes, M. W., & Martinez, S. G. (2020). Identification of risk factors associated with obesity and overweight—a machine learning overview. Sensors, 20(9), 2734.
  8. 8. Choudhuri, A. (2022). A hybrid machine learning model for estimation of obesity levels. In Data management, analytics and innovation conference (pp. 257–266). Springer. https://doi.org/10.1007/978-981-19-2600-6_22

Ayrıntılar

Birincil Dil

İngilizce

Konular

Sağlık ve Ekolojik Risk Değerlendirmesi , Dijital Sağlık

Bölüm

Araştırma Makalesi

Yayımlanma Tarihi

31 Aralık 2025

Gönderilme Tarihi

19 Ağustos 2025

Kabul Tarihi

17 Kasım 2025

Yayımlandığı Sayı

Yıl 2025 Cilt: 9 Sayı: 2

Kaynak Göster

APA
Koç, H., & Koc, T. (2025). A Balanced Machine Learning Approach to Obesity Risk Classification: Comparative Analysis and Feature Importance. Eurasian Journal of Health Technology Assessment, 9(2), 90-107. https://doi.org/10.52148/ehta.1768556

Açık erişimli ve çift-kör hakemli bir dergidir.

Dergi içeriği tüm kullanıcılara ücretsiz olarak sunulmaktadır.
Dergideki yazıların bilimsel sorumluluğu yazarlarına aittir.
Dergimizde yayınlanmış makaleler kaynak gösterilmeden kullanılamaz
© T.C. Sağlık Bakanlığı Sağlık Hizmetleri Genel Müdürlüğü Araştırma, Geliştirme ve Sağlık Teknolojisi Değerlendirme Daire Başkanlığı
Tüm Hakları Türkiye Cumhuriyeti Sağlık Bakanlığı Sağlık Hizmetleri Genel Müdürlüğüne aittir.