Research Article

Early-Stage Diabetes Detection Using Age-Based Feature Selection and Machine Learning Methods

Volume: 12 Number: 4 December 31, 2025

Early-Stage Diabetes Detection Using Age-Based Feature Selection and Machine Learning Methods

Abstract

Diabetes is spreading rapidly around the world, and there is an urgent need for governments to develop comprehensive strategies for diabetes detection. Early detection of diabetes is important for early initiation of treatment. In this paper, Data Mining (DM) and Machine Learning (ML) techniques are used to detect early diabetes by age from survey data. The dataset was divided into 3 groups (young, middle-aged, elderly), and a unique feature selection process was performed by averaging the feature importance obtained in Random Forest (RF), Gradient Boosting (GB), and eXtreme Gradient Boosting (XGBoost) algorithms for each group, and the features that should be considered in diabetes detection according to age groups were determined. Then, the features selected for each age group were classified using different ML methods. Accuracies of 96.77%, 98.10% and 99% were obtained for the young, middle-aged and elderly groups, respectively. The characteristics that should be taken into account in the assessment of diabetes according to age groups were also identified.

Keywords

References

  1. American Diabetes Association. (2014). Diagnosis and classification of diabetes mellitus. Diabetes Care, 37(Suppl 1), S81–S90. https://doi.org/10.2337/dc14-S081
  2. Arif Ali, Z., H. Abduljabbar, Z., A. Tahir, H., Bibo Sallow, A., & Almufti, S. M. (2023). eXtreme Gradient Boosting algorithm with machine learning: A review. Academic Journal of Nawroz University, 12(2), 320–334. https://doi.org/10.25007/ajnu.v12n2a1612
  3. Beagley, J., Guariguata, L., Weil, C., & Motala, A. A. (2014). Global estimates of undiagnosed diabetes in adults. Diabetes Research and Clinical Practice, 103(2), 150–160. https://doi.org/10.1016/j.diabres.2013.11.001
  4. Boser, B. E., Guyon, I. M., & Vapnik, V. N. (1992, July 27-29). A training algorithm for optimal margin classifiers. In: Proceedings of the 5th Annual Workshop on Computational Learning Theory (COLT’92) (pp. 144–152). Pittsburgh, PA. https://doi.org/10.1145/130385.130401
  5. Breiman, L. (1996). Bagging predictors. Machine Learning, 24(2), 123–140. https://doi.org/10.1007/BF00058655
  6. Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5–32. https://doi.org/10.1023/A:1010933404324
  7. Chen, T. Q., & Guestrin, C. (2016, August 13-17). XGBoost: A scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 785–794). https://doi.org/10.1145/2939672.2939785
  8. Cho, N. H., Shaw, J. E., Karuranga, S., Huang, Y., da Rocha Fernandes, J. D., Ohlrogge, A. W., & Malanda, B. (2018). IDF Diabetes Atlas: Global estimates of diabetes prevalence for 2017 and projections for 2045. Diabetes Research and Clinical Practice, 138, 271–281. https://doi.org/10.1016/j.diabres.2018.02.023

Details

Primary Language

English

Subjects

Computing Applications in Health

Journal Section

Research Article

Publication Date

December 31, 2025

Submission Date

July 29, 2025

Acceptance Date

October 21, 2025

Published in Issue

Year 2025 Volume: 12 Number: 4

APA
Uzbaş, B. (2025). Early-Stage Diabetes Detection Using Age-Based Feature Selection and Machine Learning Methods. Gazi University Journal of Science Part A: Engineering and Innovation, 12(4), 953-965. https://doi.org/10.54287/gujsa.1753574
AMA
1.Uzbaş B. Early-Stage Diabetes Detection Using Age-Based Feature Selection and Machine Learning Methods. GU J Sci, Part A. 2025;12(4):953-965. doi:10.54287/gujsa.1753574
Chicago
Uzbaş, Betül. 2025. “Early-Stage Diabetes Detection Using Age-Based Feature Selection and Machine Learning Methods”. Gazi University Journal of Science Part A: Engineering and Innovation 12 (4): 953-65. https://doi.org/10.54287/gujsa.1753574.
EndNote
Uzbaş B (December 1, 2025) Early-Stage Diabetes Detection Using Age-Based Feature Selection and Machine Learning Methods. Gazi University Journal of Science Part A: Engineering and Innovation 12 4 953–965.
IEEE
[1]B. Uzbaş, “Early-Stage Diabetes Detection Using Age-Based Feature Selection and Machine Learning Methods”, GU J Sci, Part A, vol. 12, no. 4, pp. 953–965, Dec. 2025, doi: 10.54287/gujsa.1753574.
ISNAD
Uzbaş, Betül. “Early-Stage Diabetes Detection Using Age-Based Feature Selection and Machine Learning Methods”. Gazi University Journal of Science Part A: Engineering and Innovation 12/4 (December 1, 2025): 953-965. https://doi.org/10.54287/gujsa.1753574.
JAMA
1.Uzbaş B. Early-Stage Diabetes Detection Using Age-Based Feature Selection and Machine Learning Methods. GU J Sci, Part A. 2025;12:953–965.
MLA
Uzbaş, Betül. “Early-Stage Diabetes Detection Using Age-Based Feature Selection and Machine Learning Methods”. Gazi University Journal of Science Part A: Engineering and Innovation, vol. 12, no. 4, Dec. 2025, pp. 953-65, doi:10.54287/gujsa.1753574.
Vancouver
1.Betül Uzbaş. Early-Stage Diabetes Detection Using Age-Based Feature Selection and Machine Learning Methods. GU J Sci, Part A. 2025 Dec. 1;12(4):953-65. doi:10.54287/gujsa.1753574