TR
EN
Early Stage Diabetes Prediction Using Machine Learning Methods
Abstract
Diabetes is a common disease that is incurable and fatal. Millions of people worldwide have diabetes and it directly affects people’s lives. Early diagnosis helps reduce the effects of diabetes and improve the life quality of patients, but in common case people live with diabetes for years before getting diagnosed. Early diagnosis can be done by applying machine learning methods on existing data of patients. In this way, people can quickly get diagnosed without taking a glucose screening test or any blood test. Answering a simple question set would be enough to determine if a person is diabetic or has a risk of being diabetic. In the proposed study, determination of diabetes is performed by machine learning techniques. In this scope, a publicly available diabetes dataset, which includes 16 features that are collected from 520 people, was used to create predictive models. Eight machine learning methods were individually performed over the dataset. The results of each model were validated by using a 10 fold cross validation schema. Addition to accuracy metric, confusion matrix based other performance metrics; precision, recall and f1 score, were also reported. All of the created models resulted in high accuracy scores. The minimum accuracy score was measured as 88.85% by using one of the basic machine learning techniques, Naive Bayes. The highest accuracy rate was 99.04%, which is obtained by using a one dimensional convolutional neural network model. The designed Convolutional Neural Network model also resulted in highest performance scores for other metrics as 100.00%, 98.63% and 99.31% for precision, recall and f1 scores, respectively. These findings indicate that the created 1D CNN model can be utilized in the determination of diabetic patients by asking only several questions to patients.
Keywords
Kaynakça
- Ampadu, H. (2021, May 01). Random Forests Understanding. AI Pool. https://ai-pool.com/a/s/random-forests-understanding
- Berkley, C. (2021, May 18). How Is Rapid Weight Loss Related to Diabetes. Verywell Health. https://www.verywellhealth.com/rapid-weight-loss-5101064
- Bilgin, G. (2021). Makine Öğrenmesi Algoritmaları Kullanarak Erken Dönemde Diyabet Hastalığı Riskinin Araştırılması. Zeki Sistemler Teori ve Uygulamaları Dergisi, 4(1), 55-64. https://doi.org/10.46387/bjesr.790225
- Cirino, E. (2019, July 6). What Causes Muscle Rigidity. Healthline. https://www.healthline.com/health/muscle-rigidity
- Coelho, S. (2021, April 28). What Is Blurred Vision. Verywell Health. https://www.verywellhealth.com/blurred-vision-5114184
- Draelos, R. (2019). Measuring Performance: The Confusion Matrix. Glass Box Medicine. https://glassboxmedicine.com/2019/02/17/measuring-performance-the-confusion-matrix/
- Harris, M. I., Klein, R., Welborn, T. A. & Knuiman, M. W. (1992). Onset of NIDDM occurs at least 4–7 yr before clinical diagnosis. Diabetes Care, 15(7), 815-819. DOI: 10.2337/diacare.15.7.815
- Hawkins, D. M., Subhash, C. B. & Mills, D. (2003). Assessing Model Fit by Cross-Validation. Journal of Chemical Information and Computer Sciences, 43(2), 579–586. https://doi.org/10.1021/ci025626i
Ayrıntılar
Birincil Dil
İngilizce
Konular
Mühendislik
Bölüm
Araştırma Makalesi
Yayımlanma Tarihi
1 Aralık 2021
Gönderilme Tarihi
28 Ekim 2021
Kabul Tarihi
8 Aralık 2021
Yayımlandığı Sayı
Yıl 2021 Sayı: 29
APA
Ergün, Ö. N., & O.ilhan, H. (2021). Early Stage Diabetes Prediction Using Machine Learning Methods. Avrupa Bilim ve Teknoloji Dergisi, 29, 52-57. https://doi.org/10.31590/ejosat.1015816
Cited By
Desenleştirilmiş Karma Verilerin Transfer Öğrenme Yöntemi Kullanılarak Evrişimli Sinir Ağlarıyla Sınıflandırılması
Süleyman Demirel Üniversitesi Fen Bilimleri Enstitüsü Dergisi
https://doi.org/10.19113/sdufenbed.1293579Feature Selection in the Diabetes Dataset with the Marine Predator Algorithm and Classification using Machine Learning Methods
Gazi Üniversitesi Fen Bilimleri Dergisi Part C: Tasarım ve Teknoloji
https://doi.org/10.29109/gujsc.1396051An explainable analysis of diabetes mellitus using statistical and artificial intelligence techniques
BMC Medical Informatics and Decision Making
https://doi.org/10.1186/s12911-024-02810-xA Hybrid Ensemble Approach for Early‐Stage Diabetes Detection
Healthcare Technology Letters
https://doi.org/10.1049/htl2.70060