Research Article

Predicting Stroke Risk with Machine Learning and Hyperparameter Optimization

Volume: 15 Number: 2 June 15, 2025
EN TR

Predicting Stroke Risk with Machine Learning and Hyperparameter Optimization

Abstract

Stroke is a serious medical condition that causes the death of brain cells due to insufficient blood flow due to blockage or rupture in the blood vessels leading to the brain. Stroke is the most common cause of death and disability in adults after heart attack and cancer, causing individuals to not only die but also live with permanent disabilities. In this study, 12 features and 7 different machine learning methods belonging to 5100 individuals in an open-source dataset were used to predict stroke risk. Hyperparameter optimization was applied to increase the performance of machine learning methods and the best parameters were selected. When the results were examined, the random forest algorithm was able to detect the risk of stroke with an accuracy of 96.98%, which is higher than other studies in literature. This study discusses the effective use of machine learning algorithms to predict stroke risk and efforts to improve model performance. The results obtained may help in more accurate determination of stroke risk and taking preventive measures.

Keywords

Classification, Hyperparameter optimization, Stroke, Machine learning

References

  1. Agatonovic-Kustrin, S., & Beresford, R. (2000). Basic concepts of artificial neural network (ANN) modeling and its application in pharmaceutical research. Journal of Pharmaceutical and Biomedical Analysis, 22(5), 717–727.
  2. Alomoush, W., Houssein, E. H., Alrosan, A., Abd-Alrazaq, A., Alweshah, M., & Alshinwan, M. (2024). Joint opposite selection enhanced Mountain Gazelle Optimizer for brain stroke classification. Evolutionary Intelligence, 17(4), 2865–2883.
  3. Arslan, A. K., Colak, C., & Sarihan, M. E. (2016). Different medical data mining approaches based prediction of ischemic stroke. Computer Methods and Programs in Biomedicine, 130, 87–92.
  4. Chawla, N. V, Bowyer, K. W., Hall, L. O., & Kegelmeyer, W. P. (2002). SMOTE: synthetic minority over-sampling technique. Journal of Artificial Intelligence Research, 16, 321–357.
  5. Delpont, B., Blanc, C., Osseby, G. V, Hervieu-Bègue, M., Giroud, M., & Béjot, Y. (2018). Pain after stroke: a review. Revue Neurologique, 174(10), 671–674.
  6. Emon, M. U., Keya, M. S., Meghla, T. I., Rahman, M. M., Al Mamun, M. S., & Kaiser, M. S. (2020). Performance analysis of machine learning approaches in stroke prediction. 2020 4th International Conference on Electronics, Communication and Aerospace Technology (ICECA), 1464–1469.
  7. Federico Soriano Palacios. (n.d.). Stroke Prediction Dataset. https://www.kaggle.com/datasets/fedesoriano/stroke-prediction-dataset?resource=download
  8. Firmansyah, M. R., & Astuti, Y. P. (2024). Stroke classification comparison with KNN through standardization and normalization techniques. Advance Sustainable Science, Engineering and Technology, 6(1), 2401012.
  9. Hosmer Jr, D. W., Lemeshow, S., & Sturdivant, R. X. (2013). Applied logistic regression (Vol. 398). John Wiley & Sons.
  10. Imran, B., Wahyudi, E., Subki, A., Salman, S., & Yani, A. (2022). Classification of stroke patients using data mining with adaboost, decision tree and random forest models. ILKOM Jurnal Ilmiah, 14(3), 218–228.
APA
Özkanat, B., & Şahin Sadık, E. (2025). Predicting Stroke Risk with Machine Learning and Hyperparameter Optimization. Karadeniz Fen Bilimleri Dergisi, 15(2), 633-647. https://doi.org/10.31466/kfbd.1538305
AMA
1.Özkanat B, Şahin Sadık E. Predicting Stroke Risk with Machine Learning and Hyperparameter Optimization. KFBD. 2025;15(2):633-647. doi:10.31466/kfbd.1538305
Chicago
Özkanat, Burak, and Evin Şahin Sadık. 2025. “Predicting Stroke Risk With Machine Learning and Hyperparameter Optimization”. Karadeniz Fen Bilimleri Dergisi 15 (2): 633-47. https://doi.org/10.31466/kfbd.1538305.
EndNote
Özkanat B, Şahin Sadık E (June 1, 2025) Predicting Stroke Risk with Machine Learning and Hyperparameter Optimization. Karadeniz Fen Bilimleri Dergisi 15 2 633–647.
IEEE
[1]B. Özkanat and E. Şahin Sadık, “Predicting Stroke Risk with Machine Learning and Hyperparameter Optimization”, KFBD, vol. 15, no. 2, pp. 633–647, June 2025, doi: 10.31466/kfbd.1538305.
ISNAD
Özkanat, Burak - Şahin Sadık, Evin. “Predicting Stroke Risk With Machine Learning and Hyperparameter Optimization”. Karadeniz Fen Bilimleri Dergisi 15/2 (June 1, 2025): 633-647. https://doi.org/10.31466/kfbd.1538305.
JAMA
1.Özkanat B, Şahin Sadık E. Predicting Stroke Risk with Machine Learning and Hyperparameter Optimization. KFBD. 2025;15:633–647.
MLA
Özkanat, Burak, and Evin Şahin Sadık. “Predicting Stroke Risk With Machine Learning and Hyperparameter Optimization”. Karadeniz Fen Bilimleri Dergisi, vol. 15, no. 2, June 2025, pp. 633-47, doi:10.31466/kfbd.1538305.
Vancouver
1.Burak Özkanat, Evin Şahin Sadık. Predicting Stroke Risk with Machine Learning and Hyperparameter Optimization. KFBD. 2025 Jun. 1;15(2):633-47. doi:10.31466/kfbd.1538305