Research Article

Long-Term Prediction of Coronary Artery Disease via Ensemble Machine Learning Algorithms

Volume: 15 Number: 4 December 23, 2024
TR EN

Long-Term Prediction of Coronary Artery Disease via Ensemble Machine Learning Algorithms

Abstract

Coronary artery disease (CAD) is the leading cause of death worldwide, necessitating early detection methods that are non-invasive, cost-effective, and reliable. In this study, the effectiveness of various machine learning (ML) models in predicting CAD was evaluated, with a focus on addressing class imbalance using the Synthetic Minority Oversampling Technique (SMOTE). The Framingham CAD dataset was utilized, and SMOTE was applied with different k-values to balance the data, examining the impact on prediction performance. Eight significant features—age, diaBP, glucose, heart rate, sysBP, totChol, cigsPerDay, and BMI—were determined during preprocessing and used for further analysis. Among the models tested, the StackingC classifier demonstrated superior performance, achieving an accuracy of 95.81%, sensitivity of 95.9%, specificity of 95.7%, and an AUROC of 99.2% for k=1. These findings highlight the potential of the StackingC model as a robust tool for CAD prediction, offering a promising non-invasive method for early diagnosis.

Keywords

References

  1. [1] World Health Organization, “The top 10 causes of death,” Jun. 09, 2024. [Online]. Available: https://www.who.int/news-room/fact-sheets/detail/the-top-10-causes-of-death
  2. [2] R. Alizadehsani et al., “Machine learning-based coronary artery disease diagnosis: A comprehensive review,” Comput. Biol. Med., vol. 111, p. 103346, Aug. 2019, doi: 10.1016/j.compbiomed.2019.103346.
  3. [3] S. Kutiame, R. Millham, A. F. Adekoya, M. Tettey, B. A. Weyori, and P. Appiahene, “Application of Machine Learning Algorithms in Coronary Heart Disease: A Systematic Literature Review and Meta-Analysis,” Int. J. Adv. Comput. Sci. Appl., vol. 13, no. 6, 2022, doi: 10.14569/IJACSA.2022.0130620.
  4. [4] I. Kurt, M. Ture, and A. T. Kurum, “Comparing performances of logistic regression, classification and regression tree, and neural networks for predicting coronary artery disease,” Expert Syst. Appl., vol. 34, no. 1, pp. 366–374, Jan. 2008, doi: 10.1016/j.eswa.2006.09.004.
  5. [5] I. Babaoğlu, O. Fındık, and M. Bayrak, “Effects of principle component analysis on assessment of coronary artery diseases using support vector machine,” Expert Syst. Appl., vol. 37, no. 3, pp. 2182–2185, Mar. 2010, doi: 10.1016/j.eswa.2009.07.055.
  6. [6] R. Alizadehsani et al., “A data mining approach for diagnosis of coronary artery disease,” Comput. Methods Programs Biomed., vol. 111, no. 1, pp. 52–61, Jul. 2013, doi: 10.1016/j.cmpb.2013.03.004.
  7. [7] S. Akila and S. Chandramathi, “A Hybrid Method for Coronary Heart Disease Risk Prediction using Decision Tree and Multi Layer Perceptron,” Indian J. Sci. Technol., vol. 8, no. 34, Dec. 2015, doi: 10.17485/ijst/2015/v8i34/85947.
  8. [8] Y.-T. Lo, H. Fujita, and T.-W. Pai, “PREDICTION OF CORONARY ARTERY DISEASE BASED ON ENSEMBLE LEARNING APPROACHES AND CO-EXPRESSED OBSERVATIONS,” J. Mech. Med. Biol., vol. 16, no. 01, p. 1640010, Feb. 2016, doi: 10.1142/S0219519416400108.

Details

Primary Language

English

Subjects

Machine Learning (Other)

Journal Section

Research Article

Early Pub Date

December 23, 2024

Publication Date

December 23, 2024

Submission Date

August 12, 2024

Acceptance Date

October 24, 2024

Published in Issue

Year 2024 Volume: 15 Number: 4

IEEE
[1]Ş. Aslan, “Long-Term Prediction of Coronary Artery Disease via Ensemble Machine Learning Algorithms”, DUJE, vol. 15, no. 4, pp. 827–837, Dec. 2024, doi: 10.24012/dumf.1531523.