EN
Lightweight and Scalable Hybrid Ensemble Learning with Reduced Feature Sets for Robust and Interpretable Heart Disease Prediction
Abstract
Timely diagnosis of heart disease remains a global healthcare priority. Although machine learning (ML) models offer promising solutions, issues such as overfitting, limited interpretability, and dependency on high-dimensional features persist. This study introduces a feature-efficient, stacking-based ensemble framework for heart disease prediction by combining dimensionality reduction with hybrid modeling. Five hybrid models were proposed and evaluated: Hybrid Model 1 used Logistic Regression (LR), Naive Bayes (NB), and Random Forest (RF) as base learners with Ridge Classifier as the meta-learner; Hybrid Model 2a employed LR, NB, and Extreme Gradient Boosting (XGB) with Ridge as the meta-learner; Hybrid Model 2b retained the same base learners as 2a but utilized LR as the meta-learner, Hybrid Model 3 used LR and NB as base learners with Ridge Classifier as the meta-learner and Hybrid Model 4 used LR, NB, and RF as base learners and LR as the meta-learner. Models were evaluated using 13, 9, and 6 feature subsets from the Cleveland dataset, employing both 80:20 train-test splits and 10-fold stratified cross-validation. Hybrid Model 1 attained the highest accuracy of 91.27% and AUC of 0.906 (95% CI: 0.852–0.961) using only 9 features. Performance metrics were supported by confidence intervals, ROC curve analysis, and confusion matrices. The results demonstrate that stacking ensembles with reduced, high-impact features provides scalable, interpretable, and accurate diagnostic support for cardiovascular healthcare. Future research will focus on external validation, explainability integration, and cross-population generalizability.
Keywords
References
- [1] Chang, Y., Dong, M., Fan, L., Kang, B., Sun, W., Li, X., Yang, Z., and Ren, M., “Research on noninvasive electrophysiologic imaging based on cardiac electrophysiology simulation and deep learning methods for the inverse problem”, BMC Cardiovascular Disorders, 25(1): 335, (2025). DOI: https://doi.org/10.1186/s12872-025-04728-2
- [2] Bansal, A., and Hiwale, K., “Updates in the management of coronary artery disease: A review article”, Cureus, 15(12): e50644, (2023). DOI: https://doi.org/10.7759/cureus.50644.
- [3] Iacobescu, P., Marina, V., Anghel, C., and Anghele, A.D., “Evaluating binary classifiers for cardiovascular disease prediction: Enhancing early diagnostic capabilities”, Journal of Cardiovascular Development and Disease, 11(12): 396, (2024). DOI: https://doi.org/10.3390/jcdd11120396.
- [4] M.A. Chowdhury, R. Rizk, C. Chiu, J.J. Zhang, J.L. Scholl, T.J. Bosch, A. Singh, L.A. Baugh, J.S. McGough, K. Santosh, and W.C.W. Chen, “The heart of transformation: exploring artificial intelligence in cardiovascular disease”, Biomedicines, 13(2): 427, (2025). DOI: https://doi.org/10.3390/biomedicines13020427.
- [5] Chowdhury, M. A., Rizk, R., Chiu, C., Zhang, J. J., Scholl, J. L., Bosch, T. J., Singh, A., Baugh, L. A., McGough, J. S., Santosh, K., and Chen, W. C. W., “Delving into machine learning’s ınfluence on disease diagnosis and prediction”, The Open Public Health Journal, 17(1): (2024). DOI: https://doi.org/10.2174/0118749445297804240401061128.
- [6] Ogunpola, A., Saeed, F., Basurra, S., Albarrak, A. M., and Qasem, S. N., “Machine Learning-Based Predictive Models for detection of cardiovascular diseases”, Diagnostics, 14(2): 144, (2024). DOI: https://doi.org/10.3390/diagnostics14020144.
- [7] Chaudhari, J. P. , Patel, K. P., Mewada, H. K., Jayswal, H. S., Kosta, Y. P., Bhagat, K. S., and Kirange, S. D., “Recursive feature elimination and optimized hybrid ensemble approach for early heart disease prediction”, Advances in Technology Innovation, 10(1): 58–71, (2025). DOI: https://doi.org/10.46604/aiti.2024.13825.
- [8] Al-Alshaikh, H. A., P, P., Poonia, R. C., Saudagar, A. K. J., Yadav, M., AlSagri, H. S., and AlSanad, A. A., “Comprehensive evaluation and performance analysis of machine learning in heart disease prediction”, Scientific Reports, 14(1): 7819, (2024). DOI: https://doi.org/10.1038/s41598-024-58489-7.
Details
Primary Language
English
Subjects
Computing Applications in Life Sciences, Artificial Intelligence (Other)
Journal Section
Research Article
Authors
Early Pub Date
October 6, 2025
Publication Date
December 1, 2025
Submission Date
May 7, 2025
Acceptance Date
September 3, 2025
Published in Issue
Year 2025 Volume: 38 Number: 4
APA
Jain, R., Parmar, K., Palaniappan, D., & T, P. (2025). Lightweight and Scalable Hybrid Ensemble Learning with Reduced Feature Sets for Robust and Interpretable Heart Disease Prediction. Gazi University Journal of Science, 38(4), 1770-1794. https://doi.org/10.35378/gujs.1694513
AMA
1.Jain R, Parmar K, Palaniappan D, T P. Lightweight and Scalable Hybrid Ensemble Learning with Reduced Feature Sets for Robust and Interpretable Heart Disease Prediction. Gazi University Journal of Science. 2025;38(4):1770-1794. doi:10.35378/gujs.1694513
Chicago
Jain, Rituraj, Kumar Parmar, Damodharan Palaniappan, and Premavathi T. 2025. “Lightweight and Scalable Hybrid Ensemble Learning With Reduced Feature Sets for Robust and Interpretable Heart Disease Prediction”. Gazi University Journal of Science 38 (4): 1770-94. https://doi.org/10.35378/gujs.1694513.
EndNote
Jain R, Parmar K, Palaniappan D, T P (December 1, 2025) Lightweight and Scalable Hybrid Ensemble Learning with Reduced Feature Sets for Robust and Interpretable Heart Disease Prediction. Gazi University Journal of Science 38 4 1770–1794.
IEEE
[1]R. Jain, K. Parmar, D. Palaniappan, and P. T, “Lightweight and Scalable Hybrid Ensemble Learning with Reduced Feature Sets for Robust and Interpretable Heart Disease Prediction”, Gazi University Journal of Science, vol. 38, no. 4, pp. 1770–1794, Dec. 2025, doi: 10.35378/gujs.1694513.
ISNAD
Jain, Rituraj - Parmar, Kumar - Palaniappan, Damodharan - T, Premavathi. “Lightweight and Scalable Hybrid Ensemble Learning With Reduced Feature Sets for Robust and Interpretable Heart Disease Prediction”. Gazi University Journal of Science 38/4 (December 1, 2025): 1770-1794. https://doi.org/10.35378/gujs.1694513.
JAMA
1.Jain R, Parmar K, Palaniappan D, T P. Lightweight and Scalable Hybrid Ensemble Learning with Reduced Feature Sets for Robust and Interpretable Heart Disease Prediction. Gazi University Journal of Science. 2025;38:1770–1794.
MLA
Jain, Rituraj, et al. “Lightweight and Scalable Hybrid Ensemble Learning With Reduced Feature Sets for Robust and Interpretable Heart Disease Prediction”. Gazi University Journal of Science, vol. 38, no. 4, Dec. 2025, pp. 1770-94, doi:10.35378/gujs.1694513.
Vancouver
1.Rituraj Jain, Kumar Parmar, Damodharan Palaniappan, Premavathi T. Lightweight and Scalable Hybrid Ensemble Learning with Reduced Feature Sets for Robust and Interpretable Heart Disease Prediction. Gazi University Journal of Science. 2025 Dec. 1;38(4):1770-94. doi:10.35378/gujs.1694513