Research Article

Lightweight and Scalable Hybrid Ensemble Learning with Reduced Feature Sets for Robust and Interpretable Heart Disease Prediction

Volume: 38 Number: 4 December 1, 2025
EN

Lightweight and Scalable Hybrid Ensemble Learning with Reduced Feature Sets for Robust and Interpretable Heart Disease Prediction

Abstract

Timely diagnosis of heart disease remains a global healthcare priority. Although machine learning (ML) models offer promising solutions, issues such as overfitting, limited interpretability, and dependency on high-dimensional features persist. This study introduces a feature-efficient, stacking-based ensemble framework for heart disease prediction by combining dimensionality reduction with hybrid modeling. Five hybrid models were proposed and evaluated: Hybrid Model 1 used Logistic Regression (LR), Naive Bayes (NB), and Random Forest (RF) as base learners with Ridge Classifier as the meta-learner; Hybrid Model 2a employed LR, NB, and Extreme Gradient Boosting (XGB) with Ridge as the meta-learner; Hybrid Model 2b retained the same base learners as 2a but utilized LR as the meta-learner, Hybrid Model 3 used LR and NB as base learners with Ridge Classifier as the meta-learner and Hybrid Model 4 used LR, NB, and RF as base learners and LR as the meta-learner. Models were evaluated using 13, 9, and 6 feature subsets from the Cleveland dataset, employing both 80:20 train-test splits and 10-fold stratified cross-validation. Hybrid Model 1 attained the highest accuracy of 91.27% and AUC of 0.906 (95% CI: 0.852–0.961) using only 9 features. Performance metrics were supported by confidence intervals, ROC curve analysis, and confusion matrices. The results demonstrate that stacking ensembles with reduced, high-impact features provides scalable, interpretable, and accurate diagnostic support for cardiovascular healthcare. Future research will focus on external validation, explainability integration, and cross-population generalizability.

Keywords

References

  1. [1] Chang, Y., Dong, M., Fan, L., Kang, B., Sun, W., Li, X., Yang, Z., and Ren, M., “Research on noninvasive electrophysiologic imaging based on cardiac electrophysiology simulation and deep learning methods for the inverse problem”, BMC Cardiovascular Disorders, 25(1): 335, (2025). DOI: https://doi.org/10.1186/s12872-025-04728-2
  2. [2] Bansal, A., and Hiwale, K., “Updates in the management of coronary artery disease: A review article”, Cureus, 15(12): e50644, (2023). DOI: https://doi.org/10.7759/cureus.50644.
  3. [3] Iacobescu, P., Marina, V., Anghel, C., and Anghele, A.D., “Evaluating binary classifiers for cardiovascular disease prediction: Enhancing early diagnostic capabilities”, Journal of Cardiovascular Development and Disease, 11(12): 396, (2024). DOI: https://doi.org/10.3390/jcdd11120396.
  4. [4] M.A. Chowdhury, R. Rizk, C. Chiu, J.J. Zhang, J.L. Scholl, T.J. Bosch, A. Singh, L.A. Baugh, J.S. McGough, K. Santosh, and W.C.W. Chen, “The heart of transformation: exploring artificial intelligence in cardiovascular disease”, Biomedicines, 13(2): 427, (2025). DOI: https://doi.org/10.3390/biomedicines13020427.
  5. [5] Chowdhury, M. A., Rizk, R., Chiu, C., Zhang, J. J., Scholl, J. L., Bosch, T. J., Singh, A., Baugh, L. A., McGough, J. S., Santosh, K., and Chen, W. C. W., “Delving into machine learning’s ınfluence on disease diagnosis and prediction”, The Open Public Health Journal, 17(1): (2024). DOI: https://doi.org/10.2174/0118749445297804240401061128.
  6. [6] Ogunpola, A., Saeed, F., Basurra, S., Albarrak, A. M., and Qasem, S. N., “Machine Learning-Based Predictive Models for detection of cardiovascular diseases”, Diagnostics, 14(2): 144, (2024). DOI: https://doi.org/10.3390/diagnostics14020144.
  7. [7] Chaudhari, J. P. , Patel, K. P., Mewada, H. K., Jayswal, H. S., Kosta, Y. P., Bhagat, K. S., and Kirange, S. D., “Recursive feature elimination and optimized hybrid ensemble approach for early heart disease prediction”, Advances in Technology Innovation, 10(1): 58–71, (2025). DOI: https://doi.org/10.46604/aiti.2024.13825.
  8. [8] Al-Alshaikh, H. A., P, P., Poonia, R. C., Saudagar, A. K. J., Yadav, M., AlSagri, H. S., and AlSanad, A. A., “Comprehensive evaluation and performance analysis of machine learning in heart disease prediction”, Scientific Reports, 14(1): 7819, (2024). DOI: https://doi.org/10.1038/s41598-024-58489-7.

Details

Primary Language

English

Subjects

Computing Applications in Life Sciences, Artificial Intelligence (Other)

Journal Section

Research Article

Early Pub Date

October 6, 2025

Publication Date

December 1, 2025

Submission Date

May 7, 2025

Acceptance Date

September 3, 2025

Published in Issue

Year 2025 Volume: 38 Number: 4

APA
Jain, R., Parmar, K., Palaniappan, D., & T, P. (2025). Lightweight and Scalable Hybrid Ensemble Learning with Reduced Feature Sets for Robust and Interpretable Heart Disease Prediction. Gazi University Journal of Science, 38(4), 1770-1794. https://doi.org/10.35378/gujs.1694513
AMA
1.Jain R, Parmar K, Palaniappan D, T P. Lightweight and Scalable Hybrid Ensemble Learning with Reduced Feature Sets for Robust and Interpretable Heart Disease Prediction. Gazi University Journal of Science. 2025;38(4):1770-1794. doi:10.35378/gujs.1694513
Chicago
Jain, Rituraj, Kumar Parmar, Damodharan Palaniappan, and Premavathi T. 2025. “Lightweight and Scalable Hybrid Ensemble Learning With Reduced Feature Sets for Robust and Interpretable Heart Disease Prediction”. Gazi University Journal of Science 38 (4): 1770-94. https://doi.org/10.35378/gujs.1694513.
EndNote
Jain R, Parmar K, Palaniappan D, T P (December 1, 2025) Lightweight and Scalable Hybrid Ensemble Learning with Reduced Feature Sets for Robust and Interpretable Heart Disease Prediction. Gazi University Journal of Science 38 4 1770–1794.
IEEE
[1]R. Jain, K. Parmar, D. Palaniappan, and P. T, “Lightweight and Scalable Hybrid Ensemble Learning with Reduced Feature Sets for Robust and Interpretable Heart Disease Prediction”, Gazi University Journal of Science, vol. 38, no. 4, pp. 1770–1794, Dec. 2025, doi: 10.35378/gujs.1694513.
ISNAD
Jain, Rituraj - Parmar, Kumar - Palaniappan, Damodharan - T, Premavathi. “Lightweight and Scalable Hybrid Ensemble Learning With Reduced Feature Sets for Robust and Interpretable Heart Disease Prediction”. Gazi University Journal of Science 38/4 (December 1, 2025): 1770-1794. https://doi.org/10.35378/gujs.1694513.
JAMA
1.Jain R, Parmar K, Palaniappan D, T P. Lightweight and Scalable Hybrid Ensemble Learning with Reduced Feature Sets for Robust and Interpretable Heart Disease Prediction. Gazi University Journal of Science. 2025;38:1770–1794.
MLA
Jain, Rituraj, et al. “Lightweight and Scalable Hybrid Ensemble Learning With Reduced Feature Sets for Robust and Interpretable Heart Disease Prediction”. Gazi University Journal of Science, vol. 38, no. 4, Dec. 2025, pp. 1770-94, doi:10.35378/gujs.1694513.
Vancouver
1.Rituraj Jain, Kumar Parmar, Damodharan Palaniappan, Premavathi T. Lightweight and Scalable Hybrid Ensemble Learning with Reduced Feature Sets for Robust and Interpretable Heart Disease Prediction. Gazi University Journal of Science. 2025 Dec. 1;38(4):1770-94. doi:10.35378/gujs.1694513