Improving ICU Mortality Prediction via Meta-Learning and Explainable AI: A MetaCost and LIME Approach

Fahrettin Kaya

doi:10.31466/kfbd.1678766

TR EN

Meta-Öğrenme ve Açıklanabilir Yapay Zekâ ile Yoğun Bakım Ölüm Tahmininin İyileştirilmesi: MetaCost ve LIME Yaklaşımı

Abstract

Bu çalışma, yoğun bakım ünitesindeki (YBÜ) hastalar için mortalite tahminini geliştirmek amacıyla bir Meta-Öğrenme yaklaşımı kullanmayı ve bireysel tahminlerin açıklanabilirliğini LIME yöntemiyle değerlendirmeyi amaçlamıştır. Çalışmada, MIMIC-III veri tabanından alınan 428 hasta kaydı analiz edilmiştir. Veriler; demografik bilgiler, laboratuvar sonuçları (örneğin, Anyon açığı, Üre azotu) ve yandaş hastalıklar gibi 48 değişken içermektedir. Veri seti dengesizdir; mortalite oranı %15, sağkalım oranı ise %85’tir. Bu dengesizliği gidermek amacıyla, Meta-Öğrenme yöntemi olan MetaCost algoritması kullanılarak Makine Öğrenmesi modelleri (örneğin, Gradient Boosting, Random Forest) uyarlanmıştır. Model performansı, dengesiz veri setlerine uygun metriklerle değerlendirilmiştir: Ortalama Kesinlik (AP), duyarlılık (recall), F2 skoru ve Matthews korelasyon katsayısı (MCC). Özellik önem dereceleri istatistiksel olarak doğrulanmış, bireysel hasta düzeyinde açıklanabilirlik için LIME yöntemi uygulanmıştır. Tek değişkenli analiz sonucunda, ölen ve hayatta kalan hastaları ayırt eden 24 istatistiksel olarak anlamlı değişken (P<0.01) belirlenmiştir. MetaCost ile güçlendirilmiş Gradient Boosting modeli en iyi performansı göstermiştir; AUC: 0.91, AP: 0.75, duyarlılık: 0.86, F2 skoru: 0.85 ve MCC: 0.79. MetaCost algoritması, YBÜ mortalite tahmininde doğruluğu artırmada etkili bulunmuştur. LIME yöntemi ile modelin bireysel hasta düzeyinde yorumlanabilirliğini artırmıştır. Bu yaklaşım, klinik karar destek sistemlerini daha şeffaf ve güvenilir hale getirebilir. Ancak, bulguların doğrulanması için farklı veri setlerinde ek çalışmalara ihtiyaç vardır.

Keywords

Yoğun bakım mortalite tahmini, Makine öğrenmesi, Lojistik regresyon, Açıklanabilir AI

Improving ICU Mortality Prediction via Meta-Learning and Explainable AI: A MetaCost and LIME Approach

Abstract

This study aimed to enhance mortality prediction for Intensive Care Unit (ICU) patients using a Meta-Learning approach and to evaluate the explainability of individual predictions using the LIME (Local Interpretable Model-agnostic Explanations) method. This study analyzed 428 patient records from the MIMIC-III database, including 48 variables including demographics, laboratory results (e.g., Anion gap, Urea nitrogen), and comorbidities. The dataset was imbalanced, with 15% mortality and 85% survival. To address this issue, machine learning models (e.g., Gradient Boosting, Random Forest) were adapted using the MetaCost algorithm, which is a meta-learning method. Performance was evaluated using metrics suited for imbalanced data, such as Average Precision (AP), recall, F2 score, and the Matthews correlation coefficient (MCC). Feature importance was validated statistically, and LIME was applied for per-patient interpretability. Univariate analysis identified 24 statistically significant features (P<0.01) differentiating between deceased and surviving patients. The MetaCost-enhanced Gradient Boosting model achieved the best performance, with an AUC of 0.91, AP of 0.75, recall of 0.86, F2 score of 0.85, and MCC of 0.79. The MetaCost algorithm effectively improves ICU mortality prediction accuracy, while LIME enhances interpretability at the individual patient level. This approach can make clinical decision support systems more transparent and reliable. However, further validation on diverse datasets is required to confirm these findings.

Keywords

ICU mortality prediction, Machine learning, Logistic regression, Explainable AI

References

Bagshaw, S. M., George, C., Bellomo, R., and ANZICS Database Management Committee. (2008). A comparison of the RIFLE and AKIN criteria for acute kidney injury in critically ill patients. Nephrology Dialysis Transplantation, 23(5), 1569–1574. https://doi.org/10.1093/ndt/gfn009
Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5–32. https://doi.org/10.1023/A:1010933404324
Çanga, D., and Boğa, M. (2020). Determination of the effect of some properties on egg yield with regression analysis method bagging Mars and R application. Turkish Journal of Agriculture - Food Science and Technology, 8(8), 1705–1712. https://doi.org/10.24925/turjaf.v8i8.1705-1712.3468
Çanga, D., and Boğa, M. (2022). Detection of correct pregnancy status in lactating dairy cattle using MARS data mining algorithm. Turkish Journal of Veterinary & Animal Sciences, 46(6), 809–819. https://doi.org/10.55730/1300-0128.4257
Çanga Boğa, D., Boğa, M., and Tırınk, C. (2024). Comparison of nonlinear functions to define the growth in intensive feedlot system with XGBoost algorithm. Turkish Journal of Agriculture - Food Science and Technology, 12(8), 1408–1416. https://doi.org/10.24925/turjaf.v12i8.1408-1416.6562
Çelik, S., and Yilmaz, O. (2021). The relationship between the coat colors of Kars shepherd dog and its morphological characteristics using some data mining methods. International Journal of Livestock Research, 11(1), 53–61. https://doi.org/10.5455/ijlr.20200604
Chen, T., and Guestrin, C. (2016). XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 785–794). Association for Computing Machinery. https://doi.org/10.1145/2939672.2939785
Cheng, B., Li, D., Gong, Y., Ying, B., and Wang, B. (2020). Serum anion gap predicts all-cause mortality in critically ill patients with acute kidney injury: Analysis of the MIMIC-III database. Disease Markers, 2020, Article 6501272. https://doi.org/10.1155/2020/6501272
Chicco, D., Tötsch, N., and Jurman, G. (2021). The Matthews correlation coefficient (MCC) is more reliable than balanced accuracy, bookmaker informedness, and markedness in two-class confusion matrix evaluation. BioData Mining, 14, 13. https://doi.org/10.1186/s13040-021-00244-z
Domingos, P. (1999). MetaCost: A general method for making classifiers cost-sensitive. In Proceedings of the Fifth International Conference on Knowledge Discovery and Data Mining (pp. 155–164). Association for Computing Machinery.

Eratlı, Ş. Y., and Şahin, M. (2020). Investigation of factors affecting the achievement of university students with logistic regression analysis: School of Physical Education and Sport example. SAGE Open, 10(1). https://doi.org/10.1177/2158244020902082
Friedman, J. (2000). Greedy function approximation: A gradient boosting machine. The Annals of Statistics, 29(5), 1189–1232. https://doi.org/10.1214/aos/1013203451
Johnson, A. E. W., Pollard, T. J., Shen, L., Lehman, L. H., Feng, M., Ghassemi, M., Moody, B., Szolovits, P., Celi, L. A., and Mark, R. G. (2016). MIMIC-III, a freely accessible critical care database. Scientific Data, 3, 160035. https://doi.org/10.1038/sdata.2016.35
Jorgensen, M., Gustafsen, K., Ernst, S., and Thostrup, J. S. (1992). Disseminated intravascular coagulation in critically ill patients: Laboratory diagnosis. Intensive Care World, 9(3), 108–113.
Kaya, F., Korkmaz, F., and Efe, E. (2019). Advanced machine learning techniques for predictive modeling. In International Symposium on Advanced Engineering Technologies (ISADET), Kahramanmaraş, Turkey.
Lawson, M. J. (2009). The impact of cost matrix selection on cost-sensitive learning: An empirical study (Doctoral dissertation, Virginia Polytechnic Institute and State University). VTechWorks. https://vtechworks.lib.vt.edu/handle/10919/29623
Li, F., Xin, H., Zhang, J., Fu, M., Zhou, J., and Lian, Z. (2021). Prediction model of in-hospital mortality in intensive care unit patients with heart failure: Machine learning-based, retrospective analysis of the MIMIC-III database. BMJ Open, 11(7), e044779. https://doi.org/10.1136/bmjopen-2020-044779
Libório, A. B., Noritomi, D. T., Leite, T. T., de Melo Bezerra, C. T., de Faria, E. R., and Kellum, J. A. (2015). Increased serum bicarbonate in critically ill patients: A retrospective analysis. Intensive Care Medicine, 41(3), 479–486. https://doi.org/10.1007/s00134-015-3649-9
Ling, C. X., and Sheng, V. S. (2008). Cost-sensitive learning and the class imbalance problem. In C. Sammut (Ed.), Encyclopedia of Machine Learning. Springer. https://www.csd.uwo.ca/~xling/papers/cost_sensitive.pdf
Lundberg, S. M., Erion, G., Chen, H., DeGrave, A., Prutkin, J. M., Nair, B., Katz, R., Himmelfarb, J., Bansal, N., and Lee, S.-I. (2020). From local explanations to global understanding with explainable AI for trees. Nature Machine Intelligence, 2(1), 56–67. https://doi.org/10.1038/s42256-019-0138-9
Mandelbaum, T., Scott, D. J., Lee, J., Mark, R. G., Malhotra, A., Waikar, S. S., Howell, M. D., and Talmor, D. (2011). Outcome of critically ill patients with acute kidney injury using the Acute Kidney Injury Network criteria. Critical Care Medicine, 39(12), 2659–2664. https://doi.org/10.1097/CCM.0b013e31822823e7
Melchers, M., and van Zanten, A. R. H. (2023). Management of hypocalcaemia in the critically ill. Current Opinion in Critical Care, 29(4), 330–338. https://doi.org/10.1097/MCC.0000000000001059
Meyer, B., Huelsmann, M., Wexberg, P., Delle Karth, G., Berger, R., Moertl, D., and Pacher, R. (2007). N-terminal pro-B-type natriuretic peptide is an independent predictor of outcome in an unselected cohort of critically ill patients. Critical Care Medicine, 35(10), 2268–2273. https://doi.org/10.1097/01.CCM.0000284507.59405.53
Núñez Reiz, A., Armengol de la Hoz, M. A., and Sánchez García, M. (2018). Big data analysis and machine learning in intensive care units. Medicina Intensiva (English Edition), 43(7), 416–426. https://doi.org/10.1016/j.medine.2019.06.012
Önder, H., Tirink, C., Yakubets, T., Getya, A., Matvieiev, M., Kononenko, R., ... and Kaya, F. (2025). Predicting live weight for female rabbits of meat crosses from body measurements using LightGBM, XGBoost and support vector machine algorithms. Veterinary Medicine and Science, 11(1), e70149. https://doi.org/10.1002/vms3.70149
Powers, D. M. W. (2020). Evaluation: From precision, recall and F-measure to ROC, informedness, markedness and correlation. arXiv. https://arxiv.org/abs/2010.16061
Quinlan, J. R. (1986). Induction of decision trees. Machine Learning, 1(1), 81–106. https://doi.org/10.1007/BF00116251
Ribeiro, M. T., Singh, S., and Guestrin, C. (2016). “Why should I trust you?”: Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 1135–1144). Association for Computing Machinery. https://doi.org/10.1145/2939672.2939778
Saito, T., and Rehmsmeier, M. (2015). The precision-recall plot is more informative than the ROC plot when evaluating binary classifiers on imbalanced datasets. PLoS ONE, 10(3), e0118432. https://doi.org/10.1371/journal.pone.0118432
Serim, A. B. Ö. (2023). Characterization of mortality prediction: An ensemble learning analysis using the MIMIC-III dataset. Journal of Scientific Reports-A, (054), 364–384.
Tonekaboni, S., Joshi, S., Goldenberg, A., and Duvenaud, D. (2019). What clinicians want: Contextualizing explainable machine learning for clinical end use. In Proceedings of Machine Learning for Healthcare (Vol. 106, pp. 359–374). http://proceedings.mlr.press/v106/tonekaboni19a.html
Yavuz, E. (2023). Determining the factors affecting the achievement status of high school students by logistic regression. ResearchGate, 400, 155–171.
Zhang, J., Zhao, Q., Liu, S., Yuan, N., and Hu, Z. (2024). Clinical predictive value of the CRP-albumin-lymphocyte index for prognosis of critically ill patients with sepsis in intensive care unit: A retrospective single-center observational study. Frontiers in Public Health, 12, 1395134. https://doi.org/10.3389/fpubh.2024.1395134
Zhou, J., Li, F., Song, Y., et al. (2023). Prediction model of in-hospital mortality in intensive care unit patients with heart failure: Machine learning-based, retrospective analysis of the MIMIC-III database [Dataset]. Dryad. https://doi.org/10.5061/dryad.0p2ngf1zd

Details

Primary Language

English

Subjects

Software Engineering (Other)

Journal Section

Research Article

Authors

Fahrettin Kaya ^*
0000-0003-1666-4859
Türkiye

Publication Date

February 24, 2026

Submission Date

April 17, 2025

Acceptance Date

September 29, 2025

Published in Issue

Year 2026 Volume: 16 Number: 1

DOI

https://doi.org/10.31466/kfbd.1678766

IZ

https://izlik.org/JA35CH56RW

APA

Kaya, F. (2026). Improving ICU Mortality Prediction via Meta-Learning and Explainable AI: A MetaCost and LIME Approach. Karadeniz Fen Bilimleri Dergisi, 16(1), 140-155. https://doi.org/10.31466/kfbd.1678766

AMA

1.Kaya F. Improving ICU Mortality Prediction via Meta-Learning and Explainable AI: A MetaCost and LIME Approach. KFBD. 2026;16(1):140-155. doi:10.31466/kfbd.1678766

Chicago

Kaya, Fahrettin. 2026. “Improving ICU Mortality Prediction via Meta-Learning and Explainable AI: A MetaCost and LIME Approach”. Karadeniz Fen Bilimleri Dergisi 16 (1): 140-55. https://doi.org/10.31466/kfbd.1678766.

EndNote

Kaya F (February 1, 2026) Improving ICU Mortality Prediction via Meta-Learning and Explainable AI: A MetaCost and LIME Approach. Karadeniz Fen Bilimleri Dergisi 16 1 140–155.

IEEE

[1]F. Kaya, “Improving ICU Mortality Prediction via Meta-Learning and Explainable AI: A MetaCost and LIME Approach”, KFBD, vol. 16, no. 1, pp. 140–155, Feb. 2026, doi: 10.31466/kfbd.1678766.

ISNAD

Kaya, Fahrettin. “Improving ICU Mortality Prediction via Meta-Learning and Explainable AI: A MetaCost and LIME Approach”. Karadeniz Fen Bilimleri Dergisi 16/1 (February 1, 2026): 140-155. https://doi.org/10.31466/kfbd.1678766.

JAMA

1.Kaya F. Improving ICU Mortality Prediction via Meta-Learning and Explainable AI: A MetaCost and LIME Approach. KFBD. 2026;16:140–155.

MLA

Kaya, Fahrettin. “Improving ICU Mortality Prediction via Meta-Learning and Explainable AI: A MetaCost and LIME Approach”. Karadeniz Fen Bilimleri Dergisi, vol. 16, no. 1, Feb. 2026, pp. 140-55, doi:10.31466/kfbd.1678766.

Vancouver

1.Fahrettin Kaya. Improving ICU Mortality Prediction via Meta-Learning and Explainable AI: A MetaCost and LIME Approach. KFBD. 2026 Feb. 1;16(1):140-55. doi:10.31466/kfbd.1678766

The published articles in The Black Sea Journal of Sciences are licensed under Creative Commons Attribution-NonCommercial 4.0 International

Meta-Öğrenme ve Açıklanabilir Yapay Zekâ ile Yoğun Bakım Ölüm Tahmininin İyileştirilmesi: MetaCost ve LIME Yaklaşımı

Abstract

Keywords

Improving ICU Mortality Prediction via Meta-Learning and Explainable AI: A MetaCost and LIME Approach

Abstract

Keywords

References

Details

Primary Language

Subjects

Journal Section

Authors

Publication Date

Submission Date

Acceptance Date

Published in Issue

DOI

IZ

Cite