Comparison of Machine Learning Regression Methods to Predict Health Expenditures
Öz
As a result of experimental studies on different datasets, it is recommended to use machine learning regression methods as an alternative to classical regression methods in the existence of variables which are difficult to model. Health expenditure is an indicator which is difficult to model and there is no study in the literature about modelling health expenditure comparing machine learning regression methods. In this study a multiple regression model was conducted to predict health expenditure per capita. Performance results of Lasso Regression, Random Forest Regression and Support Vector Machine Regression compared when different hyperparameter values were determined. Lambda (λ) value for Lasso Regression, number of trees for Random Forest Regression, epsilon () value for Support Vector Regression was determined as hyperparameter values. Study results performed by using “k” fold cross validation changed from 5 to 50, indicate the difference between machine learning results in terms of R2, RMSE and MAE values that are statistically significant (p<0.001). Surface and bar plots and statistical test results of prediction performances show that Random Forest Regression (R2 ˃ 0.7500, RMSE ≤ 0.6000 ve MAE ≤ 0.4000) has better prediction performance according to different hyperparameter values. It is hoped that study results make contribution to studies about determining optimal hyperparameter values for machine learning regression methods for studies about modelling health expenditures.
Anahtar Kelimeler
Kaynakça
- Alpar R. (2011) Uygulamalı çok değişkenli istatistiksel yöntemler, Detay Yayıncılık, Ankara, 415-620.
- Basu, A., Manning, W.G. ve Mullahy, J. (2004). Comparing alternative model: log and cox proportional hazard? Health Economics, 13(8), 749-765. doi: 10.1002/hec.852.
- Belloni, A., Chernozhukov, V., Hansen, C. (2012) Inference for high-dimensional sparse econometric models. https://arxiv.org/abs/1201.0220. doi: 10.1017/CBO9781139060035.008. Erişim Tarihi: 01.01.2016.
- Bergstra, J. ve Bengio, Y. (2012) Random search for hyper-parameter optimization. Journal of Machine Learning Research, 13, 281-305. http://www.jmlr.org/papers/volume13/bergstra12a/bergstra12a.pdf. Erişim Tarihi: 01.02.2016.
- Box, G.E.P. ve Cox, D.R. (1964) An analysis of transformations, Journal of the Royal Statistical Society, 26(2), 211-252. doi: 10.1.1.321.3819.
- Brieman, L. (2001) Random forests, Machine Learning, 45, 5-32. doi: 10.1023%2FA%3A1010933404324.
- Cherkassky, V. ve Ma, Y. (2004) Practical selection of SVM parameters and noise estimation for SVM regression, Neural Networks, 17(1), 113-126. doi:10.1016/S0893-6080(03)00169-2.
- Cosgun E., Karaağaoğlu E. (2011). Veri madenciliği yöntemleriyle mikrodizilim gen ifade analizi, Hacettepe Tıp Dergisi, 42, 180-189. http://docplayer.biz.tr/3432783-Veri-madencili-i-yontemleriyle-mikrodizilim-gen-ifade-analizi.html. Erişim Tarihi: 01.02.2016.
Ayrıntılar
Birincil Dil
Türkçe
Konular
Mühendislik
Bölüm
Araştırma Makalesi
Yazarlar
Yayımlanma Tarihi
19 Eylül 2017
Gönderilme Tarihi
7 Mart 2016
Kabul Tarihi
19 Ağustos 2017
Yayımlandığı Sayı
Yıl 2017 Cilt: 22 Sayı: 2
Cited By
Makine Öğrenmesi Algoritmaları ile Hava Kirliliği Tahmini Üzerine Karşılaştırmalı Bir Değerlendirme
European Journal of Science and Technology
https://doi.org/10.31590/ejosat.530347Türkiye Kısa Dönem Elektrik Yük Talep Tahmininde Makine Öğrenmesi Yöntemlerinin Karşılaştırılması
Bilecik Şeyh Edebali Üniversitesi Fen Bilimleri Dergisi
https://doi.org/10.35193/bseufbd.1004827A new approach to the correlation of SPT-CPT depending on the soil behavior type index
Engineering Geology
https://doi.org/10.1016/j.enggeo.2023.106996ResNet34 Derin Öğrenme Mimarisi Kullanılarak Yüz Görüntülerinden Vücut Ağırlığı Tahmini Uygulaması
International Journal of Engineering and Innovative Research
https://doi.org/10.47933/ijeir.776106Yapay Sinir Ağları ve Çoklu Doğrusal Regresyon Kullanarak Emeklilik Fonu Net Varlık Değerlerinin Tahmin Edilmesi
Bilişim Teknolojileri Dergisi
https://doi.org/10.17671/gazibtd.742995Makine öğrenmesi algoritmaları ile deprem katalogları kullanılarak deprem tahmini
Gümüşhane Üniversitesi Fen Bilimleri Enstitüsü Dergisi
https://doi.org/10.17714/gumusfenbil.1268504Data Mining, Weka Decision Trees
Orclever Proceedings of Research and Development
https://doi.org/10.56038/oprd.v3i1.376A lasso regression-based forecasting model for daily gasoline consumption: Türkiye Case
Turkish Journal of Engineering
https://doi.org/10.31127/tuje.1354501Predicting deep well pump performance with machine learning methods during hydraulic head changes
Heliyon
https://doi.org/10.1016/j.heliyon.2024.e31505İşitme Kaybı Tahmininde Makine Öğrenmesi Yöntemlerinin Uygulanması ve Karşılaştırılması
Teknik Bilimler Dergisi
https://doi.org/10.35354/tbed.1580891Evaluation of predictive maintenance efficiency with the comparison of machine learning models in machining production process in brake industry
PeerJ Computer Science
https://doi.org/10.7717/peerj-cs.2999Predicting Global Health Expenditures Using Machine Learning and Regularized Regression Methods
Pamukkale Üniversitesi İşletme Araştırmaları Dergisi
https://doi.org/10.47097/piar.1792425