The Effect of Regularized Regression and Tree-Based Missing Data Imputation Methods on Classification Performance in High Dimensional Data
Abstract
Keywords
Ethical Statement
References
- Acuna E, Rodriguez C. 2004. The treatment of missing values and its effect on classifier accuracy. In: Classification, Clustering, and Data Mining Applications: Proceedings of the Meeting of the International Federation of Classification Societies (IFCS), Illinois Institute of Technology, July 15–18, Chicago, USA, pp: 639-647.
- Breiman L. 1995. Better subset regression using the nonnegative garrote. Technometrics, 37(4): 373-384.
- Breiman L. 2017. Classification and regression trees. Routledge, New York, USA, 1st ed., pp: 368.
- Chang LY, Chen WC. 2005. Data mining of tree-based models to analyze freeway accident frequency. J Saf Res, 36(4): 365-375.
- Choudhury SJ, Pal NR. 2019. Imputation of missing data with neural networks for classification. Knowledge-Based Syst, 182: 104838.
- Clark LA, Pregibon D. 2017. Tree-based models. In: Hastie T, Chambers J, editors. Statistical models in S, Routledge, Oxfordshire, UK, pp: 377-419.
- Cortes C, Vapnik V. 1995. Support-vector networks. Mach Learn, 20: 273-297.
- Deng Y, Chang C, Ido MS, Long Q. 2016. Multiple imputation for general missing data patterns in the presence of high-dimensional. Data Sci Rep, 621689, 6(1): 21689.
Details
Primary Language
English
Subjects
Biostatistics, Statistical Analysis, Applied Statistics
Journal Section
Research Article
Authors
Buğra Varol
*
0000-0001-8052-7782
Türkiye
İmran Kurt Omurlu
0000-0003-2887-6656
Türkiye
Mevlüt Türe
0000-0003-3187-2322
Türkiye
Publication Date
November 15, 2024
Submission Date
August 12, 2024
Acceptance Date
October 21, 2024
Published in Issue
Year 2024 Volume: 7 Number: 6