Feature selection is a pivotal process in machine learning, essential for enhancing model performance by reducing dimensionality, improving generalization, and mitigating overfitting. By eliminating irrelevant or redundant features, simpler and more interpretable models are achieved, which generally perform better. In this study, we introduce an advanced hybrid method combining ensemble feature selection and regularization techniques, designed to optimize model accuracy while significantly reducing the number of features required. Applied to a customer satisfaction dataset, our method was first tested without feature selection, where the model achieved a ROC AUC value of 0.946 on the test set using all 369 features. However, after applying our proposed feature selection method, the model achieved a higher ROC AUC value of 0.954, utilizing only 12 key features and completing the task in approximately 43% less time. These findings demonstrate the effectiveness of our approach in producing a more efficient and superior-performing model.
Feature selection Basic filter method Regularization Logistic regularization Tree based feature selection
Feature selection is a pivotal process in machine learning, essential for enhancing model performance by reducing dimensionality, improving generalization, and mitigating overfitting. By eliminating irrelevant or redundant features, simpler and more interpretable models are achieved, which generally perform better. In this study, we introduce an advanced hybrid method combining ensemble feature selection and regularization techniques, designed to optimize model accuracy while significantly reducing the number of features required. Applied to a customer satisfaction dataset, our method was first tested without feature selection, where the model achieved a ROC AUC value of 0.946 on the test set using all 369 features. However, after applying our proposed feature selection method, the model achieved a higher ROC AUC value of 0.954, utilizing only 12 key features and completing the task in approximately 43% less time. These findings demonstrate the effectiveness of our approach in producing a more efficient and superior-performing model.
Feature selection Basic filter method Regularization Logistic regularization Tree based feature selection
Birincil Dil | İngilizce |
---|---|
Konular | Karar Desteği ve Grup Destek Sistemleri, Bilgi Sistemleri (Diğer), Uygulamalı Matematik (Diğer) |
Bölüm | Research Articles |
Yazarlar | |
Yayımlanma Tarihi | 15 Kasım 2024 |
Gönderilme Tarihi | 1 Eylül 2024 |
Kabul Tarihi | 16 Ekim 2024 |
Yayımlandığı Sayı | Yıl 2024 |