Combination of PCA with SMOTE Oversampling for Classification of High-Dimensional Imbalanced Data
Abstract
Keywords
References
- Baran M. 2020. Maki̇ne Öğrenmesi̇ Yöntemleri̇yle Çoklu Eti̇ketli̇ Veri̇leri̇n Sınıflandırılması. Yüksek Lisans Tezi, Sivas Cumhuriyet Üniversitesi, Sosya Bilimler Enstitüsü, Sivas.
- Lorena A.C., Garcia L.P.F., Lehmann J., Souto M.C.P., Ho T.K. 2019. How Complex is Your Classification Problem?: A Survey on Measuring Classification Complexity. ACM Computing Surveys, 52 (5): 1–34.
- Tahir M.A.U.H., Asghar S., Manzoor A., Noor M.A. 2019. A Classification Model for Class Imbalance Dataset Using Genetic Programming. IEEE Access, 7: 71013-71037.
- Mustafa N., Li J.P., Memon E.R.A., Omer M.Z. 2017. A Classification Model for Imbalanced Medical Data based on PCA and Farther Distance based Synthetic Minority Oversampling Technique. International Journal of Advanced Computer Science and Applications, 8 (1): 61-67.
- Kambhatla N., Leen, T.K. 1997. Dimension Reduction by Local Principal Component Analysis. Neural Computation, 9 (7): 1493-1516.
- Hall M., Frank E., Holmes G., Pfahringer B., Reutemann P., Witten I.H. 2009. The WEKA Data Mining Software: An Uptade. SIGKDD Explorations, 11 (1): 10-18.
- Sun Y., Wong A.K.C., Kamel M.S. 2009. Classification of Imbalanced Data: A Review. International Journal of Pattern Recognition and Artificial Intelligence, 23 (4): 687-719.
- Basgall M.J., Hasperué W., Naiouf M., Fernández A. 2018. SMOTE-BD: An Exact and Scalable Oversampling Method for Imbalanced Classification in Big Data. Journal of Computer Science & Technology, 18 (3): 203-209.
Details
Primary Language
English
Subjects
-
Journal Section
Research Article
Authors
Guhdar A. A. Mulla
This is me
0000-0001-6742-0083
Türkiye
Yıldırım Demir
*
0000-0002-6350-8122
Türkiye
Publication Date
September 17, 2021
Submission Date
May 20, 2021
Acceptance Date
July 28, 2021
Published in Issue
Year 2021 Volume: 10 Number: 3
Cited By
Artificial Intelligence-based Colon Cancer Prediction by Identifying Genomic Biomarkers
Medical Records
https://doi.org/10.37990/medr.1077024Predictive Modeling of Student Dropout in MOOCs and Self-Regulated Learning
Computers
https://doi.org/10.3390/computers12100194Prediction of Lake Van Water Level using Artificial Neural Network Model with Meteorological Parameters and Multiple Linear Regression Analysis: A Comparative Study
Bitlis Eren Üniversitesi Fen Bilimleri Dergisi
https://doi.org/10.17798/bitlisfen.1316881Enhancing Lung Cancer Classification and Prediction With Deep Learning and Multi-Omics Data
IEEE Access
https://doi.org/10.1109/ACCESS.2024.3394030TÜRKÇE KONUŞMADA DUYGU TANIMA İÇİN MAKİNE ÖĞRENME YÖNTEMLERİ VE DERİN ÖĞRENME TABANLI MODELLERİN KARŞILAŞTIRILMASI
Mühendislik Bilimleri ve Tasarım Dergisi
https://doi.org/10.21923/jesd.1350375Çoklu Doğrusal Bağlantı Olması Durumunda Veri Madenciliği Algoritmaları Performanslarının Karşılaştırılması
Nicel Bilimler Dergisi
https://doi.org/10.51541/nicel.1371834Identification of key predictors of acute GVHD in pediatric acute Leukemia using machine learning methods
Transplant Immunology
https://doi.org/10.1016/j.trim.2025.102318Using Machine Learning Detection Malware in IoHT System
Vietnam Journal of Computer Science
https://doi.org/10.1142/S2196888826500028