Research Article

A novel undersampling method based on data classification method

Volume: 26 Number: 2 July 15, 2024
EN TR

A novel undersampling method based on data classification method

Abstract

Data mining is one of the most important research area in literature. Due to the increasing volume of data, which is directly proportional to technological advancements, the number of researches in this field is growing rapidly. The goal of data mining is to extract various insights and obtain information from raw data by leveraging machine learning techniques. The structural characteristics and also class distributions of the datasets used in machine learning techniques significantly affect the performances of the algorithms. In this study, our aim is balancing the imbalanced binary dataset, used in the machine learning techniques, with an undersampling approach including a classification method via polyhedral conic functions.

Keywords

References

  1. Ayoub, S., Gulzar, Y., Rustamov, J., Jabbari, A., Reegu, F.A. and Turaev, S., Adversarial Approaches to Tackle Imbalanced Data in Machine Learning, Sustainability, 15, 7097, (2023).
  2. Raghuwanshi, B.S. and Shukla, S., Class-Specific Extreme Learning Machine for Handling Binary Class Imbalance Problem, Neural Networks, 105, 206–217, (2018).
  3. Mohammed R., Rawashdeh J. and Abdullah M., Machine learning with oversampling and undersampling techniques: Overview study and experimental results, 2020 11th International Conference on Information and Communication Systems (ICICS), 243-248, Irbid, Jordan, (2020).
  4. Hoyos-Osorio J., Alvarez-Meza A., G. Daza-Santacoloma, Orozco-Gutierrez A. and Castellanos-Dominguez G., Relevant information undersampling to support imbalanced data classification, Neurocomputing, 436, 136-146, (2021).
  5. Sun Z., Song Q., Zhu X., Sun H., Xu B. and Zhou Y., A novel ensemble method for classifying imbalanced data, Pattern Recognition, 48, 5, 1623-1637, (2015).
  6. Barandela R., Valdovinos R.M. and Sánchez J.S., New applications of ensembles of classifiers, Pattern Analysis & Applications, 6, 3, 245-256, (2003).
  7. Seiffert C., Khoshgoftaar T.M., Van Hulse J. and Napolitano A., Rusboost: a hybrid approach to alleviating class imbalance, IEEE Transactions on Systems, Man Cybernetics-Part A: Systems and Humans, 40, 1, 185-197, (2010).
  8. Mostafaei, S.H. and Tanha, J., OUBoost: boosting based over and under sampling technique for handling imbalanced data. International Journal of Machine Learning and Cybernetics, 14, 3393–3411 (2023).

Details

Primary Language

English

Subjects

Numerical Computation and Mathematical Software, Large and Complex Data Theory, Mathematical Optimisation

Journal Section

Research Article

Early Pub Date

July 14, 2024

Publication Date

July 15, 2024

Submission Date

March 5, 2024

Acceptance Date

June 6, 2024

Published in Issue

Year 2024 Volume: 26 Number: 2

APA
Uylaş Satı, N. (2024). A novel undersampling method based on data classification method. Balıkesir Üniversitesi Fen Bilimleri Enstitüsü Dergisi, 26(2), 518-526. https://doi.org/10.25092/baunfbed.1447440
AMA
1.Uylaş Satı N. A novel undersampling method based on data classification method. Balıkesir Üniversitesi Fen Bilimleri Enstitüsü Dergisi. 2024;26(2):518-526. doi:10.25092/baunfbed.1447440
Chicago
Uylaş Satı, Nur. 2024. “A Novel Undersampling Method Based on Data Classification Method”. Balıkesir Üniversitesi Fen Bilimleri Enstitüsü Dergisi 26 (2): 518-26. https://doi.org/10.25092/baunfbed.1447440.
EndNote
Uylaş Satı N (July 1, 2024) A novel undersampling method based on data classification method. Balıkesir Üniversitesi Fen Bilimleri Enstitüsü Dergisi 26 2 518–526.
IEEE
[1]N. Uylaş Satı, “A novel undersampling method based on data classification method”, Balıkesir Üniversitesi Fen Bilimleri Enstitüsü Dergisi, vol. 26, no. 2, pp. 518–526, July 2024, doi: 10.25092/baunfbed.1447440.
ISNAD
Uylaş Satı, Nur. “A Novel Undersampling Method Based on Data Classification Method”. Balıkesir Üniversitesi Fen Bilimleri Enstitüsü Dergisi 26/2 (July 1, 2024): 518-526. https://doi.org/10.25092/baunfbed.1447440.
JAMA
1.Uylaş Satı N. A novel undersampling method based on data classification method. Balıkesir Üniversitesi Fen Bilimleri Enstitüsü Dergisi. 2024;26:518–526.
MLA
Uylaş Satı, Nur. “A Novel Undersampling Method Based on Data Classification Method”. Balıkesir Üniversitesi Fen Bilimleri Enstitüsü Dergisi, vol. 26, no. 2, July 2024, pp. 518-26, doi:10.25092/baunfbed.1447440.
Vancouver
1.Nur Uylaş Satı. A novel undersampling method based on data classification method. Balıkesir Üniversitesi Fen Bilimleri Enstitüsü Dergisi. 2024 Jul. 1;26(2):518-26. doi:10.25092/baunfbed.1447440