Araştırma Makalesi

A Salp Swarm-Based Under-Sampling Approach for Medical Imbalanced Data Classification

Sayı: 34 31 Mart 2022
PDF İndir
EN TR

A Salp Swarm-Based Under-Sampling Approach for Medical Imbalanced Data Classification

Öz

Data imbalance refers to the unequal distribution of classes within a dataset that directly affects the accuracy of machine learning classification algorithms. Although many resampling techniques have been proposed by researchers, learning from imbalanced data is still considered one of the contemporary challenges. The class imbalanced problem has been complicated as most of the existing techniques don't manage the similarity relationships between minority and majority classes well. In addition, due to the complex relationships among classes, most of the existing techniques do not focus on retaining valuable samples in the majority class(es) properly. In this article, a salp swarm optimization-based under-sampling technique (SSBUT) is proposed to address data class imbalance problems. Utilizing the proposed SSBUT, the similarity relationship among the samples of the majority class is well analyzed, and the samples that do not affect the accuracy of the classification algorithm are eliminated from the majority class. The performance of the proposed SSBUT has been tested on benchmark medical imbalanced datasets and the obtained results have been compared with state-of-the-art under-sampling techniques. The experimental results show that the proposed SSBUT consistently outperformed the state-of-the-art under-sampling techniques in terms of various evaluation criteria.

Anahtar Kelimeler

Kaynakça

  1. Han J, Pei J, Kamber M. (2011). Data mining: concepts and techniques. Elsevier.
  2. Sen PC, Hajra M, Ghosh M. (2020). Supervised classification algorithms in machine learning: A survey and review. In: Emerging technology in modelling and graphics. Springer, pp 99-111.
  3. Özkaya, U., Öztürk, Ş., Barstugan, M. (2020). Coronavirus (COVID-19) classification using deep features fusion and ranking technique. In Big Data Analytics and Artificial Intelligence Against COVID-19: Innovation Vision and Approach (pp. 281-295). Springer, Cham.
  4. Kwon O, Sim JM. (2013). Effects of data set features on the performances of classification algorithms. Expert Systems with Applications 40 (5):1847-1857.
  5. Atomi WH. (2012). The effect of data preprocessing on the performance of artificial neural networks techniques for classification problems. Universiti Tun Hussein Onn Malaysia.
  6. Rout N, Mishra D, Mallick MK. (2018). Handling imbalanced data: a survey. In: International proceedings on advances in soft computing, intelligent systems and applications. Springer, pp 431-443.
  7. Leevy JL, Khoshgoftaar TM, Bauder RA, Seliya N. (2018). A survey on addressing high-class imbalance in big data. Journal of Big Data 5 (1):1-30.
  8. Singh A, Purohit A. (2015). A survey on methods for solving data imbalance problem for classification. International Journal of Computer Applications 127 (15):37-41.

Ayrıntılar

Birincil Dil

İngilizce

Konular

Mühendislik

Bölüm

Araştırma Makalesi

Yayımlanma Tarihi

31 Mart 2022

Gönderilme Tarihi

3 Mart 2022

Kabul Tarihi

5 Mart 2022

Yayımlandığı Sayı

Yıl 2022 Sayı: 34

Kaynak Göster

APA
Ibrahım, M. H. (2022). A Salp Swarm-Based Under-Sampling Approach for Medical Imbalanced Data Classification. Avrupa Bilim ve Teknoloji Dergisi, 34, 396-402. https://doi.org/10.31590/ejosat.1082451
AMA
1.Ibrahım MH. A Salp Swarm-Based Under-Sampling Approach for Medical Imbalanced Data Classification. EJOSAT. 2022;(34):396-402. doi:10.31590/ejosat.1082451
Chicago
Ibrahım, Mohammed Hussein. 2022. “A Salp Swarm-Based Under-Sampling Approach for Medical Imbalanced Data Classification”. Avrupa Bilim ve Teknoloji Dergisi, sy 34: 396-402. https://doi.org/10.31590/ejosat.1082451.
EndNote
Ibrahım MH (01 Mart 2022) A Salp Swarm-Based Under-Sampling Approach for Medical Imbalanced Data Classification. Avrupa Bilim ve Teknoloji Dergisi 34 396–402.
IEEE
[1]M. H. Ibrahım, “A Salp Swarm-Based Under-Sampling Approach for Medical Imbalanced Data Classification”, EJOSAT, sy 34, ss. 396–402, Mar. 2022, doi: 10.31590/ejosat.1082451.
ISNAD
Ibrahım, Mohammed Hussein. “A Salp Swarm-Based Under-Sampling Approach for Medical Imbalanced Data Classification”. Avrupa Bilim ve Teknoloji Dergisi. 34 (01 Mart 2022): 396-402. https://doi.org/10.31590/ejosat.1082451.
JAMA
1.Ibrahım MH. A Salp Swarm-Based Under-Sampling Approach for Medical Imbalanced Data Classification. EJOSAT. 2022;:396–402.
MLA
Ibrahım, Mohammed Hussein. “A Salp Swarm-Based Under-Sampling Approach for Medical Imbalanced Data Classification”. Avrupa Bilim ve Teknoloji Dergisi, sy 34, Mart 2022, ss. 396-02, doi:10.31590/ejosat.1082451.
Vancouver
1.Mohammed Hussein Ibrahım. A Salp Swarm-Based Under-Sampling Approach for Medical Imbalanced Data Classification. EJOSAT. 01 Mart 2022;(34):396-402. doi:10.31590/ejosat.1082451