Intrusion Detection Model Based on TF.IDF and C4.5 Algorithms
Öz
In recent years, the use of machine learning and data mining technologies has drawn researchers’ attention to new ways to improve the performance of Intrusion Detection Systems (IDS). These techniques have proven to be an effective method in distinguishing malicious network packets. One of the most challenging problems that researchers are faced with is the transformation of data into a form that can be handled effectively by Machine Learning Algorithms (MLA). In this paper, we present an IDS model based on the decision tree C4.5 algorithm with transforming simulated UNSW-NB15 dataset as a pre-processing operation. Our model uses Term Frequency.Inverse Document Frequency (TF.IDF) to convert data types to an acceptable and efficient form for machine learning to achieve high detection performance. The model has been tested with randomly selected 250000 records of the UNSW-NB15 dataset. Selected records have been grouped into various segment sizes, like 50, 500, 1000, and 5000 items. Each segment has been, further, grouped into two subsets of multi and binary class datasets. The performance of the Decision Tree C4.5 algorithm with Multilayer Perceptron (MLP) and Naive Bayes (NB) has been compared in Weka software. Our proposed method significantly has improved the accuracy of classifiers and decreased incorrectly detected instances. The increase in accuracy reflects the efficiency of transforming the dataset with TF.IDF of various segment sizes.
Anahtar Kelimeler
Kaynakça
- Yu Z. Intrusion Detection: A Machine Learning Approach (Volume 3). London, UK: Imperial College Press, 2011.
- Armin J, Thompson B, Ariu D, Giacinto G, Roli F, Kijewski P. 2020 cybercrime economic costs: No measure no solution. In: 10th International Conference on Availability, Reliability and Security (ARES); 24-27 Aug. 2015; Toulouse, France. New York, NY, USA: IEEE. pp.701-710.
- Bhattacharyya DK, Kalita JK. Network anomaly detection: A machine learning perspective. New York, NY, USA: CRC Press, 2013.
- Katkar VD, Bhatia DS. Lightweight approach for detection of denial of service attacks using numeric to binary preprocessing. In: IEEE International Conference on Circuits, Systems, Communication and Information Technology Applications (CSCITA); 4-5 April 2014; Mumbai, India. New York, NY, USA: IEEE. pp. 207-212.
- Mehmood T, Rais H. Machine learning algorithms in context of intrusion detection. In: IEEE 2016 Computer and Information Sciences (ICCOINS), International Conference; 15-17 Aug. 2016; Kuala Lumpur, Malaysia. New York, NY, USA: IEEE. pp. 369-373.
- Mane D, Pawar S. Anomaly based IDS using Backpropagation Neural Network. International Journal of Computer Applications 2016; 136: 29-34.
- Deshmukh DH, Ghorpade T, Padiya P. Intrusion detection system by improved preprocessing methods and Naive Bayes classifier using NSL-KDD 99 Dataset. In: IEEE 2014 International Conference on Electronics and Communication Systems (ICECS); 13-14 Feb. 2014; Coimbatore, India. New York, NY, USA: IEEE. pp. 1-7.
- Mogal DG, Ghungrad SR, Bhusare BB. NIDS using Machine Learning Classifiers on UNSW-NB15 and KDDCUP99 Datasets. International Journal of Advanced Research in Computer and Communication Engineering 2017; 6: 533-537.
Ayrıntılar
Birincil Dil
İngilizce
Konular
Mühendislik
Bölüm
Araştırma Makalesi
Yazarlar
Khaldoon Awadh
Bu kişi benim
0000-0001-6697-931X
Türkiye
Ayhan Akbaş
*
0000-0002-6425-104X
Türkiye
Yayımlanma Tarihi
1 Aralık 2021
Gönderilme Tarihi
24 Şubat 2020
Kabul Tarihi
4 Temmuz 2020
Yayımlandığı Sayı
Yıl 2021 Cilt: 24 Sayı: 4
Cited By
Tekdüzen Kaynak Bulucu Yoluyla Kimlik Avı Tespiti için Makine Öğrenmesi Algoritmalarının Özellik Tabanlı Performans Karşılaştırması
Journal of Polytechnic
https://doi.org/10.2339/politeknik.1035286Stacked-Based Ensemble Machine Learning Model for Positioning Footballer
Arabian Journal for Science and Engineering
https://doi.org/10.1007/s13369-022-06857-8Ağ Adli Bilişimi Süreç Gereksinimlerinin Belirlenmesi ve Yazılım Tanımlı Ağlarda İncelenmesi
Journal of Polytechnic
https://doi.org/10.2339/politeknik.1141107Machine Learning based Network Intrusion Detection with Hybrid Frequent Item Set Mining
Journal of Polytechnic
https://doi.org/10.2339/politeknik.1386467Detection of anomalous Nitrogen Dioxide concentration of Ankara: a Reconstruction-based approach
Journal of Polytechnic
https://doi.org/10.2339/politeknik.1419512Fine-Tuned Understanding: Enhancing Social Bot Detection With Transformer-Based Classification
IEEE Access
https://doi.org/10.1109/ACCESS.2024.3440657Student adaptivity classification in online education through stacked ensemble learning
Multimedia Tools and Applications
https://doi.org/10.1007/s11042-024-20421-6