Araştırma Makalesi

Phishing E-mail Detection with Machine Learning and Deep Learning: Improving Classification Performance with Proposed New Features

Cilt: 13 Sayı: 2 30 Haziran 2025
PDF İndir
EN

Phishing E-mail Detection with Machine Learning and Deep Learning: Improving Classification Performance with Proposed New Features

Öz

Today, with the increasing use of the internet, individuals who use email have become potential targets for fraudsters. These malicious groups send fake or misleading emails to steal sensitive information such as identity, bank, and social media credentials. This tactic is known as phishing. This study proposes a machine learning-based system for detecting phishing attacks using the SeFACED dataset, which was adjusted for binary classification with 12,498 normal and 5,142 fraudulent email data points. Python was used for programming, with Google Colab and Jupyter Notebook as development platforms. Email data underwent data collection, cleaning, and word stem separation processes. Three feature extraction techniques were used: Bag of Words, TF-IDF, and Word2Vec. Six algorithms, including Logistic Regression, Random Forest, Support Vector Machines, Naive Bayes, Convolutional Neural Network, and Long Short-Term Memory, were employed for classification. Performance was evaluated using metrics like accuracy, preci-sion, recall, and F1-score. New attributes proposed to enhance detection included CSS tags, HTML tags, black-list words, link errors, and grammar and spelling errors. The addition of these features generally improved classification results.

Anahtar Kelimeler

Kaynakça

  1. [1] R. Alabdan, “Phishing attacks survey: Types, vectors, and technical approaches,” Future internet, vol. 12, no. 10, p. 168, 2020.
  2. [2] K. L. Chiew, K. S. C. Yong, and C. L. Tan, “A survey of phishing attacks: Their types, vectors and technical approaches,” Expert Systems with Applications, vol. 106, pp. 1–20, 2018.
  3. [3] U. A. Butt, R. Amin, H. Aldabbas, S. Mohan, B. Alouffi, and A. Ah-madian, “Cloud-based email phishing attack using machine and deep learning algorithm,” Complex & Intelligent Systems, vol. 9, no. 3, pp. 3043–3070, 2023.
  4. [4] M. Jakobsson, “Two-factor inauthentication–the rise in sms phishing attacks,” Computer Fraud & Security, vol. 2018, no. 6, pp. 6–8, 2018.
  5. [5] A. Basit, M. Zafar, X. Liu, A. R. Javed, Z. Jalil, and K. Kifayat, “A com-prehensive survey of ai-enabled phishing attacks detection techniques,” Telecommunication Systems, vol. 76, pp. 139–154, 2021.
  6. [6] APWG, “Apwg phishing activity trends report,” 2025. [Online]. Available: https://apwg.org/trendsreports
  7. [7] S. Gupta, A. Singhal, and A. Kapoor, “A literature survey on social engineering attacks: Phishing attack,” in 2016 International Conference on Computing, Communication and Automation (ICCCA), 2016, pp. 537–540.
  8. [8] J. Rastenis, S. Ramanauskait˙e, J. Januleviˇcius, A. ˇCenys, A. Slotkien˙e, and K. Pakrijauskas, “E-mail-based phishing attack taxonomy,” Applied sciences, vol. 10, no. 7, p. 2363, 2020.

Ayrıntılar

Birincil Dil

İngilizce

Konular

Bilgisayar Yazılımı

Bölüm

Araştırma Makalesi

Erken Görünüm Tarihi

11 Temmuz 2025

Yayımlanma Tarihi

30 Haziran 2025

Gönderilme Tarihi

27 Mayıs 2024

Kabul Tarihi

10 Ocak 2025

Yayımlandığı Sayı

Yıl 2025 Cilt: 13 Sayı: 2

Kaynak Göster

APA
Brioua, H., Siyambaş, H., & Şahin, D. Ö. (2025). Phishing E-mail Detection with Machine Learning and Deep Learning: Improving Classification Performance with Proposed New Features. Balkan Journal of Electrical and Computer Engineering, 13(2), 183-193. https://doi.org/10.17694/bajece.1490596
AMA
1.Brioua H, Siyambaş H, Şahin DÖ. Phishing E-mail Detection with Machine Learning and Deep Learning: Improving Classification Performance with Proposed New Features. Balkan Journal of Electrical and Computer Engineering. 2025;13(2):183-193. doi:10.17694/bajece.1490596
Chicago
Brioua, Hadjer, Havvanur Siyambaş, ve Durmuş Özkan Şahin. 2025. “Phishing E-mail Detection with Machine Learning and Deep Learning: Improving Classification Performance with Proposed New Features”. Balkan Journal of Electrical and Computer Engineering 13 (2): 183-93. https://doi.org/10.17694/bajece.1490596.
EndNote
Brioua H, Siyambaş H, Şahin DÖ (01 Haziran 2025) Phishing E-mail Detection with Machine Learning and Deep Learning: Improving Classification Performance with Proposed New Features. Balkan Journal of Electrical and Computer Engineering 13 2 183–193.
IEEE
[1]H. Brioua, H. Siyambaş, ve D. Ö. Şahin, “Phishing E-mail Detection with Machine Learning and Deep Learning: Improving Classification Performance with Proposed New Features”, Balkan Journal of Electrical and Computer Engineering, c. 13, sy 2, ss. 183–193, Haz. 2025, doi: 10.17694/bajece.1490596.
ISNAD
Brioua, Hadjer - Siyambaş, Havvanur - Şahin, Durmuş Özkan. “Phishing E-mail Detection with Machine Learning and Deep Learning: Improving Classification Performance with Proposed New Features”. Balkan Journal of Electrical and Computer Engineering 13/2 (01 Haziran 2025): 183-193. https://doi.org/10.17694/bajece.1490596.
JAMA
1.Brioua H, Siyambaş H, Şahin DÖ. Phishing E-mail Detection with Machine Learning and Deep Learning: Improving Classification Performance with Proposed New Features. Balkan Journal of Electrical and Computer Engineering. 2025;13:183–193.
MLA
Brioua, Hadjer, vd. “Phishing E-mail Detection with Machine Learning and Deep Learning: Improving Classification Performance with Proposed New Features”. Balkan Journal of Electrical and Computer Engineering, c. 13, sy 2, Haziran 2025, ss. 183-9, doi:10.17694/bajece.1490596.
Vancouver
1.Hadjer Brioua, Havvanur Siyambaş, Durmuş Özkan Şahin. Phishing E-mail Detection with Machine Learning and Deep Learning: Improving Classification Performance with Proposed New Features. Balkan Journal of Electrical and Computer Engineering. 01 Haziran 2025;13(2):183-9. doi:10.17694/bajece.1490596

All articles published by BAJECE are licensed under the Creative Commons Attribution 4.0 International License. This permits anyone to copy, redistribute, remix, transmit and adapt the work provided the original work and source is appropriately cited.Creative Commons Lisans