Araştırma Makalesi

Classification of Temporary and Real E-mail Addresses with Machine Learning Techniques

Cilt: 13 Sayı: 3 26 Eylül 2024
PDF İndir
TR EN

Classification of Temporary and Real E-mail Addresses with Machine Learning Techniques

Abstract

Temporary e-mail addresses are e-mail addresses that users can quickly create without signing up. These e-mail addresses are useful for privacy and to avoid spam. However, they also pose several serious cyber threats, including fraud, spam campaigns, and fake account creation In this study, a method utilizing natural language processing and machine learning techniques is proposed to classify real and temporary e-mail addresses. First, temporary and real e-mail addresses are analyzed, and features are developed to identify the differences between them. These features include lexical structures, broad contexts, and structural features of e-mail addresses. Various machine learning algorithms were then applied on the resulting feature set to differentiate e-mail addresses. The results were evaluated with K-fold cross-validation method and an accuracy rate of 96% was obtained. This success rate shows that the developed method can successfully distinguish between real and temporary e-mail addresses.

Keywords

Kaynakça

  1. M. Diale, T. Celik, and C. Van Der Walt, “Unsupervised feature learning for spam email filtering,” Computers & Electrical Engineering, vol. 74, pp. 89–104, Mar. 2019, doi: 10.1016/j.compeleceng.2019.01.004.
  2. N. Saidani, K. Adi, and M. S. Allili, “A semantic-based classification approach for an enhanced spam detection,” Computers & Security, vol. 94, p. 101716, Jul. 2020, doi: 10.1016/j.cose.2020.101716.
  3. S. Salloum, T. Gaber, S. Vadera, and K. Shaalan, “Phishing Email Detection Using Natural Language Processing Techniques: A Literature Survey,” Procedia Computer Science, vol. 189, pp. 19–28, Jan. 2021, doi: 10.1016/j.procs.2021.05.077.
  4. G. Sanghani and K. Kotecha, “Incremental personalized E-mail spam filter using novel TFDCR feature selection with dynamic feature update,” Expert Systems with Applications, vol. 115, pp. 287–299, Jan. 2019, doi: 10.1016/j.eswa.2018.07.049.
  5. The Enron Email Dataset, Kaggle, Mar. 2024. [Online]. Available: https://www.kaggle.com/datasets/wcukierski/enron-email-dataset
  6. The Spam Assassin Email Dataset, Kaggle, Mar. 2024. [Online]. Available: https://www.kaggle.com/datasets/ganiyuolalekan/spam-assassin-email-classification-dataset
  7. A. Vaswani et al., “Attention is all you need,” in Advances in Neural Information Processing Systems, Neural information processing systems foundation, 2017, pp. 5999–6009. [Online]. Available: https://arxiv.org/abs/1706.03762v5
  8. K. Debnath and N. Kar, “Email Spam Detection using Deep Learning Approach,” in 2022 International Conference on Machine Learning, Big Data, Cloud and Parallel Computing (COM-IT-CON), May 2022, pp. 37–41. doi: 10.1109/COM-IT-CON54601.2022.9850588.

Ayrıntılar

Birincil Dil

İngilizce

Konular

Bilgi Güvenliği Yönetimi

Bölüm

Araştırma Makalesi

Yayımlanma Tarihi

26 Eylül 2024

Gönderilme Tarihi

20 Temmuz 2024

Kabul Tarihi

12 Eylül 2024

Yayımlandığı Sayı

Yıl 2024 Cilt: 13 Sayı: 3

Kaynak Göster

APA
Balım, C., & Olgun, N. (2024). Classification of Temporary and Real E-mail Addresses with Machine Learning Techniques. Türk Doğa ve Fen Dergisi, 13(3), 176-183. https://doi.org/10.46810/tdfd.1519463
AMA
1.Balım C, Olgun N. Classification of Temporary and Real E-mail Addresses with Machine Learning Techniques. TDFD. 2024;13(3):176-183. doi:10.46810/tdfd.1519463
Chicago
Balım, Caner, ve Nevzat Olgun. 2024. “Classification of Temporary and Real E-mail Addresses with Machine Learning Techniques”. Türk Doğa ve Fen Dergisi 13 (3): 176-83. https://doi.org/10.46810/tdfd.1519463.
EndNote
Balım C, Olgun N (01 Eylül 2024) Classification of Temporary and Real E-mail Addresses with Machine Learning Techniques. Türk Doğa ve Fen Dergisi 13 3 176–183.
IEEE
[1]C. Balım ve N. Olgun, “Classification of Temporary and Real E-mail Addresses with Machine Learning Techniques”, TDFD, c. 13, sy 3, ss. 176–183, Eyl. 2024, doi: 10.46810/tdfd.1519463.
ISNAD
Balım, Caner - Olgun, Nevzat. “Classification of Temporary and Real E-mail Addresses with Machine Learning Techniques”. Türk Doğa ve Fen Dergisi 13/3 (01 Eylül 2024): 176-183. https://doi.org/10.46810/tdfd.1519463.
JAMA
1.Balım C, Olgun N. Classification of Temporary and Real E-mail Addresses with Machine Learning Techniques. TDFD. 2024;13:176–183.
MLA
Balım, Caner, ve Nevzat Olgun. “Classification of Temporary and Real E-mail Addresses with Machine Learning Techniques”. Türk Doğa ve Fen Dergisi, c. 13, sy 3, Eylül 2024, ss. 176-83, doi:10.46810/tdfd.1519463.
Vancouver
1.Caner Balım, Nevzat Olgun. Classification of Temporary and Real E-mail Addresses with Machine Learning Techniques. TDFD. 01 Eylül 2024;13(3):176-83. doi:10.46810/tdfd.1519463