Research Article

Classification of Unwanted E-Mails (Spam) with Turkish Text by Different Algorithms in Weka Program

Volume: 3 Number: 1 June 28, 2022
Abstract

With the widespread use of the Internet, electronic communication tools have also become widely used. One of these tools is e-mail. E-mail is easy to use and makes it possible to reach thousands of people at the same time. This advantage also invites misuse: e-mail users receive dozens of unsolicited messages (spam) against their will. In this study, 1017 e-mails collected from about 20 different Gmail and Hotmail accounts were classified as spam or regular e-mail using the algorithms in the Weka program, and the success of the algorithms was compared. In total, 45 different algorithms were tested. The highest classification success was obtained with the NaiveBayesMultinomial and NaiveBayesMultinomialUpdateable algorithms, with 94.7886% correct classification. Among the other classifiers, the Trees RandomForest algorithm achieved 93.6087%, Meta MultiClassClassifier and Functions SGD 92.4287%, Functions SMO 91.7404%, Meta RandomCommittee 91.0521%, and Bayes NaiveBayes and Bayes NaiveBayesUpdateable 90.3638% classification success.
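As a rough illustration of the best-performing approach named in the abstract, the following is a minimal plain-Python sketch of a multinomial Naive Bayes text classifier with add-one (Laplace) smoothing. It is not the authors' Weka setup. The function names (`tokenize`, `train`, `predict`, `accuracy_percent`), the whitespace tokenizer, and the four toy Turkish messages are assumptions made only for this example, and the smoothing and preprocessing choices may differ from those used in the study.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    # Naive whitespace tokenizer (an assumption). Python's lower() does not
    # handle the Turkish dotted/dotless i correctly, so real Turkish text
    # would need locale-aware normalisation.
    return text.lower().split()


def train(samples):
    """Count documents per class and word occurrences per class."""
    class_docs = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for text, label in samples:
        class_docs[label] += 1
        for tok in tokenize(text):
            word_counts[label][tok] += 1
            vocab.add(tok)
    return class_docs, word_counts, vocab


def predict(model, text):
    """Return the class with the highest log posterior."""
    class_docs, word_counts, vocab = model
    total_docs = sum(class_docs.values())
    best_label, best_score = None, -math.inf
    for label in class_docs:
        # Log prior of the class.
        score = math.log(class_docs[label] / total_docs)
        total_words = sum(word_counts[label].values())
        for tok in tokenize(text):
            if tok not in vocab:
                continue  # ignore words never seen in training
            # Add-one smoothed log likelihood of the word given the class.
            score += math.log(
                (word_counts[label][tok] + 1) / (total_words + len(vocab))
            )
        if score > best_score:
            best_label, best_score = label, score
    return best_label


def accuracy_percent(model, samples):
    """Percentage of samples whose predicted label matches the true label."""
    correct = sum(predict(model, text) == label for text, label in samples)
    return 100.0 * correct / len(samples)


# Hypothetical toy data, invented for illustration only.
TOY_SAMPLES = [
    ("büyük indirim hemen tıkla", "spam"),
    ("bedava hediye kazandınız tıkla", "spam"),
    ("toplantı yarın saat onda", "regular"),
    ("rapor ekte toplantı notları", "regular"),
]

model = train(TOY_SAMPLES)
```

Calling `predict(model, "bedava indirim tıkla")` on this toy model classifies the message as spam, because all three words occur only in the spam examples. A percentage such as those reported in the abstract corresponds to the kind of ratio computed by `accuracy_percent`, although the evaluation protocol used in the study is not stated in this section.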

Details

Primary Language

English

Subjects

Computer Software

Journal Section

Research Article

Publication Date

June 28, 2022

Submission Date

April 17, 2022

Acceptance Date

May 1, 2022

Published in Issue

Year 2022 Volume: 3 Number: 1

APA
Şimşek, H., & Aydemir, E. (2022). Classification of Unwanted E-Mails (Spam) with Turkish Text by Different Algorithms in Weka Program. Journal of Soft Computing and Artificial Intelligence, 3(1), 1-10. https://doi.org/10.55195/jscai.1104694