Araştırma Makalesi

Effects of Feature Extraction Techniques on Classification of Turkish Texts

Cilt: 34 Sayı: 3 30 Eylül 2019
PDF İndir
EN TR

Effects of Feature Extraction Techniques on Classification of Turkish Texts

Öz

Feature extraction is the most important preprocessing step of text classification task. Effects of preprocessing techniques on text mining for English have been extensively studied. However, studies for Turkish are limited and generally belong to a specific problem domain. In this study, we investigate the effects of feature extraction techniques on four different Turkish text classification problems including news classification, spam e-mail detection, sentiment analysis, and author detection to show the differences and similarities among the problems. We also propose a new feature selection method to reduce feature space. The experimental analysis has showed that, stopword removal improves classification performance. However, stemming does not make any positive effect on classification accuracy. The most successful term weighting methods are tf and tf*idf. The proposed feature selection method improves classification performance and has higher accuracy than the well-known methods. 

Anahtar Kelimeler

Kaynakça

  1. 1. Hand, D., Mannila, H., Smyth, P., 2001. Principles of Data Mining, the MIT Press, England, 546.
  2. 2. İlhan, S., Duru, N., Karagöz, Ş., Sağır, M., 2008. Metin Madenciliği ile Soru Cevaplama Sistemi, ELECO-2008, 356-359.
  3. 3. Amasyalı, M.F., Diri, B., 2006. Automatic Turkish Text Categorization in Terms of Author, Genre and Gender. C. Kop et al. (Eds.): NLDB 2006, LNCS 3999, 221–226.
  4. 4. Yıldız, H.K., Gençtav, M., Usta N., Diri B., Amasyalı M.F., 2007. Metin Sınıflandırmada Yeni Özellik Çıkarımı, Signal Processing and Communications Applications (SIU 2007), Eskişehir, Turkey.
  5. 5. Cataltepe, Z., Turan, Y., Kesgin, F., 2007. Turkish Document Classification Using Shorter Roots, Signal Processing and Communications Applications (SIU 2007), Eskisehir, Turkey.
  6. 6. Güran, A., Akyokuş, S., Bayazıt, N.G., Gürbüz, M.Z., 2009. Turkish Text Categorization Using N-Gram Words. International Symposium on Innovations in Intelligent Systems and Applications (INISTA 2009), Trabzon, Turkey.
  7. 7. Torunoğlu, D., Çakırman, E., Ganiz, M., Akyokuş, S., Gürbüz, Z., 2011. Analysis of Preprocessing Methods on Text Classification of Turkish Texts, International Symposium on Innovations in Intelligent Systems and Applications (INISTA 2011), İstanbul, 112-117.
  8. 8. Uysal, K.U., Günal, S., 2013. The Impact of Preprocessing on Text Classification, Information Processing and Management, 104-112.

Ayrıntılar

Birincil Dil

İngilizce

Konular

-

Bölüm

Araştırma Makalesi

Yazarlar

Özge Akdoğan Bu kişi benim
Türkiye

Yayımlanma Tarihi

30 Eylül 2019

Gönderilme Tarihi

27 Mayıs 2019

Kabul Tarihi

30 Eylül 2019

Yayımlandığı Sayı

Yıl 2019 Cilt: 34 Sayı: 3

Kaynak Göster

APA
Akdoğan, Ö., & Özel, S. A. (2019). Effects of Feature Extraction Techniques on Classification of Turkish Texts. Çukurova Üniversitesi Mühendislik-Mimarlık Fakültesi Dergisi, 34(3), 95-108. https://doi.org/10.21605/cukurovaummfd.637643
AMA
1.Akdoğan Ö, Özel SA. Effects of Feature Extraction Techniques on Classification of Turkish Texts. cukurovaummfd. 2019;34(3):95-108. doi:10.21605/cukurovaummfd.637643
Chicago
Akdoğan, Özge, ve Selma Ayşe Özel. 2019. “Effects of Feature Extraction Techniques on Classification of Turkish Texts”. Çukurova Üniversitesi Mühendislik-Mimarlık Fakültesi Dergisi 34 (3): 95-108. https://doi.org/10.21605/cukurovaummfd.637643.
EndNote
Akdoğan Ö, Özel SA (01 Eylül 2019) Effects of Feature Extraction Techniques on Classification of Turkish Texts. Çukurova Üniversitesi Mühendislik-Mimarlık Fakültesi Dergisi 34 3 95–108.
IEEE
[1]Ö. Akdoğan ve S. A. Özel, “Effects of Feature Extraction Techniques on Classification of Turkish Texts”, cukurovaummfd, c. 34, sy 3, ss. 95–108, Eyl. 2019, doi: 10.21605/cukurovaummfd.637643.
ISNAD
Akdoğan, Özge - Özel, Selma Ayşe. “Effects of Feature Extraction Techniques on Classification of Turkish Texts”. Çukurova Üniversitesi Mühendislik-Mimarlık Fakültesi Dergisi 34/3 (01 Eylül 2019): 95-108. https://doi.org/10.21605/cukurovaummfd.637643.
JAMA
1.Akdoğan Ö, Özel SA. Effects of Feature Extraction Techniques on Classification of Turkish Texts. cukurovaummfd. 2019;34:95–108.
MLA
Akdoğan, Özge, ve Selma Ayşe Özel. “Effects of Feature Extraction Techniques on Classification of Turkish Texts”. Çukurova Üniversitesi Mühendislik-Mimarlık Fakültesi Dergisi, c. 34, sy 3, Eylül 2019, ss. 95-108, doi:10.21605/cukurovaummfd.637643.
Vancouver
1.Özge Akdoğan, Selma Ayşe Özel. Effects of Feature Extraction Techniques on Classification of Turkish Texts. cukurovaummfd. 01 Eylül 2019;34(3):95-108. doi:10.21605/cukurovaummfd.637643

Cited By