BibTex RIS Kaynak Göster

Classification Performance of the Different Stemming Methods

Yıl 2015, Cilt: 3 Sayı: 3, 208 - 210, 29.06.2015
https://doi.org/10.18100/ijamec.27805

Öz

Saving textual data and accessing them in many fields have become one of the basic problems nowadays. The usage of these data effectively is directly related to the development of storage and access tools that will be used. Therefore, software programs using different methods have been developed. One of the points that need to be taken into account is data classifying. Because using raw data in these classifying processes is harmful, finding the stem of the texts is useful. In this study, the successes of two different stemming algorithms in the text classifying are comparatively examined.

Kaynakça

  • Kantardzic M., 2003, Data Mining:Concepts, Models, Methods, and Algorithms, IEEE Pres, Wiley Interscience Publications.
  • Saracoğlu R., 2007, Searching For Similar Documents Using Fuzzy Clustering, PhD Thesis, Graduate School of Natural and Applied Sciences, Selçuk University, Konya.
  • Yıldırım P.(*), Uludağ M.(**), Görür A.(*), 2008, Hastane Bilgi Sistemlerinde Veri Madenciliği, Akademik Bilişim Konferansları’08, (*) Çankaya Üniversitesi, Bilgisayar Mühendisliği Bölümü, Ankara. (**) European Bioinformatics Institute, Cambridge, UK.
  • Porter, M.F., 1980, An Algorithm For Suffix Stripping, Program, 14(3):130-137.
  • Jurafsky, D. and Martin, J., 2000, Speech and Language Processing, Prentice Hall, New Jersey.
  • Kesgin F., 2007, Topıc Detectıon System For Turkısh Texts, Master Thesis, Graduate School of Natural and Applied Sciences, Istanbul Technical University,Istanbul.
  • Joachims, T., 2002, Learning to classify text using support vector machines, Kluwer Academic Publishers, Boston.
  • Jackson, P., Moulinier, I., 2002, Natural language processing for online applications: text retrieval, extraction, and categorization, Amsterdam.
  • Lassila O., 1998, Web Metadata : A Matter of Semantics. IEEE Iternet Computing, pp. 30-37.
  • Eroğlu M., 2000, A Study On The Effects Of Stemming And Thesaurus For Retrieving Information In Turkish Documents, Master Thesis, Hacettepe University, Computer Enginering, Ankara, Türkiye.
  • Keller J.M., Gray M.R., Givens J.A., A Fuzzy K-Nearest Neighbor Algorithm, Systems, Man and Cybernetics, IEEE Transactions on
  • Volume:SMC-15, Issue:41Page(s):580-585,1985.
  • Pilavcılar İ., 2007, Metin Madenciliği ile Metin Sınıflandırma(KNN Algoritması) – 3, Yazılım Mühendisliği İleri Seviye Makaleleri http://www.csharpnedir.com.
  • Balcı M., 2010 Comparative Analysis of The Longest Match Algorithm in Computer Based Text Processing, Master Thesis, Graduate School of Natural and Applied Sciences, Selçuk University, Konya.

Original Research Paper

Yıl 2015, Cilt: 3 Sayı: 3, 208 - 210, 29.06.2015
https://doi.org/10.18100/ijamec.27805

Öz

Kaynakça

  • Kantardzic M., 2003, Data Mining:Concepts, Models, Methods, and Algorithms, IEEE Pres, Wiley Interscience Publications.
  • Saracoğlu R., 2007, Searching For Similar Documents Using Fuzzy Clustering, PhD Thesis, Graduate School of Natural and Applied Sciences, Selçuk University, Konya.
  • Yıldırım P.(*), Uludağ M.(**), Görür A.(*), 2008, Hastane Bilgi Sistemlerinde Veri Madenciliği, Akademik Bilişim Konferansları’08, (*) Çankaya Üniversitesi, Bilgisayar Mühendisliği Bölümü, Ankara. (**) European Bioinformatics Institute, Cambridge, UK.
  • Porter, M.F., 1980, An Algorithm For Suffix Stripping, Program, 14(3):130-137.
  • Jurafsky, D. and Martin, J., 2000, Speech and Language Processing, Prentice Hall, New Jersey.
  • Kesgin F., 2007, Topıc Detectıon System For Turkısh Texts, Master Thesis, Graduate School of Natural and Applied Sciences, Istanbul Technical University,Istanbul.
  • Joachims, T., 2002, Learning to classify text using support vector machines, Kluwer Academic Publishers, Boston.
  • Jackson, P., Moulinier, I., 2002, Natural language processing for online applications: text retrieval, extraction, and categorization, Amsterdam.
  • Lassila O., 1998, Web Metadata : A Matter of Semantics. IEEE Iternet Computing, pp. 30-37.
  • Eroğlu M., 2000, A Study On The Effects Of Stemming And Thesaurus For Retrieving Information In Turkish Documents, Master Thesis, Hacettepe University, Computer Enginering, Ankara, Türkiye.
  • Keller J.M., Gray M.R., Givens J.A., A Fuzzy K-Nearest Neighbor Algorithm, Systems, Man and Cybernetics, IEEE Transactions on
  • Volume:SMC-15, Issue:41Page(s):580-585,1985.
  • Pilavcılar İ., 2007, Metin Madenciliği ile Metin Sınıflandırma(KNN Algoritması) – 3, Yazılım Mühendisliği İleri Seviye Makaleleri http://www.csharpnedir.com.
  • Balcı M., 2010 Comparative Analysis of The Longest Match Algorithm in Computer Based Text Processing, Master Thesis, Graduate School of Natural and Applied Sciences, Selçuk University, Konya.
Toplam 14 adet kaynakça vardır.

Ayrıntılar

Birincil Dil İngilizce
Bölüm Research Article
Yazarlar

Mehmet Balcı

Rıdvan Saraçoğlu Bu kişi benim

Şakir Taşdemir Bu kişi benim

Adem Gölcük Bu kişi benim

Yayımlanma Tarihi 29 Haziran 2015
Yayımlandığı Sayı Yıl 2015 Cilt: 3 Sayı: 3

Kaynak Göster

APA Balcı, M., Saraçoğlu, R., Taşdemir, Ş., Gölcük, A. (2015). Classification Performance of the Different Stemming Methods. International Journal of Applied Mathematics Electronics and Computers, 3(3), 208-210. https://doi.org/10.18100/ijamec.27805
AMA Balcı M, Saraçoğlu R, Taşdemir Ş, Gölcük A. Classification Performance of the Different Stemming Methods. International Journal of Applied Mathematics Electronics and Computers. Haziran 2015;3(3):208-210. doi:10.18100/ijamec.27805
Chicago Balcı, Mehmet, Rıdvan Saraçoğlu, Şakir Taşdemir, ve Adem Gölcük. “Classification Performance of the Different Stemming Methods”. International Journal of Applied Mathematics Electronics and Computers 3, sy. 3 (Haziran 2015): 208-10. https://doi.org/10.18100/ijamec.27805.
EndNote Balcı M, Saraçoğlu R, Taşdemir Ş, Gölcük A (01 Haziran 2015) Classification Performance of the Different Stemming Methods. International Journal of Applied Mathematics Electronics and Computers 3 3 208–210.
IEEE M. Balcı, R. Saraçoğlu, Ş. Taşdemir, ve A. Gölcük, “Classification Performance of the Different Stemming Methods”, International Journal of Applied Mathematics Electronics and Computers, c. 3, sy. 3, ss. 208–210, 2015, doi: 10.18100/ijamec.27805.
ISNAD Balcı, Mehmet vd. “Classification Performance of the Different Stemming Methods”. International Journal of Applied Mathematics Electronics and Computers 3/3 (Haziran 2015), 208-210. https://doi.org/10.18100/ijamec.27805.
JAMA Balcı M, Saraçoğlu R, Taşdemir Ş, Gölcük A. Classification Performance of the Different Stemming Methods. International Journal of Applied Mathematics Electronics and Computers. 2015;3:208–210.
MLA Balcı, Mehmet vd. “Classification Performance of the Different Stemming Methods”. International Journal of Applied Mathematics Electronics and Computers, c. 3, sy. 3, 2015, ss. 208-10, doi:10.18100/ijamec.27805.
Vancouver Balcı M, Saraçoğlu R, Taşdemir Ş, Gölcük A. Classification Performance of the Different Stemming Methods. International Journal of Applied Mathematics Electronics and Computers. 2015;3(3):208-10.