Araştırma Makalesi

SSC: Clustering Of Turkish Texts By Spectral Graph Partitioning

Cilt: 24 Sayı: 4 1 Aralık 2021
PDF İndir
TR EN

SSC: Clustering Of Turkish Texts By Spectral Graph Partitioning

Öz

There is growing interest in studies on text classification as a result of the exponential increase in the amount of data available. Many studies have been conducted in the field of text clustering, using different approaches. This study introduces Spectral Sentence Clustering (SSC) for text clustering problems, which is an unsupervised method based on graph-partitioning. The study explains how the proposed model proposed can be used in natural language applications to successfully cluster texts. A spectral graph theory method is used to partition the graph into non-intersecting sub-graphs, and an unsupervised and efficient solution is offered for the text clustering problem by providing a physical representation of the texts. Finally, tests have been conducted demonstrating that SSC can be successfully used for text categorization. A clustering success rate of 97.08% was achieved in tests conducted using the TTC-3600 dataset, which contains open-access unstructured Turkish texts, classified into categories. The SSC model proposed performed better compared to a popular k-means clustering algorithm.

Anahtar Kelimeler

Kaynakça

  1. [1] Ş. Canberk, G. , Sağıroğlu, Bilgi ve Bilgisayar Güvenliği : Casus Yazılımlar ve Korunma Yöntemleri. Ankara: Grafiker Yayıncılık, 2006.
  2. [2] Osman Durmaz;, “Metin Sınıflandırmada Boyut Azaltmanın Etkisi ve Özellik Seçimi,” Gazi Üniversitesi, 2011.
  3. [3] C. Hark, A. Seyyarer, T. Uçkan, and A. Karci, “Doǧal dil işleme yaklaşimlari ile yapisal olmayan dökümanlarin benzerliǧi,” in IDAP 2017 - International Artificial Intelligence and Data Processing Symposium, 2017.
  4. [4] D. D. Lewis, “Naive (Bayes) at forty: The independence assumption in information retrieval,” in European conference on machine learning, 1998, pp. 4–15.
  5. [5] K. Aas and L. Eikvil, “Text categorisation: A survey.” Technical report, Norwegian computing center, 1999.
  6. [6] N. O. Andrews and E. A. Fox, “Recent Developments in Document Clustering,” 2007.
  7. [7] N. Shah, “Document Clustering : A Detailed Review,” vol. 4, no. 5, pp. 30–38, 2012.
  8. [8] A. Pujari, “Data mining techniques,” 2001.

Ayrıntılar

Birincil Dil

İngilizce

Konular

Mühendislik

Bölüm

Araştırma Makalesi

Yayımlanma Tarihi

1 Aralık 2021

Gönderilme Tarihi

5 Şubat 2020

Kabul Tarihi

5 Haziran 2020

Yayımlandığı Sayı

Yıl 2021 Cilt: 24 Sayı: 4

Kaynak Göster

APA
Uçkan, T., Hark, C., & Karci, A. (2021). SSC: Clustering Of Turkish Texts By Spectral Graph Partitioning. Politeknik Dergisi, 24(4), 1433-1444. https://doi.org/10.2339/politeknik.684558
AMA
1.Uçkan T, Hark C, Karci A. SSC: Clustering Of Turkish Texts By Spectral Graph Partitioning. Politeknik Dergisi. 2021;24(4):1433-1444. doi:10.2339/politeknik.684558
Chicago
Uçkan, Taner, Cengiz Hark, ve Ali Karci. 2021. “SSC: Clustering Of Turkish Texts By Spectral Graph Partitioning”. Politeknik Dergisi 24 (4): 1433-44. https://doi.org/10.2339/politeknik.684558.
EndNote
Uçkan T, Hark C, Karci A (01 Aralık 2021) SSC: Clustering Of Turkish Texts By Spectral Graph Partitioning. Politeknik Dergisi 24 4 1433–1444.
IEEE
[1]T. Uçkan, C. Hark, ve A. Karci, “SSC: Clustering Of Turkish Texts By Spectral Graph Partitioning”, Politeknik Dergisi, c. 24, sy 4, ss. 1433–1444, Ara. 2021, doi: 10.2339/politeknik.684558.
ISNAD
Uçkan, Taner - Hark, Cengiz - Karci, Ali. “SSC: Clustering Of Turkish Texts By Spectral Graph Partitioning”. Politeknik Dergisi 24/4 (01 Aralık 2021): 1433-1444. https://doi.org/10.2339/politeknik.684558.
JAMA
1.Uçkan T, Hark C, Karci A. SSC: Clustering Of Turkish Texts By Spectral Graph Partitioning. Politeknik Dergisi. 2021;24:1433–1444.
MLA
Uçkan, Taner, vd. “SSC: Clustering Of Turkish Texts By Spectral Graph Partitioning”. Politeknik Dergisi, c. 24, sy 4, Aralık 2021, ss. 1433-44, doi:10.2339/politeknik.684558.
Vancouver
1.Taner Uçkan, Cengiz Hark, Ali Karci. SSC: Clustering Of Turkish Texts By Spectral Graph Partitioning. Politeknik Dergisi. 01 Aralık 2021;24(4):1433-44. doi:10.2339/politeknik.684558

Cited By

 
TARANDIĞIMIZ DİZİNLER (ABSTRACTING / INDEXING)
181341319013191 13189 13187 13188 18016 

download Bu eser Creative Commons Atıf-AynıLisanslaPaylaş 4.0 Uluslararası ile lisanslanmıştır.