Research Article

CLUDS: COMBINING LABELED AND UNLABELED DATA WITH LOGISTIC REGRESSION FOR SOCIAL MEDIA ANALYSIS

Volume: 9 Issue: 4, December 20, 2021

Abstract

Automatic text classification and sentiment polarity detection are two important research problems in social media analysis. Word meanings matter enough that a document classification algorithm must capture them to classify accurately. Another important issue in text classification is the scarcity of labeled data. This study presents Combining Labeled and Unlabeled Data with Semantic Values of Terms (CLUDS). CLUDS consists of four steps: preprocessing, instance labeling, combining labeled and unlabeled data, and prediction. The preprocessing step uses the Latent Dirichlet Allocation (LDA) algorithm, and the instance labeling step applies Logistic Regression. CLUDS computes relevance values, an approach that has been applied as a supervised term weighting methodology in the text classification field. According to the literature, however, CLUDS is the first attempt to use both relevance and weighting calculation in a semi-supervised semantic kernel for Support Vector Machines (SVM). Two variants, Sprinkled-CLUDS and Adaptive-Sprinkled-CLUDS, have also been implemented. Experimental results show that CLUDS, Sprinkled-CLUDS, and Adaptive-Sprinkled-CLUDS achieve a performance gain over the baseline algorithms on the test sets.
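The instance labeling and combining steps named in the abstract can be illustrated with a generic pseudo-labeling (self-training) loop. The sketch below is not the CLUDS algorithm: it omits the LDA preprocessing, the relevance-based term weighting, and the semantic kernel for SVM, and it reuses a small hand-written logistic regression as a stand-in for the final classifier. The function names, the toy documents, and the 0.8 confidence threshold are all assumptions made for illustration.

```python
import math
from collections import Counter


def vectorize(doc, vocab):
    """Bag-of-words counts for one whitespace-tokenized document."""
    counts = Counter(doc.lower().split())
    return [float(counts[w]) for w in vocab]


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def train_logreg(X, y, epochs=500, lr=0.5):
    """Binary logistic regression fitted by batch gradient descent."""
    w, b, n = [0.0] * len(X[0]), 0.0, len(X)
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * len(w), 0.0
        for xi, yi in zip(X, y):
            err = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b) - yi
            grad_w = [g + err * xj for g, xj in zip(grad_w, xi)]
            grad_b += err
        w = [wj - lr * g / n for wj, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


def prob_positive(model, x):
    """Probability of class 1 under a (weights, bias) model."""
    w, b = model
    return sigmoid(sum(wj * xj for wj, xj in zip(w, x)) + b)


def pseudo_label_and_retrain(labeled, unlabeled, threshold=0.8):
    """Label confident unlabeled docs with a first model, then retrain on the union."""
    vocab = sorted({t for d, _ in labeled for t in d.lower().split()}
                   | {t for d in unlabeled for t in d.lower().split()})
    X = [vectorize(d, vocab) for d, _ in labeled]
    y = [label for _, label in labeled]

    # Instance labeling (hypothetical stand-in): fit on the labeled data only.
    first = train_logreg(X, y)

    # Combining: keep only unlabeled documents the first model is confident about.
    n_added = 0
    for doc in unlabeled:
        x = vectorize(doc, vocab)
        p = prob_positive(first, x)
        if max(p, 1.0 - p) >= threshold:
            X.append(x)
            y.append(1 if p >= 0.5 else 0)
            n_added += 1

    # Prediction: retrain on labeled plus pseudo-labeled documents.
    final = train_logreg(X, y)

    def predict(doc):
        return 1 if prob_positive(final, vectorize(doc, vocab)) >= 0.5 else 0

    return predict, n_added


# Toy data, invented for this sketch (1 = positive, 0 = negative).
labeled = [("great lovely phone", 1), ("love great day", 1),
           ("awful terrible service", 0), ("hate awful day", 0)]
unlabeled = ["lovely great camera", "terrible awful screen", "day"]

predict, n_added = pseudo_label_and_retrain(labeled, unlabeled)
```

In this toy run, "camera" and "screen" occur only in unlabeled documents, so the retrained model can assign them a polarity only because the pseudo-labeled documents were added to the training set.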

Keywords

Supporting Institution

TÜBİTAK

Project Number

118E315

Details

Primary Language

English

Subjects

Computer Software

Section

Research Article

Publication Date

December 20, 2021

Submission Date

August 13, 2020

Acceptance Date

September 6, 2021

Published in Issue

Year 2021, Volume: 9, Issue: 4

Cite

APA
Altınel, A. B. (2021). CLUDS: COMBINING LABELED AND UNLABELED DATA WITH LOGISTIC REGRESSION FOR SOCIAL MEDIA ANALYSIS. Mühendislik Bilimleri ve Tasarım Dergisi, 9(4), 1048-1061. https://doi.org/10.21923/jesd.780002
Chicago
Altınel, Ayşe Berna. 2021. “CLUDS: COMBINING LABELED AND UNLABELED DATA WITH LOGISTIC REGRESSION FOR SOCIAL MEDIA ANALYSIS”. Mühendislik Bilimleri ve Tasarım Dergisi 9 (4): 1048-61. https://doi.org/10.21923/jesd.780002.