EN
TR
CLUDS: COMBINING LABELED AND UNLABELED DATA WITH LOGISTIC REGRESSION FOR SOCIAL MEDIA ANALYSIS
Öz
Automatic text classification and sentiment polarity detection are two important research problems of social media analysis. The meanings of the words are so important that they need to be captured by a document classification algorithm to reach an accurate classification performance. Another important issue with the text classification is the scarcity of labeled data. In this study, Combining Labeled and Unlabeled Data with Semantic Values of Terms (CLUDS) is presented. CLUDS has the following steps: preprocessing, instance labeling, combining labeled and unlabeled data, and prediction. In preprocessing step Latent Dirichlet Allocation (LDA) algorithm is used. In instance labeling step Logistic Regression is applied. In CLUDS, relevance values computation has been applied as a supervised term weighting methodology in the text classification field. Still, according to the literature, CLUDS is the first attempt that uses both relevance and weighting calculation in a semi-supervised semantic kernel for Support Vector Machines (SVM). In this study, Sprinkled-CLUDS and Adaptive-Sprinkled-CLUDS have also been implemented. Evaluated experimental results show that CLUDS, Sprinkled-CLUDS and Adaptive-Sprinkled-CLUDS generate a valuable performance gain over the baseline algorithms on test sets.
Anahtar Kelimeler
Destekleyen Kurum
TÜBİTAK
Proje Numarası
118E315
Kaynakça
- Ahmed, I., Ali, R., Guan, D., Lee, Y., Lee, S., Chung, T. 2015. Semi-Supervised Learning Using Frequent Itemset and Ensemble Learning for SMS Classification. Expert Systems with Applications, 42(3), 1065-1073.
- Akın, A. A., & Akın, M. D., 2007. Zemberek, an open source nlp framework for Turkish languages. Structure, 10, 1-5.
- Alsmadi, I., & Hoon, G. K., 2019. Term weighting scheme for short-text classification: Twitter corpuses. Neural Computing and Applications, 31(8), 3819-3831.
- Altınel, B., Diri, B., Ganiz, M.C., 2015. A Novel Semantic Smoothing Kernel for Text Classification with Class-based Weighting. Knowledge-Based Systems, 89(1), 265-277.
- Altınel, B., Ganiz, M. C., 2018. Semantic Text Classification: A Survey of Past and Recent Advances. Information Processing & Management, 54(6), 1129-1153.
- Amasyalı, M. F., Beken, A. Türkçe Kelimelerin Anlamsal Benzerliklerinin Ölçülmesi ve Metin Siniflandirmada Kullanilmasi, In Proceedings of IEEE Sinyal İşleme ve İletişim Uygulamalari Kurultayi (SIU), 2009.
- Amor, B. R. , Vuik, S. I. , Callahan, R. , Darzi, A. , Yaliraki, S. N. , & Barahona, M., 2016. Community detection and role identification in directed networks: Understand- ing the twitter network of the care. data debate. In Dynamic networks and cyber.
- Asiaee T, A., Tepper, M., Banerjee, A., & Sapiro, G., 2012. If you are happy and you know it... tweet. In Proceedings of the 21st ACM international conference on Information and knowledge management, 1602-1606.
Ayrıntılar
Birincil Dil
İngilizce
Konular
Bilgisayar Yazılımı
Bölüm
Araştırma Makalesi
Yazarlar
Yayımlanma Tarihi
20 Aralık 2021
Gönderilme Tarihi
13 Ağustos 2020
Kabul Tarihi
6 Eylül 2021
Yayımlandığı Sayı
Yıl 2021 Cilt: 9 Sayı: 4
APA
Altınel, A. B. (2021). CLUDS: COMBINING LABELED AND UNLABELED DATA WITH LOGISTIC REGRESSION FOR SOCIAL MEDIA ANALYSIS. Mühendislik Bilimleri ve Tasarım Dergisi, 9(4), 1048-1061. https://doi.org/10.21923/jesd.780002
AMA
1.Altınel AB. CLUDS: COMBINING LABELED AND UNLABELED DATA WITH LOGISTIC REGRESSION FOR SOCIAL MEDIA ANALYSIS. MBTD. 2021;9(4):1048-1061. doi:10.21923/jesd.780002
Chicago
Altınel, Ayşe Berna. 2021. “CLUDS: COMBINING LABELED AND UNLABELED DATA WITH LOGISTIC REGRESSION FOR SOCIAL MEDIA ANALYSIS”. Mühendislik Bilimleri ve Tasarım Dergisi 9 (4): 1048-61. https://doi.org/10.21923/jesd.780002.
EndNote
Altınel AB (01 Aralık 2021) CLUDS: COMBINING LABELED AND UNLABELED DATA WITH LOGISTIC REGRESSION FOR SOCIAL MEDIA ANALYSIS. Mühendislik Bilimleri ve Tasarım Dergisi 9 4 1048–1061.
IEEE
[1]A. B. Altınel, “CLUDS: COMBINING LABELED AND UNLABELED DATA WITH LOGISTIC REGRESSION FOR SOCIAL MEDIA ANALYSIS”, MBTD, c. 9, sy 4, ss. 1048–1061, Ara. 2021, doi: 10.21923/jesd.780002.
ISNAD
Altınel, Ayşe Berna. “CLUDS: COMBINING LABELED AND UNLABELED DATA WITH LOGISTIC REGRESSION FOR SOCIAL MEDIA ANALYSIS”. Mühendislik Bilimleri ve Tasarım Dergisi 9/4 (01 Aralık 2021): 1048-1061. https://doi.org/10.21923/jesd.780002.
JAMA
1.Altınel AB. CLUDS: COMBINING LABELED AND UNLABELED DATA WITH LOGISTIC REGRESSION FOR SOCIAL MEDIA ANALYSIS. MBTD. 2021;9:1048–1061.
MLA
Altınel, Ayşe Berna. “CLUDS: COMBINING LABELED AND UNLABELED DATA WITH LOGISTIC REGRESSION FOR SOCIAL MEDIA ANALYSIS”. Mühendislik Bilimleri ve Tasarım Dergisi, c. 9, sy 4, Aralık 2021, ss. 1048-61, doi:10.21923/jesd.780002.
Vancouver
1.Ayşe Berna Altınel. CLUDS: COMBINING LABELED AND UNLABELED DATA WITH LOGISTIC REGRESSION FOR SOCIAL MEDIA ANALYSIS. MBTD. 01 Aralık 2021;9(4):1048-61. doi:10.21923/jesd.780002
Cited By
TÜRKÇE KONUŞMADA DUYGU TANIMA İÇİN MAKİNE ÖĞRENME YÖNTEMLERİ VE DERİN ÖĞRENME TABANLI MODELLERİN KARŞILAŞTIRILMASI
Mühendislik Bilimleri ve Tasarım Dergisi
https://doi.org/10.21923/jesd.1350375