Araştırma Makalesi

Improving the Performance of Sentiment Analysis by Ensemble Hybrid Learning Algorithm With NLP And Cascaded Feature Extraction

Cilt: 35 Sayı: 1 30 Mart 2023
PDF İndir
TR EN

Improving the Performance of Sentiment Analysis by Ensemble Hybrid Learning Algorithm With NLP And Cascaded Feature Extraction

Öz

Sentiment analysis is a challenging problem in Natural Language Processing since every language has its own character within several difficulties such as ambiguity, synonymy, negative suffixes…etc. Since words with ambiguity can have different sentiment scores depending on the meaning they have in their corresponding context, we accomplished a study on Turkish language to determine whether the polarity scores of these polysemous words may differ according to their meaning. For a word with ambiguity, we first made a polarity calculation module to calculate its polarity score. This way, we calculated the polarity scores of 100 Turkish polysemous words. Then, since negation directly affects the correct meaning of the word in the sentiment analysis, a negation handler module is also implemented. After that, we prepared a sentiment polarity corpus which consists of 159,876 Turkish words including 100 Turkish polysemous words. Actually, the main purpose of this study is to detect sentiment polarity of Turkish texts by considering and building a specialized module for polysemous words. In short, we built a system for Turkish sentiment polarity detection task including these modules: 1) Pre-processing, 2) Polarity Calculation Module, 3) Negation Handling Module, 4) Feature Generation Module, and 5) Classification Module. According to our knowledge, this is the first study which includes all of these modules in one Turkish sentiment analysis task. Finally, we conducted this corpus using an ensemble hybrid regularized learning algorithm on two self-collected Twitter-datasets. Experimental results show that the suggested approach improves the classification performance on Turkish sentiment analysis task.

Anahtar Kelimeler

Destekleyen Kurum

TÜBİTAK

Proje Numarası

120E187

Teşekkür

This work is supported in part by The Scientific and Technological Research Council of Turkey (TÜBİTAK) grant number [120E187]. Points of view in this document are those of the authors and do not necessarily represent the official position or policies of TÜBİTAK.

Kaynakça

  1. [1] Navigli, R., Word sense disambiguation: A survey. ACM Comput Surv, 41(2), 1-69, (2009).
  2. [2] Boyd-Graber, J., Blei, D. & Zhu, X.A. Topic model for word sense disambiguation. In Proceedings of the 2007 joint conference on empirical methods in natural language processing and computational natural language learning (EMNLP-CoNLL), Prague, (2007).
  3. [3] Açıkgöz, O., Gürkan, A. T., Ertopçu, B., Topsakal, O., Özenç, B., Kanburoğlu, A. B., & Yıldız, O. T. All-words word sense disambiguation for Turkish. In International Conference on Computer Science and Engineering (UBMK), Antalya, Turkey, (2017).
  4. [4] Orhan, Z., & Altan, Z. Effective features for disambiguation of Turkish verbs. Int J. Comp and Inf Eng, 1(7), 2264-2268, (2007).
  5. [5] Gezici, G., & Yanıkoğlu, B. Sentiment analysis in Turkish. Turkish natural language processing, 255-271, (2018).
  6. [6] Türkmenoglu, C., & Tantug, A. C. Sentiment analysis in Turkish media. In International Conference on Machine Learning (ICML), Beijing, China, (2014).
  7. [7] Çetiner, M., Yıldırım, A., Onay, B., & Öksüz, C. Word Sense Disambiguation using KeNet. In 29th Signal Processing and Communications Applications Conference (SIU), Istanbul, Turkey, (2021).
  8. [8] Mert, E., & Dalkilic, G. Word sense disambiguation for Turkish. In 24th International Symposium on Computer and Information Sciences, Cyprus, (2009).

Ayrıntılar

Birincil Dil

İngilizce

Konular

Mühendislik

Bölüm

Araştırma Makalesi

Yayımlanma Tarihi

30 Mart 2023

Gönderilme Tarihi

9 Şubat 2023

Kabul Tarihi

28 Şubat 2023

Yayımlandığı Sayı

Yıl 2023 Cilt: 35 Sayı: 1

Kaynak Göster

APA
Altınel Girgin, A. B., & Şahin, S. (2023). Improving the Performance of Sentiment Analysis by Ensemble Hybrid Learning Algorithm With NLP And Cascaded Feature Extraction. International Journal of Advances in Engineering and Pure Sciences, 35(1), 125-141. https://doi.org/10.7240/jeps.1249586
AMA
1.Altınel Girgin AB, Şahin S. Improving the Performance of Sentiment Analysis by Ensemble Hybrid Learning Algorithm With NLP And Cascaded Feature Extraction. JEPS. 2023;35(1):125-141. doi:10.7240/jeps.1249586
Chicago
Altınel Girgin, Ayşe Berna, ve Sema Şahin. 2023. “Improving the Performance of Sentiment Analysis by Ensemble Hybrid Learning Algorithm With NLP And Cascaded Feature Extraction”. International Journal of Advances in Engineering and Pure Sciences 35 (1): 125-41. https://doi.org/10.7240/jeps.1249586.
EndNote
Altınel Girgin AB, Şahin S (01 Mart 2023) Improving the Performance of Sentiment Analysis by Ensemble Hybrid Learning Algorithm With NLP And Cascaded Feature Extraction. International Journal of Advances in Engineering and Pure Sciences 35 1 125–141.
IEEE
[1]A. B. Altınel Girgin ve S. Şahin, “Improving the Performance of Sentiment Analysis by Ensemble Hybrid Learning Algorithm With NLP And Cascaded Feature Extraction”, JEPS, c. 35, sy 1, ss. 125–141, Mar. 2023, doi: 10.7240/jeps.1249586.
ISNAD
Altınel Girgin, Ayşe Berna - Şahin, Sema. “Improving the Performance of Sentiment Analysis by Ensemble Hybrid Learning Algorithm With NLP And Cascaded Feature Extraction”. International Journal of Advances in Engineering and Pure Sciences 35/1 (01 Mart 2023): 125-141. https://doi.org/10.7240/jeps.1249586.
JAMA
1.Altınel Girgin AB, Şahin S. Improving the Performance of Sentiment Analysis by Ensemble Hybrid Learning Algorithm With NLP And Cascaded Feature Extraction. JEPS. 2023;35:125–141.
MLA
Altınel Girgin, Ayşe Berna, ve Sema Şahin. “Improving the Performance of Sentiment Analysis by Ensemble Hybrid Learning Algorithm With NLP And Cascaded Feature Extraction”. International Journal of Advances in Engineering and Pure Sciences, c. 35, sy 1, Mart 2023, ss. 125-41, doi:10.7240/jeps.1249586.
Vancouver
1.Ayşe Berna Altınel Girgin, Sema Şahin. Improving the Performance of Sentiment Analysis by Ensemble Hybrid Learning Algorithm With NLP And Cascaded Feature Extraction. JEPS. 01 Mart 2023;35(1):125-41. doi:10.7240/jeps.1249586

Cited By