Araştırma Makalesi

Sentiment Analysis on Twitter Based on Ensemble of Psychological and Linguistic Feature Sets

Cilt: 6 Sayı: 2 30 Nisan 2018
PDF İndir
EN

Sentiment Analysis on Twitter Based on Ensemble of Psychological and Linguistic Feature Sets

Öz

With the advances in information and communication technologies, social media and microblogging platforms serve as an important source of information. In microblogging platforms, people can share their opinions, complaints, sentiments and attitudes towards topics, current issues and products. Sentiment analysis is an important research direction in natural language processing, which aims to identify the sentiment orientation of source materials. Twitter is a popular microblogging platform, where people all over the world can interact by user-generated text messages. Information obtained from Twitter can serve as an essential source for several applications, including event detection, news recommendation and crisis management. In sentiment classification, the identification of an appropriate feature subset plays an important role. LIWC (Linguistic Inquiry and Word Count) is an exploratory text analysis software to extract psycholinguistic features from text documents. In this paper, we present a psycholinguistic approach to sentiment analysis on Twitter. In this scheme, we utilized five main LIWC categories (namely, linguistic processes, psychological processes, personal concerns, spoken categories and punctuation) as feature sets. In the experimental analysis, five LIWC categories and their ensemble combinations are taken into consideration. To explore the predictive performance of different feature engineering schemes, four supervised learning algorithms (namely, Naïve Bayes, support vector machines, k-nearest neighbor algorithm and logistic regression) and three ensemble learning methods (namely, AdaBoost, Bagging and Random Subspace) are utilized. The experimental results indicate that ensemble feature sets yield higher predictive performance compared to the individual feature sets. 

Anahtar Kelimeler

Kaynakça

  1. [1] A. Onan, “Twitter mesajları üzerinde makine öğrenmesi yöntemlerine dayalı duygu analizi”, Yönetim Bilişim Sistemleri Dergisi, Vol. 3, No. 2, 2017, pp. 1-14.
  2. [2] A. Onan, S. Korukoğlu, and H. Bulut, “A multiobjective weighted voting ensemble classifier based on differential evolution algorithm for text sentiment classification”, Expert Systems with Applications, Vol.62, 2016, pp.1-16.
  3. [3] A.Onan, “A machine learning based approach to identify geo-location of Twitter users”, in Proceedings of the ICC 2017, UK, 2017, pp.1-7.
  4. [4] J. Mahmud, J. Nichols, and C. Drews, “Home location identification of twitter users”, ACM Transactions on Intelligent Systems and Technology, Vol. 5, No.3, 2014, pp.47.
  5. [5] Z. Cheng, J. Caverlee, and K.Lee, “You are where you tweet: a content-based approach to geo-location twitter users”, in Proceedings of the 19th ACM International Conference on Information and Knowledge Management, USA, 2010, pp.759-768.
  6. [6] B.Hecht, L.Hong, B. Suh and E.D.Chi, “Tweets from Justin Bieber’s heart: the dynamics of the location field in user profiles”, in Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, USA, 2011, pp.237-246.
  7. [7] A. Onan and S. Korukoğlu, “Makine öğrenmesi yöntemlerinin görüş madenciliğinde kullanılması üzerine bir literatür araştırması”, Pamukkale Üniversitesi Mühendislik Bilimleri Dergisi, Vol. 22, No. 2, 2016, pp. 111-122.
  8. [8] W. Medhat, A. Hassan and H. Korashy, “Sentiment analysis algorithms and applications: a survey”, Ain Shams Engineering Journal, Vol. 5, No. 4, 2014, pp. 1093-1113.

Ayrıntılar

Birincil Dil

İngilizce

Konular

Mühendislik

Bölüm

Araştırma Makalesi

Yazarlar

Yayımlanma Tarihi

30 Nisan 2018

Gönderilme Tarihi

25 Temmuz 2017

Kabul Tarihi

16 Kasım 2017

Yayımlandığı Sayı

Yıl 2018 Cilt: 6 Sayı: 2

Kaynak Göster

APA
Onan, A. (2018). Sentiment Analysis on Twitter Based on Ensemble of Psychological and Linguistic Feature Sets. Balkan Journal of Electrical and Computer Engineering, 6(2), 69-77. https://doi.org/10.17694/bajece.419538
AMA
1.Onan A. Sentiment Analysis on Twitter Based on Ensemble of Psychological and Linguistic Feature Sets. Balkan Journal of Electrical and Computer Engineering. 2018;6(2):69-77. doi:10.17694/bajece.419538
Chicago
Onan, Aytuğ. 2018. “Sentiment Analysis on Twitter Based on Ensemble of Psychological and Linguistic Feature Sets”. Balkan Journal of Electrical and Computer Engineering 6 (2): 69-77. https://doi.org/10.17694/bajece.419538.
EndNote
Onan A (01 Nisan 2018) Sentiment Analysis on Twitter Based on Ensemble of Psychological and Linguistic Feature Sets. Balkan Journal of Electrical and Computer Engineering 6 2 69–77.
IEEE
[1]A. Onan, “Sentiment Analysis on Twitter Based on Ensemble of Psychological and Linguistic Feature Sets”, Balkan Journal of Electrical and Computer Engineering, c. 6, sy 2, ss. 69–77, Nis. 2018, doi: 10.17694/bajece.419538.
ISNAD
Onan, Aytuğ. “Sentiment Analysis on Twitter Based on Ensemble of Psychological and Linguistic Feature Sets”. Balkan Journal of Electrical and Computer Engineering 6/2 (01 Nisan 2018): 69-77. https://doi.org/10.17694/bajece.419538.
JAMA
1.Onan A. Sentiment Analysis on Twitter Based on Ensemble of Psychological and Linguistic Feature Sets. Balkan Journal of Electrical and Computer Engineering. 2018;6:69–77.
MLA
Onan, Aytuğ. “Sentiment Analysis on Twitter Based on Ensemble of Psychological and Linguistic Feature Sets”. Balkan Journal of Electrical and Computer Engineering, c. 6, sy 2, Nisan 2018, ss. 69-77, doi:10.17694/bajece.419538.
Vancouver
1.Aytuğ Onan. Sentiment Analysis on Twitter Based on Ensemble of Psychological and Linguistic Feature Sets. Balkan Journal of Electrical and Computer Engineering. 01 Nisan 2018;6(2):69-77. doi:10.17694/bajece.419538

Cited By

Aspect Based Opinion Mining on Hotel Reviews

International Journal of Advances in Engineering and Pure Sciences

https://doi.org/10.7240/jeps.896515

All articles published by BAJECE are licensed under the Creative Commons Attribution 4.0 International License. This permits anyone to copy, redistribute, remix, transmit and adapt the work provided the original work and source is appropriately cited.Creative Commons Lisans