Sentiment Analysis on Twitter Based on Ensemble of Psychological and Linguistic Feature Sets

Aytuğ Onan

doi:10.17694/bajece.419538

Research Article

Year 2018, Volume: 6 Issue: 2, 69-77, 30.04.2018

https://doi.org/10.17694/bajece.419538

Cited By: 24

Abstract

With advances in information and communication technologies, social media and microblogging platforms have become an important source of information. On microblogging platforms, people share their opinions, complaints, sentiments and attitudes towards topics, current issues and products. Sentiment analysis is an important research direction in natural language processing that aims to identify the sentiment orientation of source materials. Twitter is a popular microblogging platform where people all over the world interact through user-generated text messages. Information obtained from Twitter can serve as an essential source for several applications, including event detection, news recommendation and crisis management. In sentiment classification, identifying an appropriate feature subset plays an important role. LIWC (Linguistic Inquiry and Word Count) is an exploratory text analysis tool that extracts psycholinguistic features from text documents. In this paper, we present a psycholinguistic approach to sentiment analysis on Twitter. In this scheme, we use five main LIWC categories (namely, linguistic processes, psychological processes, personal concerns, spoken categories and punctuation) as feature sets. The experimental analysis considers the five LIWC categories and their ensemble combinations. To explore the predictive performance of the different feature engineering schemes, four supervised learning algorithms (namely, Naïve Bayes, support vector machines, the k-nearest neighbor algorithm and logistic regression) and three ensemble learning methods (namely, AdaBoost, Bagging and Random Subspace) are used. The experimental results indicate that ensemble feature sets yield higher predictive performance than the individual feature sets.
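
As a rough illustration of what "individual feature sets and their ensemble combinations" can mean in practice, the Python sketch below enumerates every non-empty combination of the five LIWC categories and builds a combined feature vector by concatenation. The abstract does not say which combinations were evaluated or how the feature sets were merged, so the exhaustive enumeration, the concatenation step, the function names and the toy scores are all assumptions made for illustration, not the paper's procedure.

```python
from itertools import combinations

# The five LIWC categories named in the abstract.
LIWC_CATEGORIES = [
    "linguistic processes",
    "psychological processes",
    "personal concerns",
    "spoken categories",
    "punctuation",
]


def feature_set_combinations(categories):
    """Return every non-empty combination of categories, smallest first."""
    combos = []
    for size in range(1, len(categories) + 1):
        combos.extend(combinations(categories, size))
    return combos


def concatenate_features(per_category, chosen):
    """Join the chosen categories' vectors into one feature vector.

    Assumption: an ensemble feature set is formed by plain concatenation.
    """
    vector = []
    for name in chosen:
        vector.extend(per_category[name])
    return vector


# Hypothetical per-category scores for a single tweet (toy numbers only).
tweet_features = {
    "linguistic processes": [0.12, 0.30],
    "psychological processes": [0.05, 0.40, 0.10],
    "personal concerns": [0.00],
    "spoken categories": [0.02],
    "punctuation": [0.25, 0.08],
}

all_combos = feature_set_combinations(LIWC_CATEGORIES)
individual = [c for c in all_combos if len(c) == 1]  # single categories
ensembles = [c for c in all_combos if len(c) > 1]    # two or more categories

# Combined vector for one candidate ensemble feature set.
example = concatenate_features(
    tweet_features, ("linguistic processes", "punctuation")
)
```

Under these assumptions, each resulting vector could then be passed to any of the classifiers listed in the abstract; the sketch stops short of training so that it stays independent of a particular machine learning library.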

Keywords

Machine learning, psychological feature sets, sentiment analysis, Twitter

Details

Primary Language	English
Subjects	Engineering
Section	Research Article
Authors	Aytuğ Onan
Publication Date	April 30, 2018
Published in Issue	Year 2018 Volume: 6 Issue: 2

Cite

APA	Onan, A. (2018). Sentiment Analysis on Twitter Based on Ensemble of Psychological and Linguistic Feature Sets. Balkan Journal of Electrical and Computer Engineering, 6(2), 69-77. https://doi.org/10.17694/bajece.419538

Cited By

A Study of Lightweight Approaches to Analyze Crime Conditions in India

Journal of Applied Security Research

https://doi.org/10.1080/19361610.2021.2006031

A systematic literature review on machine learning applications for consumer sentiment analysis using online reviews

Computer Science Review

https://doi.org/10.1016/j.cosrev.2021.100413

Using artificial intelligence techniques for detecting Covid-19 epidemic fake news in Moroccan tweets

Results in Physics

https://doi.org/10.1016/j.rinp.2021.104266

A comprehensive review and evaluation on text predictive and entertainment systems

Soft Computing

https://doi.org/10.1007/s00500-021-06691-4

Sentence Classification Using N-Grams in Urdu Language Text

Scientific Programming

https://doi.org/10.1155/2021/1296076

A New Big Data Feature Selection Approach for Text Classification

Scientific Programming

https://doi.org/10.1155/2021/6645345

Cost-sensitive regression learning on small dataset through intra-cluster product favoured feature selection

Connection Science

https://doi.org/10.1080/09540091.2021.1970719

Using the Ship-Gram Model for Japanese Keyword Extraction Based on News Reports

Complexity

https://doi.org/10.1155/2021/9965843

Bayesian Attribute Bagging-Based Extreme Learning Machine for High-Dimensional Classification and Regression

ACM Transactions on Intelligent Systems and Technology

https://doi.org/10.1145/3495164

Equity Research Report-Driven Investment Strategy in Korea Using Binary Classification on Stock Price Direction

IEEE Access

https://doi.org/10.1109/ACCESS.2021.3067691

Dental Impression Tray Selection From Maxillary Arch Images Using Multi-Feature Fusion and Ensemble Classifier

IEEE Access

https://doi.org/10.1109/ACCESS.2021.3059785

Arabic sentiment analysis using GCL-based architectures and a customized regularization function

Engineering Science and Technology, an International Journal

https://doi.org/10.1016/j.jestch.2023.101433

Aspect Based Opinion Mining on Hotel Reviews

International Journal of Advances in Engineering and Pure Sciences

https://doi.org/10.7240/jeps.896515

Automatic Personality Evaluation from Transliterations of YouTube Vlogs Using Classical and State of the art Word Embeddings

Ingeniería e Investigación

https://doi.org/10.15446/ing.investig.93803

Research on Diagnosis Prediction of Traditional Chinese Medicine Diseases Based on Improved Bayesian Combination Model

Evidence-Based Complementary and Alternative Medicine

Zhulv Zhang

https://doi.org/10.1155/2021/5513748

Solving Misclassification of the Credit Card Imbalance Problem Using Near Miss

Mathematical Problems in Engineering

Nhlakanipho Michael Mqadi

https://doi.org/10.1155/2021/7194728

Predicting Learning Behavior Using Log Data in Blended Teaching

Scientific Programming

Shu-Tong Xie

https://doi.org/10.1155/2021/4327896

Improving Arabic Sentiment Analysis Using CNN-Based Architectures and Text Preprocessing

Computational Intelligence and Neuroscience

Mustafa Mhamed

https://doi.org/10.1155/2021/5538791

Grade Prediction in Blended Learning Using Multisource Data

Scientific Programming

Ling-qing Chen

https://doi.org/10.1155/2021/4513610

The power of ensemble learning in sentiment analysis

Expert Systems with Applications

Jacqueline Kazmaier

https://doi.org/10.1016/j.eswa.2021.115819

An Incremental Approach to Corpus Design and Construction: Application to a Large Contemporary Saudi Corpus

IEEE Access

Hebah Elgibreen

https://doi.org/10.1109/ACCESS.2021.3089924

A comparative study of keyword extraction algorithms for English texts

Journal of Intelligent Systems

Jinye Li

https://doi.org/10.1515/jisys-2021-0040

Ensemble of Classifiers and Term Weighting Schemes for Sentiment Analysis in Turkish

Scientific Research Communications

Aytuğ Onan

https://doi.org/10.52460/src.2021.004

A Meta-Ensemble Classifier Approach: Random Rotation Forest

Balkan Journal of Electrical and Computer Engineering

Erdal Taşcı

https://doi.org/10.17694/bajece.502156

All articles published by BAJECE are licensed under the Creative Commons Attribution 4.0 International License. This permits anyone to copy, redistribute, remix, transmit and adapt the work, provided the original work and source are appropriately cited.