Research Article

Sentiment analysis with ensemble and machine learning methods in multi-domain datasets

Volume: 7 Number: 2 April 15, 2023
EN

Sentiment analysis with ensemble and machine learning methods in multi-domain datasets

Abstract

The first place to get ideas on all the activities considered to occur in everyday life was the comments on the websites. This is an area that deals with these interpretations in the natural language processing, which is a sub-branch of artificial intelligence. Sentiment analysis studies, which is a task of natural language processing are carried out to give people an idea and even guide them with such comments. In this study, sentiment analysis was implemented on public user feedback on websites in two different areas. TripAdvisor dataset includes positive or negative user comments about hotels. And Rotten Tomatoes dataset includes positive (fresh) or negative (rotten) user comments about films. Sentiments analysis on datasets have been carried out by using Word2Vec word embedding model, which learns the vector representations of each word containing the positive or negative meaning of the sentences, and the Term Frequency Inverse Document Frequency text representation model with four machine learning methods (Naïve Bayes-NB, Support Vector Machines-SVM, Logistic Regression-LR, K-Nearest Neighbour-kNN) and two ensemble learning methods (Stacking, Majority Voting-MV). Accuracy and F-measure is used as a performance metric experiments. According to the results, Ensemble learning methods have shown better results than single machine learning algorithms. Among the overall approaches, MV outperformed Stacking.

Keywords

Supporting Institution

yok

Project Number

yok

References

  1. Mostafa, L. (2020). Machine learning-based sentiment analysis for analyzing the travelers reviews on Egyptian hotels. In Joint European-US Workshop on Applications of Invariance in Computer Vision. Springer, Cham, 405-413.
  2. Dehkharghani, R., Yanikoglu, B., Tapucu, D., & Saygin, Y. (2012). Adaptation and Use of Subjectivity Lexicons for Domain Dependent Sentiment Classification. IEEE 12th International Conference on Data Mining Workshops, 10 December, Washington, 669–673.
  3. Raut, V. B., & Londhe, D. D. (2014). Opinion Mining and Summarization of Hotel Reviews. International Conference on Computational Intelligence and Communication Networks, November, Bhopal, 556–559.
  4. Tiwari, P., Mishra, B. K., Kumar, S., & Kumar, V. (2017). Implementation of n-gram methodology for rotten tomatoes review dataset sentiment analysis. International Journal of Knowledge Discovery in Bioinformatics (IJKDB), 7(1),30–41.
  5. Zhou, Y. (2019). Sentiment Classification with Deep Neural Networks. Master's Thesis. Tampere University. Finland.
  6. Sahu, T. P., & Ahuja, S. (2016). Sentiment analysis of movie reviews: A study on feature selection and classification algorithms. International Conference on Microelectronics, Computing, and Communications (MicroCom), 23-25 January, Durgapur, 1–6.
  7. Oswin, H. R., Virginia, G., & Antonius, R. C. (2016). Sentiment Classification of Film Reviews Using IB1. 7th International Conference on Intelligent Systems, Modelling, and Simulation (ISMS), 23-25 January, Bangkok 78–82.
  8. Mostafa, L. (2021). Egyptian Student Sentiment Analysis Using Word2vec During the Coronavirus (Covid-19) Pandemic. In: Hassanien A.E., Slowik A., Snášel V., El-Deeb H., Tolba F.M. (eds) Proceedings of the International Conference on Advanced Intelligent Systems and Informatics 2020. AISI 2020. Advances in Intelligent Systems and Computing, vol 1261. Springer, Cham.

Details

Primary Language

English

Subjects

Engineering

Journal Section

Research Article

Publication Date

April 15, 2023

Submission Date

February 26, 2022

Acceptance Date

April 7, 2022

Published in Issue

Year 2023 Volume: 7 Number: 2

APA
Başarslan, M. S., & Kayaalp, F. (2023). Sentiment analysis with ensemble and machine learning methods in multi-domain datasets. Turkish Journal of Engineering, 7(2), 141-148. https://doi.org/10.31127/tuje.1079698
AMA
1.Başarslan MS, Kayaalp F. Sentiment analysis with ensemble and machine learning methods in multi-domain datasets. TUJE. 2023;7(2):141-148. doi:10.31127/tuje.1079698
Chicago
Başarslan, Muhammet Sinan, and Fatih Kayaalp. 2023. “Sentiment Analysis With Ensemble and Machine Learning Methods in Multi-Domain Datasets”. Turkish Journal of Engineering 7 (2): 141-48. https://doi.org/10.31127/tuje.1079698.
EndNote
Başarslan MS, Kayaalp F (April 1, 2023) Sentiment analysis with ensemble and machine learning methods in multi-domain datasets. Turkish Journal of Engineering 7 2 141–148.
IEEE
[1]M. S. Başarslan and F. Kayaalp, “Sentiment analysis with ensemble and machine learning methods in multi-domain datasets”, TUJE, vol. 7, no. 2, pp. 141–148, Apr. 2023, doi: 10.31127/tuje.1079698.
ISNAD
Başarslan, Muhammet Sinan - Kayaalp, Fatih. “Sentiment Analysis With Ensemble and Machine Learning Methods in Multi-Domain Datasets”. Turkish Journal of Engineering 7/2 (April 1, 2023): 141-148. https://doi.org/10.31127/tuje.1079698.
JAMA
1.Başarslan MS, Kayaalp F. Sentiment analysis with ensemble and machine learning methods in multi-domain datasets. TUJE. 2023;7:141–148.
MLA
Başarslan, Muhammet Sinan, and Fatih Kayaalp. “Sentiment Analysis With Ensemble and Machine Learning Methods in Multi-Domain Datasets”. Turkish Journal of Engineering, vol. 7, no. 2, Apr. 2023, pp. 141-8, doi:10.31127/tuje.1079698.
Vancouver
1.Muhammet Sinan Başarslan, Fatih Kayaalp. Sentiment analysis with ensemble and machine learning methods in multi-domain datasets. TUJE. 2023 Apr. 1;7(2):141-8. doi:10.31127/tuje.1079698

Cited By

Flag Counter