Research Article

Comparison of Machine Learning Algorithms for Classification of Hotel Reviews: Sentiment Analysis of TripAdvisor Reviews

Volume: 7 Number: 1 February 29, 2024
EN TR

Comparison of Machine Learning Algorithms for Classification of Hotel Reviews: Sentiment Analysis of TripAdvisor Reviews

Abstract

Sentiment analysis can help extract meaningful information from these data piles from various websites and social media and measure consumers' reactions by classifying consumers' emotions as positive, negative or neutral. The success of sentiment analysis varies according to feature selection, vector space selection and machine learning method. For this reason, determining the most successful method in sentiment analysis is still controversial and important. A limited number of studies have been conducted comparing the success of various machine learning methods in sentiment analysis of hotel reviews in English. Considering this gap, the purpose of this research is to determine the most successful machine learning algorithm for sentiment analysis of hotel reviews. For this purpose, 708 reviews for 5-star hotels in Istanbul were collected manually. Obtained data were classified as positive and negative using logistic regression, k-nearest neighbor, naive Bayes and support vector machine methods. Analysis results show that the logistic regression method was the most successful classification algorithm, with an accuracy rate of 0.92. It is followed by support vector machine (0.90), naive Bayes method (0.77) and k-nearest neighbor algorithms (0.66).

Keywords

References

  1. Acar, A. & Uğur, İ. (2021). Uluslararası Zincir Otellere Yönelik Tripadvisor Yorumlarının Duygu Analizi Yöntemi ile Değerlendirilmesi: Ankara Örneği, Türk Turizm Araştırmaları Dergisi, 5(3): 1803-1814.
  2. Akgöbek, Ö., ve Çakır, F. (2009). Veri Madenciliğinde Bir Uzman Sistem Tasarımı. Akademik Bilişim, 9 , 801-806.
  3. Akın, B., Gürsoy Şimşek, U. T., (2018) Sosyal Medya Analitiği ile Değer Yaratma: Duygu Analizi İle Geleceğe Yönelim. Mehmet Akif Ersoy Üniversitesi İktisadi ve İdari Bilimler Fakültesi Dergisi, 5(3), 797-811.
  4. Al-Smadi, M., Qawasmeh, O., Al-Ayyoub, M., Jararweh, Y., & Gupta, B. (2018). Deep Recurrent neural network vs. support vector machine for aspect-based sentiment analysis of Arabic hotels’ reviews. Journal of computational science, 27, 386-393.
  5. Altunışık, R. (2015). Büyük Veri: Fırsatlar Kaynağı mı Yoksa Yeni Sorunlar Yumağı mı?. Yıldız Social Science Review, 1(1), 45-76.
  6. Altunkaynak, B. (2017). Veri Madenciliği Yöntemleri ve R Uygulamaları. Seçkin Yayıncılık, Ankara.
  7. Balaban, M. E., & Kartal, E. (2015). Veri Madenciliği ve Makine Öğrenmesi Temel Algoritmaları ve R Dili ile Uygulamaları. Çağlayan Kitabevi, İstanbul.
  8. Bagherzadeh, S., Shokouhyar, S., Jahani, H., & Sigala, M. (2021). A generalizable sentiment analysis method for creating a hotel dictionary: using big data on TripAdvisor hotel reviews. Journal of Hospitality and Tourism Technology, 12(2), 210-238.

Details

Primary Language

English

Subjects

Service Marketing, Tourism (Other)

Journal Section

Research Article

Publication Date

February 29, 2024

Submission Date

July 14, 2023

Acceptance Date

October 21, 2023

Published in Issue

Year 2024 Volume: 7 Number: 1

APA
İnan, H. E. (2024). Comparison of Machine Learning Algorithms for Classification of Hotel Reviews: Sentiment Analysis of TripAdvisor Reviews. GSI Journals Serie A: Advancements in Tourism Recreation and Sports Sciences, 7(1), 111-122. https://doi.org/10.53353/atrss.1327615
AMA
1.İnan HE. Comparison of Machine Learning Algorithms for Classification of Hotel Reviews: Sentiment Analysis of TripAdvisor Reviews. ATRSS. 2024;7(1):111-122. doi:10.53353/atrss.1327615
Chicago
İnan, Hüseyin Ertan. 2024. “Comparison of Machine Learning Algorithms for Classification of Hotel Reviews: Sentiment Analysis of TripAdvisor Reviews”. GSI Journals Serie A: Advancements in Tourism Recreation and Sports Sciences 7 (1): 111-22. https://doi.org/10.53353/atrss.1327615.
EndNote
İnan HE (February 1, 2024) Comparison of Machine Learning Algorithms for Classification of Hotel Reviews: Sentiment Analysis of TripAdvisor Reviews. GSI Journals Serie A: Advancements in Tourism Recreation and Sports Sciences 7 1 111–122.
IEEE
[1]H. E. İnan, “Comparison of Machine Learning Algorithms for Classification of Hotel Reviews: Sentiment Analysis of TripAdvisor Reviews”, ATRSS, vol. 7, no. 1, pp. 111–122, Feb. 2024, doi: 10.53353/atrss.1327615.
ISNAD
İnan, Hüseyin Ertan. “Comparison of Machine Learning Algorithms for Classification of Hotel Reviews: Sentiment Analysis of TripAdvisor Reviews”. GSI Journals Serie A: Advancements in Tourism Recreation and Sports Sciences 7/1 (February 1, 2024): 111-122. https://doi.org/10.53353/atrss.1327615.
JAMA
1.İnan HE. Comparison of Machine Learning Algorithms for Classification of Hotel Reviews: Sentiment Analysis of TripAdvisor Reviews. ATRSS. 2024;7:111–122.
MLA
İnan, Hüseyin Ertan. “Comparison of Machine Learning Algorithms for Classification of Hotel Reviews: Sentiment Analysis of TripAdvisor Reviews”. GSI Journals Serie A: Advancements in Tourism Recreation and Sports Sciences, vol. 7, no. 1, Feb. 2024, pp. 111-22, doi:10.53353/atrss.1327615.
Vancouver
1.Hüseyin Ertan İnan. Comparison of Machine Learning Algorithms for Classification of Hotel Reviews: Sentiment Analysis of TripAdvisor Reviews. ATRSS. 2024 Feb. 1;7(1):111-22. doi:10.53353/atrss.1327615

Cited By

22039
All articles published in this journal are licensed under the Creative Commons Attribution–NonCommercial 4.0 International License (CC BY-NC 4.0). This license permits the copying, distribution, and reuse of the published articles (the work) for non-commercial purposes, provided that appropriate credit is given. Commercial use is subject to permission. Authors retain the copyright of their published articles and grant the journal a non-exclusive right to publish and distribute the work.