Research Article

A New Technique for Sentiment Analysis System Based on Deep Learning Using Chi-Square Feature Selection Methods

Volume: 9 Number: 4 October 30, 2021

Abstract

Sentiment analysis systems use natural language processing techniques and a sentiment vocabulary network. Sentiment analysis means discovering and recognizing people's positive or negative feelings about an issue or product in text. Its growing importance has coincided with the growth of social media, such as opinion polls, weblogs, Twitter and other social networks. Sentiment analysis is one of the applications of deep learning in NLP. The most common and successful type of recurrent neural network (RNN) is the long short-term memory (LSTM) network, and much research uses LSTM for sentiment analysis. However, large data volumes reduce the accuracy of LSTM networks on test data; in other words, over-fitting occurs. This problem arises when the independent variables are highly correlated with one another. In that case the model may not have high validity despite a high correlation coefficient between the independent and dependent variables: although the model looks good, it does not have significant independent variables. Combining the LSTM network with feature selection methods, which select the effective features, can address this problem and increase sentiment analysis accuracy. In this study, we review the state of the art to determine how previous research has addressed these tasks. We also propose combining the chi-square feature selection method with LSTM, Bi-LSTM and GRU models. The performance of each combination is measured and compared in terms of accuracy, precision, recall and F1 score on two benchmark datasets, Yelp and US Airline. The results show that feature selection significantly increases classification accuracy in all cases. On the Yelp dataset, Bi-LSTM attains a maximum accuracy of 100% using chi-square when the number of features is 500. On the US Airline dataset, GRU-LSTM achieves a maximum accuracy of 97.9% using chi-square when the number of features is 20.
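The chi-square selection step described in the abstract can be illustrated with a minimal sketch. It assumes binary sentiment labels and term presence per document as the feature, and scores each term with the standard 2x2 contingency form of the chi-square statistic. The function names and the toy reviews are hypothetical and are not taken from the authors' implementation; in the proposed pipeline the reduced token lists would then be passed to an LSTM, Bi-LSTM or GRU classifier.

```python
from collections import Counter


def chi_square_scores(docs, labels, positive_label):
    """Score each term with the chi-square statistic of its 2x2 term/class table.

    Assumption: a term counts once per document (presence, not frequency).
    """
    n = len(docs)
    n_pos = sum(1 for y in labels if y == positive_label)
    n_neg = n - n_pos
    df_pos = Counter()  # positive documents that contain the term
    df_neg = Counter()  # other documents that contain the term
    for tokens, y in zip(docs, labels):
        target = df_pos if y == positive_label else df_neg
        target.update(set(tokens))

    scores = {}
    for term in set(df_pos) | set(df_neg):
        a = df_pos[term]   # positive, term present
        b = df_neg[term]   # negative, term present
        c = n_pos - a      # positive, term absent
        d = n_neg - b      # negative, term absent
        denom = (a + b) * (c + d) * (a + c) * (b + d)
        scores[term] = n * (a * d - b * c) ** 2 / denom if denom else 0.0
    return scores


def select_top_k(scores, k):
    """Return the k highest-scoring terms; ties are broken alphabetically."""
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:k]]


def filter_tokens(docs, kept_terms):
    """Drop every token that was not selected."""
    kept = set(kept_terms)
    return [[t for t in tokens if t in kept] for tokens in docs]


# Hypothetical toy reviews: 1 = positive, 0 = negative.
docs = [
    ["great", "food", "friendly", "staff"],
    ["great", "service", "food"],
    ["terrible", "food", "rude", "staff"],
    ["terrible", "service", "late"],
]
labels = [1, 1, 0, 0]

scores = chi_square_scores(docs, labels, positive_label=1)
top_terms = select_top_k(scores, k=2)
reduced = filter_tokens(docs, top_terms)
```

Terms that occur equally often in both classes, such as "staff" here, score zero and are discarded first, which is the sense in which the selection keeps only the effective features. The number of retained features, `k`, corresponds to the feature counts (for example 500 or 20) reported in the abstract.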

Details

Primary Language

English

Subjects

Artificial Intelligence

Journal Section

Research Article

Publication Date

October 30, 2021

Submission Date

February 26, 2021

Acceptance Date

September 19, 2021

Published in Issue

Year 2021 Volume: 9 Number: 4

APA
Hussein, M., & Özyurt, F. (2021). A New Technique for Sentiment Analysis System Based on Deep Learning Using Chi-Square Feature Selection Methods. Balkan Journal of Electrical and Computer Engineering, 9(4), 320-326. https://izlik.org/JA56NG54LW

All articles published by BAJECE are licensed under the Creative Commons Attribution 4.0 International License. This permits anyone to copy, redistribute, remix, transmit and adapt the work, provided the original work and source are appropriately cited.