Sentiment analysis is considered the process to extract the overall expression, opinions, or feelings from reviews about something such as products, services, or movies. Pre-processing is considered a crucial phase in sentiment analysis for text mining because it allows us to analyze the reviews according to their intended meaning by removing all of the appendages which are the words that do not affect the semantics of sentences. And therefore, the number of features will decrease and thus accuracy will increase. Accordingly, we have decided to evaluate our experiment in identifying the best influencing technique of pre-processing for several features by making a comparison between the features and by combining them together to reach the best result based on the feature number for each pre-processing technique and classification accuracy. this comparison was done by using three algorithms for classification SVM, NB, and DT after applying tools for feature selection and feature extraction with three techniques for tokenization. We concluded that there are some of these techniques that have a negative effect like lemmatization and the part of them is not due to any difference, other, which a little part, have an effect
Sentiment analysis Pre-processing Techniques Machine Learning Approach. Feature Selection.
Altinbas university
Thanks
Birincil Dil | İngilizce |
---|---|
Konular | Bilgisayar Yazılımı |
Bölüm | Araştırma Makalesi |
Yazarlar | |
Yayımlanma Tarihi | 31 Aralık 2022 |
Gönderilme Tarihi | 12 Ekim 2022 |
Kabul Tarihi | 3 Kasım 2022 |
Yayımlandığı Sayı | Yıl 2022 |