Research Article
PDF EndNote BibTex RIS Cite

MEDICAL SENTIMENT ANALYSIS BASED ON SOFT VOTING ENSEMBLE ALGORITHM

Year 2020, Volume 6, Issue 1, 42 - 50, 15.06.2020

Abstract

Digital information is continuously generated from various sources such as social media, user reviews for services. The processing of this written information to extract user opinions is critical for developing customer satisfaction. In particular, medical services may be improved with customer feedbacks if the user opinions or sentiments are inferred from user reviews. There is an ongoing effort to develop automated software systems to evaluate these customer reviews. Machine Learning (ML) algorithms combined with Natural Language Processing (NLP) techniques are used to assess customer feedbacks. There are many studies related to English language in the literature to evaluate sentiments of user reviews. However, Turkish language needs research and it has abundant search opportunities in terms of sentiment classification. This work develops a soft voting ensemble (SVE) algorithm that combines predictions of Logistic Regression (LR), Random Forest (RF) and Decision Tree (DT) to analyze a newly collected medical review data. The accuracies of sentiment classifications of LR, RF and DT are 90.68%, 89.03% and 85.41%. The sentiment classification accuracy of SVE, combination of three algorithms, is 91.12%. The obtained results are promising for an automated Turkish medical sentiment identification algorithm.

References

  • Al-Ayyoub, M., Khamaiseh, A.A., Jararweh & Y., Al-Kabi, M. (2019). A comprehensive survey of arabic sentiment analysis, Information Processing and Management: 320-342.
  • Bomaccorso, G. (2018). Machine Learning Algorithms: Popular algorithms for data science and machine learning, Packt Publishing, Birmingham, United Kingdom: 281-282.
  • Coban, O., & Ozel, S. A. (2018). An Empirical Study of the Extreme Learning Machine for Twitter Sentiment Analysis, International Journal of Intelligent Systems and Applications in Engineering: 178-184.
  • Crannell, C.W., Clark, E., Jones, J., James, T., & Moore, J. (2016). A pattern-matched Twitter analysis of US cancer-patient sentiments, Journal of Surgical Research: 536-542.
  • Das, B. & Chakraborty, S. (2018). An Improved Text Sentiment Classification Model Using TF-IDF and Next Word Negation, Computation and Language, arXiv:1806.06407.
  • Dekharghani, R., Yanikoglu, B., Saygin & Y., Oflazer, K. (2016).Sentiment analysis in Turkish at different granularity levels,Natural Language Engineering: 535-559.
  • Ersahin, B., Aktas, O., Kilinc, D. & Ersahin, M. (2019). A hybrid sentiment analysis method for Turkish,, Turkish Journal of Electrical Engineering & Computer Sciences: 1780-1793.
  • Kaewrod, N. & Kietikul, J. (2018). Improving ID3 Algorithm by Ignoring Minor Instances, International Computer Science and Engineering Conference (ICSEC), Chiang Mai, Thailand.
  • Karcioglu, A. A. & Aydin, T. (2019). Sentiment Analysis of Turkish and English Twitter Feeds Using Word2Vec Model, 2019 27th Signal Processing and Communications Applications Conference (SIU), Sivas, Turkey.
  • Korkontzelos, I., Nikfarjam, A., Shardlow, M., Sarker, A., Ananiadou, S. & Gonzalez, G. (2019). Analysis of the effect of sentiment analysis on extracting adverse drug reactions from tweets and forum posts, Journal of Biomedical Informatics: 148-158.
  • Kumar, S., Yadava, M. & Roy, P.P. (2019). Fusion of EEG response and sentiment analysis of products review to predict customer satisfaction, Information Fusion: 41-52.
  • Makinist, S., Hallac, R., Karakus, B. & Aydın, G. (2017). Preparation of Improved Turkish DataSet for Sentiment Analysis in Social Media, ITM Web of Conferences, 01-03 October 2017.
  • Obulesu, O., Mahendra, M. & ThrilokReddy, M. (2018). Machine Learning Techniques and Tools: A Survey, Proceedings of the International Conference on Inventive Research in Computing Applications (ICIRCA 2018).
  • Omari, M., Al-Hajj, M., Hammami, N. & Sabra, A. (2019). Sentiment Classifier: Logistic Regression for Arabic Services’ Reviews in Lebanon, International Conference on Computer & Information Science (ICCIS), Sakaka, Saudi Arabia.
  • Peng, W., Silan, N. & Zhonghua, S. (2019). Random Forest Classification of Rice Planting Area Using Multi-Temporal Polarimetric Radarsat-2 Data, IEEE International Symposium Geoscience and Remote Sensing (IGARSS), Yokohama, Japan. Rajput, A. (2020). Natural Language Processing, Sentiment Analysis, and Clinical Analytics, Innovation in Health Informatics, Cambridge, Massachusetts: 79-97.
  • Rani, R. & Lobiyal, D. K. (2018).Automatic Construction of Generic Stop Words List for Hindi Text, Procedia Computer Science: 362-370.
  • Ribeiro, M.H. & Coelho, L.S. (2020). Ensemble approach based on bagging, boosting and stacking for short-term prediction in agribusiness time series, Applied Soft Computing Journal.
  • Rodrigues, R.G., Dores, R.M., Camilo-Junior, C. & Rosa, T. (2016). SentiHealth-Cancer: A sentiment analysis tool to help detecting mood of patients in online social networks, International Journal of Medical Informatics: 80-95
  • Sagi, O. & Rokach, L. (2018). Ensemble learning: A survey, WIREs Data Mining Knowledge Discovery: 1-18.
  • Shehu, H.A., Tokat, S., Haidar, S. & Uyaver, S. (2019). Sentiment analysis of Turkish Twitter data, AIP Conference Proceedings.
  • Tabassum, N & Tanvir, A. (2016). A theoretical study on classifier ensemble methods and its applications, 2016 3rd International Conference on Computing for Sustainable Global Development (INDIACom), New Delhi, India.
  • Uysal, A. & Gunal, S. (2014). The impact of preprocessing on text classification, Information Processing & Management: 104-112.
  • Wael, E. & Naymat, G. (2017). The Impact of applying Different Preprocessing Steps on Review Spam Detection, Procedia Computer Science: 273-279.
  • Zafra, S.M.,Valdivia, T., González, M. & Lopez, A. (2019). How do we talk about doctors and drugs? Sentiment analysis in forums expressing opinions for medical domain, Artificial Intelligence in Medicine: 50-57.

Year 2020, Volume 6, Issue 1, 42 - 50, 15.06.2020

Abstract

References

  • Al-Ayyoub, M., Khamaiseh, A.A., Jararweh & Y., Al-Kabi, M. (2019). A comprehensive survey of arabic sentiment analysis, Information Processing and Management: 320-342.
  • Bomaccorso, G. (2018). Machine Learning Algorithms: Popular algorithms for data science and machine learning, Packt Publishing, Birmingham, United Kingdom: 281-282.
  • Coban, O., & Ozel, S. A. (2018). An Empirical Study of the Extreme Learning Machine for Twitter Sentiment Analysis, International Journal of Intelligent Systems and Applications in Engineering: 178-184.
  • Crannell, C.W., Clark, E., Jones, J., James, T., & Moore, J. (2016). A pattern-matched Twitter analysis of US cancer-patient sentiments, Journal of Surgical Research: 536-542.
  • Das, B. & Chakraborty, S. (2018). An Improved Text Sentiment Classification Model Using TF-IDF and Next Word Negation, Computation and Language, arXiv:1806.06407.
  • Dekharghani, R., Yanikoglu, B., Saygin & Y., Oflazer, K. (2016).Sentiment analysis in Turkish at different granularity levels,Natural Language Engineering: 535-559.
  • Ersahin, B., Aktas, O., Kilinc, D. & Ersahin, M. (2019). A hybrid sentiment analysis method for Turkish,, Turkish Journal of Electrical Engineering & Computer Sciences: 1780-1793.
  • Kaewrod, N. & Kietikul, J. (2018). Improving ID3 Algorithm by Ignoring Minor Instances, International Computer Science and Engineering Conference (ICSEC), Chiang Mai, Thailand.
  • Karcioglu, A. A. & Aydin, T. (2019). Sentiment Analysis of Turkish and English Twitter Feeds Using Word2Vec Model, 2019 27th Signal Processing and Communications Applications Conference (SIU), Sivas, Turkey.
  • Korkontzelos, I., Nikfarjam, A., Shardlow, M., Sarker, A., Ananiadou, S. & Gonzalez, G. (2019). Analysis of the effect of sentiment analysis on extracting adverse drug reactions from tweets and forum posts, Journal of Biomedical Informatics: 148-158.
  • Kumar, S., Yadava, M. & Roy, P.P. (2019). Fusion of EEG response and sentiment analysis of products review to predict customer satisfaction, Information Fusion: 41-52.
  • Makinist, S., Hallac, R., Karakus, B. & Aydın, G. (2017). Preparation of Improved Turkish DataSet for Sentiment Analysis in Social Media, ITM Web of Conferences, 01-03 October 2017.
  • Obulesu, O., Mahendra, M. & ThrilokReddy, M. (2018). Machine Learning Techniques and Tools: A Survey, Proceedings of the International Conference on Inventive Research in Computing Applications (ICIRCA 2018).
  • Omari, M., Al-Hajj, M., Hammami, N. & Sabra, A. (2019). Sentiment Classifier: Logistic Regression for Arabic Services’ Reviews in Lebanon, International Conference on Computer & Information Science (ICCIS), Sakaka, Saudi Arabia.
  • Peng, W., Silan, N. & Zhonghua, S. (2019). Random Forest Classification of Rice Planting Area Using Multi-Temporal Polarimetric Radarsat-2 Data, IEEE International Symposium Geoscience and Remote Sensing (IGARSS), Yokohama, Japan. Rajput, A. (2020). Natural Language Processing, Sentiment Analysis, and Clinical Analytics, Innovation in Health Informatics, Cambridge, Massachusetts: 79-97.
  • Rani, R. & Lobiyal, D. K. (2018).Automatic Construction of Generic Stop Words List for Hindi Text, Procedia Computer Science: 362-370.
  • Ribeiro, M.H. & Coelho, L.S. (2020). Ensemble approach based on bagging, boosting and stacking for short-term prediction in agribusiness time series, Applied Soft Computing Journal.
  • Rodrigues, R.G., Dores, R.M., Camilo-Junior, C. & Rosa, T. (2016). SentiHealth-Cancer: A sentiment analysis tool to help detecting mood of patients in online social networks, International Journal of Medical Informatics: 80-95
  • Sagi, O. & Rokach, L. (2018). Ensemble learning: A survey, WIREs Data Mining Knowledge Discovery: 1-18.
  • Shehu, H.A., Tokat, S., Haidar, S. & Uyaver, S. (2019). Sentiment analysis of Turkish Twitter data, AIP Conference Proceedings.
  • Tabassum, N & Tanvir, A. (2016). A theoretical study on classifier ensemble methods and its applications, 2016 3rd International Conference on Computing for Sustainable Global Development (INDIACom), New Delhi, India.
  • Uysal, A. & Gunal, S. (2014). The impact of preprocessing on text classification, Information Processing & Management: 104-112.
  • Wael, E. & Naymat, G. (2017). The Impact of applying Different Preprocessing Steps on Review Spam Detection, Procedia Computer Science: 273-279.
  • Zafra, S.M.,Valdivia, T., González, M. & Lopez, A. (2019). How do we talk about doctors and drugs? Sentiment analysis in forums expressing opinions for medical domain, Artificial Intelligence in Medicine: 50-57.

Details

Primary Language English
Subjects Social
Journal Section Articles
Authors

Akın ÖZÇİFT> (Primary Author)
CELÂL BAYAR ÜNİVERSİTESİ, HASAN FERDİ TURGUTLU TEKNOLOJİ FAKÜLTESİ, YAZILIM MÜHENDİSLİĞİ BÖLÜMÜ
0000-0002-5317-5678
Türkiye

Supporting Institution Manisa Celal Bayar Üniversitesi
Project Number 2019-057 Bilimsel Alt-Yapı Projesi
Publication Date June 15, 2020
Published in Issue Year 2020, Volume 6, Issue 1

Cite

Bibtex @research article { ybs685786, journal = {Yönetim Bilişim Sistemleri Dergisi}, issn = {2630-550X}, eissn = {2630-550X}, address = {}, publisher = {Vahap TECİM}, year = {2020}, volume = {6}, number = {1}, pages = {42 - 50}, title = {MEDICAL SENTIMENT ANALYSIS BASED ON SOFT VOTING ENSEMBLE ALGORITHM}, key = {cite}, author = {Özçift, Akın} }
APA Özçift, A. (2020). MEDICAL SENTIMENT ANALYSIS BASED ON SOFT VOTING ENSEMBLE ALGORITHM . Yönetim Bilişim Sistemleri Dergisi , 6 (1) , 42-50 . Retrieved from https://dergipark.org.tr/en/pub/ybs/issue/54333/685786
MLA Özçift, A. "MEDICAL SENTIMENT ANALYSIS BASED ON SOFT VOTING ENSEMBLE ALGORITHM" . Yönetim Bilişim Sistemleri Dergisi 6 (2020 ): 42-50 <https://dergipark.org.tr/en/pub/ybs/issue/54333/685786>
Chicago Özçift, A. "MEDICAL SENTIMENT ANALYSIS BASED ON SOFT VOTING ENSEMBLE ALGORITHM". Yönetim Bilişim Sistemleri Dergisi 6 (2020 ): 42-50
RIS TY - JOUR T1 - MEDICAL SENTIMENT ANALYSIS BASED ON SOFT VOTING ENSEMBLE ALGORITHM AU - AkınÖzçift Y1 - 2020 PY - 2020 N1 - DO - T2 - Yönetim Bilişim Sistemleri Dergisi JF - Journal JO - JOR SP - 42 EP - 50 VL - 6 IS - 1 SN - 2630-550X-2630-550X M3 - UR - Y2 - 2020 ER -
EndNote %0 Yönetim Bilişim Sistemleri Dergisi MEDICAL SENTIMENT ANALYSIS BASED ON SOFT VOTING ENSEMBLE ALGORITHM %A Akın Özçift %T MEDICAL SENTIMENT ANALYSIS BASED ON SOFT VOTING ENSEMBLE ALGORITHM %D 2020 %J Yönetim Bilişim Sistemleri Dergisi %P 2630-550X-2630-550X %V 6 %N 1 %R %U
ISNAD Özçift, Akın . "MEDICAL SENTIMENT ANALYSIS BASED ON SOFT VOTING ENSEMBLE ALGORITHM". Yönetim Bilişim Sistemleri Dergisi 6 / 1 (June 2020): 42-50 .
AMA Özçift A. MEDICAL SENTIMENT ANALYSIS BASED ON SOFT VOTING ENSEMBLE ALGORITHM. Yönetim Bilişim Sistemleri Dergisi. 2020; 6(1): 42-50.
Vancouver Özçift A. MEDICAL SENTIMENT ANALYSIS BASED ON SOFT VOTING ENSEMBLE ALGORITHM. Yönetim Bilişim Sistemleri Dergisi. 2020; 6(1): 42-50.
IEEE A. Özçift , "MEDICAL SENTIMENT ANALYSIS BASED ON SOFT VOTING ENSEMBLE ALGORITHM", Yönetim Bilişim Sistemleri Dergisi, vol. 6, no. 1, pp. 42-50, Jun. 2020