Research Article

Efficient Turkish Text Classification Approach for Crisis Management Systems

Volume: 34 Number: 3 September 1, 2021
EN

Efficient Turkish Text Classification Approach for Crisis Management Systems

Abstract

In this paper, an effective tweet classification system that fully supports the Turkish language has been developed. The proposed system can be used for mining (classifying) the recently published and publicly available tweets to find the crisis’s most related and useful tweets to gain situational awareness, which can help in taking the correct responses in order to prevent or at least decrease the effect of such situations. A deep study was carried out to improve and optimize the proposed system. In more detail, some intensive experiments were performed to investigate the performance of some well-known machine learning algorithms, i.e., K-Nearest Neighbor (KNN), Support Vector Machine (SVM), and Naive Bayes (NB) when used for text (tweets) classification. Then, the performances of the ensemble systems of the studied algorithms and the Random Forest (RF), AdaBoost Classifier (AdaBoost), GradientBoosting Classifier (GBC) ensemble systems have also been observed. As shown in the experimental evaluation and analysis, the proposed approach has stability, robustness, and can achieve quite good performance when processing the Turkish language. The performance of the proposed classifier was also compared with two state-of-the-art text classification approaches, i.e., "Empirical" and “Turkish Deep ".

Keywords

References

  1. [1] Domala, J., Dogra, M., Masrani, V., Fernandes, D., D'souza, K., Fernandes, D., & Carvalho, T., “Automated Identification of Disaster News for Crisis Management using Machine Learning and Natural Language Processing”, In 2020 International Conference on Electronics and Sustainable Communication Systems (ICESC), 503-508, (2020).
  2. [2] Alshehri, A., & Alahamri, S., “An Ensemble Learning for Detecting Situational Awareness Tweets during Environmental Hazards”, In 2019 IEEE International Systems Conference (SysCon), 1-8, (2019).
  3. [3] Kumar, A., Singh, J. P., & Saumya, S., “A Comparative Analysis of Machine Learning Techniques for Disaster-Related Tweet Classification”, In 2019 IEEE R10 Humanitarian Technology Conference (R10-HTC), 222-227, (2019).
  4. [4] Nalluru, G., Pandey, R., & Purohit, H., “Relevancy classification of multimodal social media streams for emergency services”, In 2019 IEEE International Conference on Smart Computing (SMARTCOMP), 121-125, (2019).
  5. [5] Ayata, D., Saraçlar, M., & Özgür, A., “Turkish tweet sentiment analysis with word embedding and machine learning”, In 2017 25th Signal Processing and Communications Applications Conference (SIU), 1-4, (2017).
  6. [6] Naili, M., Chaibi, A. H., & Ghezala, H. H. B., “Comparative study of word embedding methods in topic segmentation”, Procedia Computer Science, 112, 340-349, (2017).
  7. [7] Mikolov, T., Chen, K., Corrado, G., & Dean, J., “Efficient estimation of word representations in vector space”, arXiv preprint arXiv: 1301.3781, (2013).
  8. [8] Şahin, G., Turkish document classification based on Word2Vec and SVM classifier”, In 2017 25th Signal Processing and Communications Applications Conference (SIU), 1-4, (2017).

Details

Primary Language

English

Subjects

Engineering

Journal Section

Research Article

Publication Date

September 1, 2021

Submission Date

April 6, 2020

Acceptance Date

January 3, 2021

Published in Issue

Year 2021 Volume: 34 Number: 3

APA
Alqaraleh, S. (2021). Efficient Turkish Text Classification Approach for Crisis Management Systems. Gazi University Journal of Science, 34(3), 718-731. https://doi.org/10.35378/gujs.715296
AMA
1.Alqaraleh S. Efficient Turkish Text Classification Approach for Crisis Management Systems. Gazi University Journal of Science. 2021;34(3):718-731. doi:10.35378/gujs.715296
Chicago
Alqaraleh, Saed. 2021. “Efficient Turkish Text Classification Approach for Crisis Management Systems”. Gazi University Journal of Science 34 (3): 718-31. https://doi.org/10.35378/gujs.715296.
EndNote
Alqaraleh S (September 1, 2021) Efficient Turkish Text Classification Approach for Crisis Management Systems. Gazi University Journal of Science 34 3 718–731.
IEEE
[1]S. Alqaraleh, “Efficient Turkish Text Classification Approach for Crisis Management Systems”, Gazi University Journal of Science, vol. 34, no. 3, pp. 718–731, Sept. 2021, doi: 10.35378/gujs.715296.
ISNAD
Alqaraleh, Saed. “Efficient Turkish Text Classification Approach for Crisis Management Systems”. Gazi University Journal of Science 34/3 (September 1, 2021): 718-731. https://doi.org/10.35378/gujs.715296.
JAMA
1.Alqaraleh S. Efficient Turkish Text Classification Approach for Crisis Management Systems. Gazi University Journal of Science. 2021;34:718–731.
MLA
Alqaraleh, Saed. “Efficient Turkish Text Classification Approach for Crisis Management Systems”. Gazi University Journal of Science, vol. 34, no. 3, Sept. 2021, pp. 718-31, doi:10.35378/gujs.715296.
Vancouver
1.Saed Alqaraleh. Efficient Turkish Text Classification Approach for Crisis Management Systems. Gazi University Journal of Science. 2021 Sep. 1;34(3):718-31. doi:10.35378/gujs.715296

Cited By