Research Article

Application of Grid Search Parameter Optimized Bayesian Logistic Regression Algorithm to Detect Cyberbullying in Turkish Microblog Data

Volume: 7 Number: 3 September 28, 2019
TR EN

Application of Grid Search Parameter Optimized Bayesian Logistic Regression Algorithm to Detect Cyberbullying in Turkish Microblog Data

Abstract

There is a huge interaction between users of various social media platforms. This communication produces enormous amount of user data worth to be analyzed from numerous aspects. One of the research area emerging from the user data is a major security issue known as cyberbullying. Since this problem has been recognized as the source of cybercrimes, design of a system to detect cyberbullying attacks/sources through the micro-blog texts is evident. Most of the academic search of this topic has been conducted in English language. The originality of this paper is that we develop an accurate cyberbullying detection system for Turkish language. We used data from Twitter to develop a supervised machine learning model on top of Bayesian Logistic Regression whose parameters are tuned with the use of grid-search algorithm. Since the text data produces a high dimensional training space for machine learning algorithms, we also used Chi-Squared (CH2) feature selection strategy to obtain best subset of features. The optimized version of the proposed algorithm on top of reduced feature dimension has produced an f-measure value of 0.925. Finally, we also compared the results of the proposed algorithm with the frequently used machine learning methods from literature and we provided the corresponding results in related sections.

Keywords

References

  1. [1] M. A. Al-garadi, K. D. Varathan, and S. D. Ravana, “Cybercrime detection in online communications: The experimental case of cyberbullying detection in the Twitter network”, Computers in Human Behavior, vol. 63, pp. 433–443, 2016.
  2. [2] N. Tahmasbi and A. Fuchberger, “Challenges and future directions of automated cyberbullying detection”, in Twenty-fourth Americas Conference on Information Systems, New Orleans, USA, (2018).
  3. [3] M. Arntfield, “Toward a cybervictimology: Cyberbullying, routine activities theory, and the anti-sociality of social media”, Canadian Journal of Communication, vol. 40, pp. 371-388, 2015.
  4. [4] C. Salmivalli, “Bullying and the peer group: A review”, Aggression and Violent Behavior, vol. 15, pp. 112-120, 2010.
  5. [5] R. M. Kowalski, G. W. Giumetti, A. N. Schroeder, and M. R. Lattanner, “Bullying in the digital age: A critical review and meta-analysis of cyberbullying research among youth”, Psychological Bulletin, vol. 140, no. 4, pp. 1073-1137, 2014.
  6. [6] E. Menesini et al., “Cyberbullying definition among adolescents: A comparison across six european countries”, Cyberpsychology, Behavior, and Social Networking, vol. 15, no. 9, pp. 455–463, 2012.
  7. [7] K. Dinakar, B. Jones, C. Havasi, H. Lieberman, and R. Picard, “Common sense reasoning for detection, prevention, and mitigation of cyberbullying”, ACM Transactions on Interactive Intelligent Systems, vol. 2, no. 3, pp. 1–30, 2012.
  8. [8] S. Nadali, M. A. A. Murad, N. M. Sharef, A. Mustapha, and S. Shojaee, “A Review of cyberbullying detection . An overview”, in 2013 13th International Conference on Intelligent Systems Design and Applications (ISDA), Kuala Lumpur, Malaysia, 325-330, 2013.

Details

Primary Language

English

Subjects

Engineering

Journal Section

Research Article

Publication Date

September 28, 2019

Submission Date

December 12, 2018

Acceptance Date

April 24, 2019

Published in Issue

Year 2019 Volume: 7 Number: 3

APA
Özçift, A., Kılınç, D., & Bozyiğit, F. (2019). Application of Grid Search Parameter Optimized Bayesian Logistic Regression Algorithm to Detect Cyberbullying in Turkish Microblog Data. Academic Platform - Journal of Engineering and Science, 7(3), 355-361. https://doi.org/10.21541/apjes.496018
AMA
1.Özçift A, Kılınç D, Bozyiğit F. Application of Grid Search Parameter Optimized Bayesian Logistic Regression Algorithm to Detect Cyberbullying in Turkish Microblog Data. APJES. 2019;7(3):355-361. doi:10.21541/apjes.496018
Chicago
Özçift, Akın, Deniz Kılınç, and Fatma Bozyiğit. 2019. “Application of Grid Search Parameter Optimized Bayesian Logistic Regression Algorithm to Detect Cyberbullying in Turkish Microblog Data”. Academic Platform - Journal of Engineering and Science 7 (3): 355-61. https://doi.org/10.21541/apjes.496018.
EndNote
Özçift A, Kılınç D, Bozyiğit F (September 1, 2019) Application of Grid Search Parameter Optimized Bayesian Logistic Regression Algorithm to Detect Cyberbullying in Turkish Microblog Data. Academic Platform - Journal of Engineering and Science 7 3 355–361.
IEEE
[1]A. Özçift, D. Kılınç, and F. Bozyiğit, “Application of Grid Search Parameter Optimized Bayesian Logistic Regression Algorithm to Detect Cyberbullying in Turkish Microblog Data”, APJES, vol. 7, no. 3, pp. 355–361, Sept. 2019, doi: 10.21541/apjes.496018.
ISNAD
Özçift, Akın - Kılınç, Deniz - Bozyiğit, Fatma. “Application of Grid Search Parameter Optimized Bayesian Logistic Regression Algorithm to Detect Cyberbullying in Turkish Microblog Data”. Academic Platform - Journal of Engineering and Science 7/3 (September 1, 2019): 355-361. https://doi.org/10.21541/apjes.496018.
JAMA
1.Özçift A, Kılınç D, Bozyiğit F. Application of Grid Search Parameter Optimized Bayesian Logistic Regression Algorithm to Detect Cyberbullying in Turkish Microblog Data. APJES. 2019;7:355–361.
MLA
Özçift, Akın, et al. “Application of Grid Search Parameter Optimized Bayesian Logistic Regression Algorithm to Detect Cyberbullying in Turkish Microblog Data”. Academic Platform - Journal of Engineering and Science, vol. 7, no. 3, Sept. 2019, pp. 355-61, doi:10.21541/apjes.496018.
Vancouver
1.Akın Özçift, Deniz Kılınç, Fatma Bozyiğit. Application of Grid Search Parameter Optimized Bayesian Logistic Regression Algorithm to Detect Cyberbullying in Turkish Microblog Data. APJES. 2019 Sep. 1;7(3):355-61. doi:10.21541/apjes.496018

Cited By