Effects of Feature Extraction and Classification Methods on Cyberbully Detection
Abstract
Cyberbullying is defined as an aggressive, intentional action against a defenseless person by using the Internet, or other electronic contents. Researchers have found that many of the bullying cases have tragically ended in suicides; hence automatic detection of cyberbullying has become important. In this study we show the effects of feature extraction, feature selection, and classification methods that are used, on the performance of automatic detection of cyberbullying. To perform the experiments FormSpring.me dataset is used and the effects of preprocessing methods; several classifiers like C4.5, Naïve Bayes, kNN, and SVM; and information gain and chi square feature selection methods are investigated. Experimental results indicate that the best classification results are obtained when alphabetic tokenization, no stemming, and no stopwords removal are applied. Using feature selection also improves cyberbully detection performance. When classifiers are compared, C4.5 performs the best for the used dataset.
Keywords
References
- [1] Snakenborg, J., Van Acker, R., and Gable, R. A. 2011. Cyberbullying: Prevention and Intervention to Protect our Children and Youth. Preventing School Failure: Alternative Education for Children and Youth, 55(2011), 88-95.
- [2] Smith, P.K., Mahdavi, J., Carvalho, M., Fisher, S., Russell, S., and Tippett, N. 2008. Cyberbullying: its Nature and Impact in Secondary School Pupils. Journal of Child Psychology and Psychiatry 49(2008), 376–385.
- [3] Li, Q. 2006. Cyberbullying in Schools: A Research of Gender Differences. School Psychology International, 27(2)(2006), 157-170.
- [4] Agatston, P.W., Kowalski, R., and Limber, S. 2007. Students’ Perspectives on Cyber Bullying. Journal of Adolescent Health 41(2007), S59–S60.
- [5] Beran, T., and Li, Q. 2005. Cyber-harassment: A Study of a New Method for an old Behavior. Journal of Educational Computing Research, 32(2005), 265-277.
- [6] Hinduja, S., and Patchin, J. W. 2008. Cyberbullying: An Exploratory Analysis of Factors Related to Offending and Victimization. Deviant Behavior, 29(2008), 129-156.
- [7] Kowalski, R. M., and Limber, S. P. 2007. Electronic Bullying among Middle School Students. Journal of Adolescent Health, 41(6, Suppl. 1)(2007), 22-30.
- [8] Ortega, R., Elipe, P., Mora-Merchán, J. A., Calmaestra, J., and Vega, E. 2009. The Emotional Impact on Victims of Traditional Bullying and Cyberbullying: A study of Spanish Adolescents. Zeitschrift Für Psychologie/Journal of Psychology, 217(4)(2009), 197-204.
Details
Primary Language
Turkish
Subjects
-
Journal Section
-
Publication Date
April 15, 2017
Submission Date
June 14, 2016
Acceptance Date
-
Published in Issue
Year 2017 Volume: 21 Number: 1
Cited By
A Bi-GRU with attention and CapsNet hybrid model for cyberbullying detection on social media
World Wide Web
https://doi.org/10.1007/s11280-021-00920-4Cyberbullying in text content detection: an analytical review
International Journal of Computers and Applications
https://doi.org/10.1080/1206212X.2023.2256048A Comprehensive Review of Cyberbullying-related Content Classification In Online Social Media
Expert Systems with Applications
https://doi.org/10.1016/j.eswa.2023.122644