Research Article

Spam Content Detection with Large Language Models: Comparative Analysis of Gemini and Deepseek Models on X Platform

Volume: 11 Number: 3 December 31, 2025

Spam Content Detection with Large Language Models: Comparative Analysis of Gemini and Deepseek Models on X Platform

Abstract

Social media platforms offer significant opportunities for information dissemination, yet they also pose considerable risks regarding the spread of spam content. This study comparatively evaluates the performance of large language models (LLMs) in generating content analysis-based features for detecting spam content in visual posts shared on the X social media platform. Using the same dataset, two different LLMs — Google Gemini and Deepseek — were employed to generate semantically scaled features (e.g., tag consistency, inter-tag relationships, text matching) from the post texts. Visual analyses were supported by Cloud Vision AI. The resulting features were tested using five different machine learning algorithms: Decision Trees, Random Forest, Support Vector Machines (SVM), Logistic Regression, and Multilayer Perceptron. The analysis results indicated that the Random Forest algorithm, in particular, achieved the highest F1 scores and ROC AUC values with both models. However, the features generated by the Gemini and Deepseek models resulted in significant differences in classification performance. This study highlights the differences between LLMs in generating scaled semantic features and underscores the impact of LLM selection on classification performance in spam detection tasks.

Keywords

References

  1. [1] P. Sharma, T. Nagpal, G. Shrivastava, and J. D. Kumar, "A systematic review on social bots account detection using machine learning," in Proc. of the 2023 5th Int. Conf. on Advances in Computing, Communication Control and Networking (ICAC3N), Greater Noida, India, Dec. 2023. New York, NY, USA: IEEE, 2023, pp. 862–866. doi: 10.1109/icac3n60023.2023.10541657
  2. [2] S. Ghosh, et al., "Understanding and combating link farming in the Twitter social network," in Proc. of The Web Conference (WWW 2012), Lyon, France, Apr. 2012. New York, NY, USA: ACM, 2012. doi: 10.1145/2187836.2187846
  3. [3] L. Jin, Y. Chen, T. Wang, P. Hui, and A. V. Vasilakos, “Understanding user behavior in online social networks: a survey,” IEEE Communications Magazine, vol. 51, no. 9, pp. 144–150, Sept. 2013. doi: 10.1109/mcom.2013.6588663
  4. [4] K. Thomas, C. Grier, D. Song, and V. Paxson, "Suspended accounts in retrospect," in Proc. of the 2011 ACM SIGCOMM Conf. on Internet Measurement Conference (IMC 2011), Berlin, Germany, Nov. 2011. New York, NY, USA: ACM, 2011. doi: 10.1145/2068816.2068840
  5. [5] A. Aggarwal, A. Rajadesingan, and P. Kumaraguru, "PhishAri: Automatic realtime phishing detection on Twitter," in Proc. of the 2012 IEEE Int. Conf. on Communications (ICC), Ottawa, ON, Canada, Jun. 2012 [Online]. New York, NY, USA: IEEE, 2012. Available: IEEE Xplore, https://ieeexplore.ieee.org/document/6489521/. [Accessed: Jul. 16, 2025].
  6. [6] D. Wang, S. B. Navathe, L. Liu, D. Irani, A. Tamersoy, and C. Pu, "Click traffic analysis of short URL spam on Twitter," in Proc. of the 2013 IEEE Int. Conf. on Communications (ICC), Budapest, Hungary, Jun. 2013 [Online]. New York, NY, USA: IEEE, 2013. Available: IEEE Xplore, https://ieeexplore.ieee.org/abstract/document/6679991/. [Accessed: Jul. 16, 2025].
  7. [7] C. Grier, K. Thomas, V. Paxson, and M. Zhang, "@spam," in Proc. of the 17th ACM Conf. on Computer and Communications Security (CCS’10), Chicago, IL, USA, Oct. 2010. New York, NY, USA: ACM, 2010. doi: 10.1145/1866307.1866311
  8. [8] T. Wu, S. Liu, J. Zhang, and Y. Xiang, "Twitter spam detection based on deep learning," in Proc. of the Australasian Computer Science Week Multiconference (ACSW 2017), Geelong, Australia, Jan. 2017. New York, NY, USA: ACM, 2017. doi: 10.1145/3014812.3014815

Details

Primary Language

English

Subjects

Computer Software

Journal Section

Research Article

Publication Date

December 31, 2025

Submission Date

July 19, 2025

Acceptance Date

September 1, 2025

Published in Issue

Year 2025 Volume: 11 Number: 3

APA
Camadan, E., & Şimşek, M. (2025). Spam Content Detection with Large Language Models: Comparative Analysis of Gemini and Deepseek Models on X Platform. Gazi Journal of Engineering Sciences, 11(3), 376-397. https://izlik.org/JA66KG37SS
AMA
1.Camadan E, Şimşek M. Spam Content Detection with Large Language Models: Comparative Analysis of Gemini and Deepseek Models on X Platform. GJES. 2025;11(3):376-397. https://izlik.org/JA66KG37SS
Chicago
Camadan, Ebutalha, and Mehmet Şimşek. 2025. “Spam Content Detection With Large Language Models: Comparative Analysis of Gemini and Deepseek Models on X Platform”. Gazi Journal of Engineering Sciences 11 (3): 376-97. https://izlik.org/JA66KG37SS.
EndNote
Camadan E, Şimşek M (December 1, 2025) Spam Content Detection with Large Language Models: Comparative Analysis of Gemini and Deepseek Models on X Platform. Gazi Journal of Engineering Sciences 11 3 376–397.
IEEE
[1]E. Camadan and M. Şimşek, “Spam Content Detection with Large Language Models: Comparative Analysis of Gemini and Deepseek Models on X Platform”, GJES, vol. 11, no. 3, pp. 376–397, Dec. 2025, [Online]. Available: https://izlik.org/JA66KG37SS
ISNAD
Camadan, Ebutalha - Şimşek, Mehmet. “Spam Content Detection With Large Language Models: Comparative Analysis of Gemini and Deepseek Models on X Platform”. Gazi Journal of Engineering Sciences 11/3 (December 1, 2025): 376-397. https://izlik.org/JA66KG37SS.
JAMA
1.Camadan E, Şimşek M. Spam Content Detection with Large Language Models: Comparative Analysis of Gemini and Deepseek Models on X Platform. GJES. 2025;11:376–397.
MLA
Camadan, Ebutalha, and Mehmet Şimşek. “Spam Content Detection With Large Language Models: Comparative Analysis of Gemini and Deepseek Models on X Platform”. Gazi Journal of Engineering Sciences, vol. 11, no. 3, Dec. 2025, pp. 376-97, https://izlik.org/JA66KG37SS.
Vancouver
1.Ebutalha Camadan, Mehmet Şimşek. Spam Content Detection with Large Language Models: Comparative Analysis of Gemini and Deepseek Models on X Platform. GJES [Internet]. 2025 Dec. 1;11(3):376-97. Available from: https://izlik.org/JA66KG37SS

GJES is indexed and archived by:

3311333114331153311633117

Gazi Journal of Engineering Sciences (GJES) publishes open access articles under a Creative Commons Attribution 4.0 International License (CC BY) 1366_2000-copia-2.jpg