Research Article

Spam Content Detection with Large Language Models: Comparative Analysis of Gemini and Deepseek Models on X Platform

Volume: 11 Number: 3 December 31, 2025

Spam Content Detection with Large Language Models: Comparative Analysis of Gemini and Deepseek Models on X Platform

Abstract

Social media platforms offer significant opportunities for information dissemination, yet they also pose considerable risks regarding the spread of spam content. This study comparatively evaluates the performance of large language models (LLMs) in generating content analysis-based features for detecting spam content in visual posts shared on the X social media platform. Using the same dataset, two different LLMs — Google Gemini and Deepseek — were employed to generate semantically scaled features (e.g., tag consistency, inter-tag relationships, text matching) from the post texts. Visual analyses were supported by Cloud Vision AI. The resulting features were tested using five different machine learning algorithms: Decision Trees, Random Forest, Support Vector Machines (SVM), Logistic Regression, and Multilayer Perceptron. The analysis results indicated that the Random Forest algorithm, in particular, achieved the highest F1 scores and ROC AUC values with both models. However, the features generated by the Gemini and Deepseek models resulted in significant differences in classification performance. This study highlights the differences between LLMs in generating scaled semantic features and underscores the impact of LLM selection on classification performance in spam detection tasks.

Keywords

X social media platform , spam , machine learning , large language models , computer vision

References

  1. [1] P. Sharma, T. Nagpal, G. Shrivastava, and J. D. Kumar, "A systematic review on social bots account detection using machine learning," in Proc. of the 2023 5th Int. Conf. on Advances in Computing, Communication Control and Networking (ICAC3N), Greater Noida, India, Dec. 2023. New York, NY, USA: IEEE, 2023, pp. 862–866. doi: 10.1109/icac3n60023.2023.10541657
  2. [2] S. Ghosh, et al., "Understanding and combating link farming in the Twitter social network," in Proc. of The Web Conference (WWW 2012), Lyon, France, Apr. 2012. New York, NY, USA: ACM, 2012. doi: 10.1145/2187836.2187846
  3. [3] L. Jin, Y. Chen, T. Wang, P. Hui, and A. V. Vasilakos, “Understanding user behavior in online social networks: a survey,” IEEE Communications Magazine, vol. 51, no. 9, pp. 144–150, Sept. 2013. doi: 10.1109/mcom.2013.6588663
  4. [4] K. Thomas, C. Grier, D. Song, and V. Paxson, "Suspended accounts in retrospect," in Proc. of the 2011 ACM SIGCOMM Conf. on Internet Measurement Conference (IMC 2011), Berlin, Germany, Nov. 2011. New York, NY, USA: ACM, 2011. doi: 10.1145/2068816.2068840
  5. [5] A. Aggarwal, A. Rajadesingan, and P. Kumaraguru, "PhishAri: Automatic realtime phishing detection on Twitter," in Proc. of the 2012 IEEE Int. Conf. on Communications (ICC), Ottawa, ON, Canada, Jun. 2012 [Online]. New York, NY, USA: IEEE, 2012. Available: IEEE Xplore, https://ieeexplore.ieee.org/document/6489521/. [Accessed: Jul. 16, 2025].
  6. [6] D. Wang, S. B. Navathe, L. Liu, D. Irani, A. Tamersoy, and C. Pu, "Click traffic analysis of short URL spam on Twitter," in Proc. of the 2013 IEEE Int. Conf. on Communications (ICC), Budapest, Hungary, Jun. 2013 [Online]. New York, NY, USA: IEEE, 2013. Available: IEEE Xplore, https://ieeexplore.ieee.org/abstract/document/6679991/. [Accessed: Jul. 16, 2025].
  7. [7] C. Grier, K. Thomas, V. Paxson, and M. Zhang, "@spam," in Proc. of the 17th ACM Conf. on Computer and Communications Security (CCS’10), Chicago, IL, USA, Oct. 2010. New York, NY, USA: ACM, 2010. doi: 10.1145/1866307.1866311
  8. [8] T. Wu, S. Liu, J. Zhang, and Y. Xiang, "Twitter spam detection based on deep learning," in Proc. of the Australasian Computer Science Week Multiconference (ACSW 2017), Geelong, Australia, Jan. 2017. New York, NY, USA: ACM, 2017. doi: 10.1145/3014812.3014815
  9. [9] M. Mateen, M. A. Iqbal, M. Aleem, and M. A. Islam, "A hybrid approach for spam detection for Twitter," in Proc. of the 2017 14th Int. Bhurban Conf. on Applied Sciences and Technology (IBCAST), Islamabad, Pakistan, Jan. 2017. New York, NY, USA: IEEE, 2017. doi: 10.1109/ibcast.2017.7868095
  10. [10] S. Sedhai and A. Sun, “Semi-supervised spam detection in Twitter stream,” IEEE Transactions on Computational Social Systems, vol. 5, no. 1, pp. 169–175, Mar. 2018. doi: 10.1109/tcss.2017.2773581
IEEE
[1]E. Camadan and M. Şimşek, “Spam Content Detection with Large Language Models: Comparative Analysis of Gemini and Deepseek Models on X Platform”, GJES, vol. 11, no. 3, pp. 376–397, Dec. 2025, [Online]. Available: https://izlik.org/JA66KG37SS