Research Article

Occupation Prediction from Twitter Data

Volume: 27 Number: 80 May 23, 2025
EN TR

Occupation Prediction from Twitter Data

Abstract

Today, the use of social media has become quite widespread. Among social media platforms, Twitter, now known as X, stands out with its number of users and abundance of data. This data can be used in many studies. In this study, it is aimed to predict occupation based on Turkish tweets. In the study, 5 datasets of different sizes were used. The tweets are evaluated and compared as single and pairwise. In the pre-processing step, different machine learning and deep learning methods and pre-trained models were tested using 2 different natural language processing libraries. Among the machine learning methods, the highest accuracy of 88% was obtained from the Logistic Regression model with pairwise tweet data, while the highest accuracy of 88% was obtained with the Multi-layer Perceptron from deep learning models. The BERT and "ytu-ce-cosmos/turkish-base-bert-uncased" developed by Yıldız Technical University COSMOS AI Research Team were used as pre-trained models. Although these models gave different results on different datasets, both of them achieved the highest success with a ratio of 89% on pairwise tweet data.

Keywords

References

  1. [1] Smart Insights. 2024. Global social media statistics research summary May 2024. https://www.smartinsights.com/social-media-marketing/social-media-strategy/new-global-social-media-research/(Accessed: 2024-05-01).
  2. [2] Backlinko. 2024. X (Twitter) Statistics: How Many People Use X? https://www.statista.com/statistics/303681/twitter-users-worldwide/ (Accessed: 2024-05-24).
  3. [3] Preoţiuc-Pietro, D., Lampos, V., Aletras, N. 2015. An analysis of the user occupational class through Twitter content, in: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pp. 1754-1764.
  4. [4] Hu, T., Xiao, H., Luo, J., Nguyen, T. 2016. What the language you tweet says about your occupation, in: Proceedings of the International AAAI Conference on Web and Social Media, Vol. 10, No. 1, pp. 181-190.
  5. [5] Aletras, N., Chamberlain, B. P. 2018. Predicting Twitter user socioeconomic attributes with network and language information, in: Proceedings of the 29th on Hypertext and Social Media, pp. 20-24.
  6. [6] Pan, J., Bhardwaj, R., Lu, W., Chieu, H. L., Pan, X., Puay, N. Y. 2019. Twitter homophily: Network based prediction of user's occupation, in: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 2633-2638.
  7. [7] Kern, M. L., McCarthy, P. X., Chakrabarty, D., Rizoiu, M. 2019. Social media-predicted personality traits and values can help match people to their ideal jobs, Proc. Natl. Acad. Sci. USA, Vol. 116, No. 52, pp. 26459-26464.
  8. [8] Zainab, K., Srivastava, G., Mago, V. 2021. Identifying health related occupations of Twitter users through word embedding and deep neural networks, BMC Bioinformatics, Vol. 22, Suppl 10, p. 630.

Details

Primary Language

English

Subjects

Distributed Computing and Systems Software (Other)

Journal Section

Research Article

Early Pub Date

May 12, 2025

Publication Date

May 23, 2025

Submission Date

August 21, 2024

Acceptance Date

September 18, 2024

Published in Issue

Year 2025 Volume: 27 Number: 80

APA
İzdaş, T., İskifoğlu, H., & Diri, B. (2025). Occupation Prediction from Twitter Data. Dokuz Eylül Üniversitesi Mühendislik Fakültesi Fen Ve Mühendislik Dergisi, 27(80), 267-271. https://doi.org/10.21205/deufmd.2025278013
AMA
1.İzdaş T, İskifoğlu H, Diri B. Occupation Prediction from Twitter Data. DEUFMD. 2025;27(80):267-271. doi:10.21205/deufmd.2025278013
Chicago
İzdaş, Tolga, Hikmet İskifoğlu, and Banu Diri. 2025. “Occupation Prediction from Twitter Data”. Dokuz Eylül Üniversitesi Mühendislik Fakültesi Fen Ve Mühendislik Dergisi 27 (80): 267-71. https://doi.org/10.21205/deufmd.2025278013.
EndNote
İzdaş T, İskifoğlu H, Diri B (May 1, 2025) Occupation Prediction from Twitter Data. Dokuz Eylül Üniversitesi Mühendislik Fakültesi Fen ve Mühendislik Dergisi 27 80 267–271.
IEEE
[1]T. İzdaş, H. İskifoğlu, and B. Diri, “Occupation Prediction from Twitter Data”, DEUFMD, vol. 27, no. 80, pp. 267–271, May 2025, doi: 10.21205/deufmd.2025278013.
ISNAD
İzdaş, Tolga - İskifoğlu, Hikmet - Diri, Banu. “Occupation Prediction from Twitter Data”. Dokuz Eylül Üniversitesi Mühendislik Fakültesi Fen ve Mühendislik Dergisi 27/80 (May 1, 2025): 267-271. https://doi.org/10.21205/deufmd.2025278013.
JAMA
1.İzdaş T, İskifoğlu H, Diri B. Occupation Prediction from Twitter Data. DEUFMD. 2025;27:267–271.
MLA
İzdaş, Tolga, et al. “Occupation Prediction from Twitter Data”. Dokuz Eylül Üniversitesi Mühendislik Fakültesi Fen Ve Mühendislik Dergisi, vol. 27, no. 80, May 2025, pp. 267-71, doi:10.21205/deufmd.2025278013.
Vancouver
1.Tolga İzdaş, Hikmet İskifoğlu, Banu Diri. Occupation Prediction from Twitter Data. DEUFMD. 2025 May 1;27(80):267-71. doi:10.21205/deufmd.2025278013

Cited By

This journal is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0).

download?token=eyJhdXRoX3JvbGVzIjpbXSwiZW5kcG9pbnQiOiJmaWxlIiwicGF0aCI6IjliNTAvMDBjMi8xZmIxLzY5MjZmZDIyOGE1NzgyLjA3MzU5MTk2LnBuZyIsImV4cCI6MTc2NDE2OTMzMSwibm9uY2UiOiI2MTU1ODg1NGZlYzhkZTA1OThkNTU2NGFmYTQzYTc0YiJ9.O5b4Ex8bMlFv5797LL8VnE9YWS_X5880dfbmOp2-kc8