Data Pre-processing Approach with ML Algorithms for Accuracy and Authenticity Detection of Big Data Sourced from the Social Internet of Things and a Case Study
Abstract
The Social Internet of Things (SIoT), which integrates sensor data with social media interactions, produces massive volumes of unstructured data that require accurate and reliable preprocessing for meaningful analysis. This study investigates the effectiveness of various machine learning (ML) classification algorithms in detecting the accuracy and authenticity of SIoT-derived data.A dataset comprising 17,500 user records collected from mobile devices and social media platforms was analyzed using five ML classifiers: Logistic Regression, Naive Bayes, K-Nearest Neighbor (K-NN), Random Forest, and Support Vector Machines (SVM). Through extensive hyperparameter tuning and 5-fold cross-validation, the Random Forest and SVM models exhibited the highest performance, achieving accuracy scores of 0.58 and 0.57, respectively. SVM also obtained the best AUC value of 0.64, highlighting its strength in distinguishing authentic from manipulated data. Additionally, the results emphasize the need for larger, more diverse datasets, and suggest incorporating deep learning and automated bias mitigation methods in future research.
Keywords
References
- [1] Shahab, S., Agarwal, P., Mufti, T., & Obaid, A. J. (2022). SIoT (Social Internet of Things): A Review. Evolutionary Computing and Mobile Sustainable Networks, 313–323.
- [2] Dhelim, S., Ning, H., Farha, F., Chen, L., Atzori, L., & Daneshmand, M. (2021). IoT-Enabled Social Relationships Meet Artificial Social Intelligence. IEEE Internet of Things Journal, 8(20), 15364–15375.
- [3] Nejad, H. V., Farimani, Z. M., & Tavakolifar, A. (2020). Social Internet of Things and New Generation Computing—A Survey. Toward Social Internet of Things (SIoT), 846, 129–152.
- [4] Kaya, Ş. M., & Kaya, E. (2022). The (Un)seen Influence of S-IoT on the Political Economic Decisions. In 6th International Congress of Social Sciences, Istanbul.
- [5] Rad, M. M., Rahmani, A. M., Sahafi, A., & Qader, N. N. (2020). Social Internet of Things: vision, challenges, and trends. Human-centric Computing and Information Sciences, 10.
- [6] Kaur, N., & Sood, S. K. (2023). Social Internet of Things (SIoT): A decade’s journey and future directions. Journal of Network and Computer Applications, 210.
- [7] Dhelim, S., Ning, H., Farha, F., Chen, L., Atzori, L., & Daneshmand, M. (2021). IoT-Enabled Social Relationships Meet Artificial Social Intelligence. IEEE Internet of Things Journal, 8(20), 15364–15375.
- [8] İşler, B., Kaya, Ş. M., & Kılıç, F. R. (2025). Fog-Enabled Machine Learning Approaches for Weather Prediction in IoT Systems: A Case Study. Sensors, 25(13), 4070.
Details
Primary Language
English
Subjects
Software Architecture, Software Testing, Verification and Validation
Journal Section
Research Article
Authors
Deniz Kızılaslan
0009-0009-7128-4075
Türkiye
Publication Date
March 28, 2026
Submission Date
June 3, 2025
Acceptance Date
December 3, 2025
Published in Issue
Year 2026 Volume: 14
