Research Article

Large Language Models vs. Human Interpretation: Which is More Accurate in Text Classification?

Volume: 13 Number: 2 June 30, 2025
TR EN

Large Language Models vs. Human Interpretation: Which is More Accurate in Text Classification?

Abstract

Ekşi Sözlük is a widely used social network where numerous unusual events are discussed. In this context, it serves as a real-time news source for emergency response teams and digital news platforms. In this study, a dataset was compiled from comments shared on the Ekşi Sözlük platform regarding the Kahramanmaraş earthquake on February 6, 2023. These comments were classified into four categories: Source-Based Information, Emotional Reaction, Social Inference, and Personal Experience using the Gemma2 9B (9-billion-parameter) model, developed by Google with advanced natural language processing capabilities. A dataset of 500 comments in Excel format was analyzed, comparing the model outputs with human evaluations to assess classification accuracy. For this purpose, four evaluation columns were created for each comment based on category classification. The consistency between model-assigned categories and manually determined categories was examined using these columns. In cases where inconsistencies were detected, the model-generated explanations were subjected to qualitative evaluation. Model outputs that provide satisfactory explanations are considered acceptable, the manually classified category was assigned as the final evaluation. This process systematically resolved inconsistencies between model and human assessments, ensuring the final and validated category assignments for each comment. The highest accuracy values were observed for Social Inference (0.99), Source-Based Information (0.98), Personal Experience (0.88), and Emotional Reaction (0.83), respectively. In conclusion, this study presents a methodology for improving model performance through human supervision, contributing to the development of strategies for disaster management and crisis communication.

Keywords

References

  1. [1] AFAD, “06 Şubat 2023 Pazarcık-Elbistan Kahramanmaraş (Mw 7.7; Mw 7.6) depremleri raporu,” Deprem ve Risk Azaltma Genel Müdürlüğü, 2023.
  2. [2] Y. Argüden and B. Erşahin, Veri madenciliği: Veriden bilgiye, masraftan değere, ARGE Danışmanlık Yayınları, 2008.
  3. [3] Z. Bakan and F. Kanbay, “Makine öğrenmesi yöntemleri ile eğitim başarısına etki eden faktörlerin modellenmesi,” İstanbul Ticaret Üniversitesi Fen Bilimleri Dergisi, vol. 23, no. 45, pp. 27–41, 2024. [Online]. Available: https://doi.org/10.55071/ticaretfbd.1442084
  4. [4] G. Burel and H. Alani, “Crisis event extraction service (CREES)—Automatic detection and classification of crisis-related content on social media,” in Proc. 15th Int. Conf. Inf. Syst. Crisis Response and Manage., 2018.
  5. [5] C. Coşkun and A. Baykal, “Veri madenciliğinde sınıflandırma algoritmalarının bir örnek üzerinde karşılaştırılması,” in Akademik Bilişim Konferansı (AB'11) Bildirileri, 2011, pp. 51–58.
  6. [6] J. Devlin, M. W. Chang, K. Lee, and K. Toutanova, “BERT: Pre-training of deep bidirectional transformers for language understanding,” arXiv preprint arXiv:1810.04805, 2019.
  7. [7] M. Imran, C. Castillo, J. Lucas, P. Meier, and S. Vieweg, “AIDR: Artificial intelligence for disaster response,” in Proc. 22nd Int. Conf. World Wide Web, 2015, pp. 159–162.
  8. [8] O. H. Kwon et al., “Sentiment analysis of the United States public support of nuclear power on social media using large language models,” Renewable and Sustainable Energy Reviews, vol. 200, 114570, 2024. [Online]. Available: https://doi.org/10.1016/j.rser.2024.114570

Details

Primary Language

Turkish

Subjects

Computer Software

Journal Section

Research Article

Early Pub Date

July 11, 2025

Publication Date

June 30, 2025

Submission Date

March 5, 2025

Acceptance Date

April 14, 2025

Published in Issue

Year 2025 Volume: 13 Number: 2

APA
Özkurt, A. H., Aydemir, E., & Sönmez, Y. (2025). Large Language Models vs. Human Interpretation: Which is More Accurate in Text Classification? Balkan Journal of Electrical and Computer Engineering, 13(2), 174-182. https://doi.org/10.17694/bajece.1652268
AMA
1.Özkurt AH, Aydemir E, Sönmez Y. Large Language Models vs. Human Interpretation: Which is More Accurate in Text Classification? Balkan Journal of Electrical and Computer Engineering. 2025;13(2):174-182. doi:10.17694/bajece.1652268
Chicago
Özkurt, Ahmet Hamdi, Emrah Aydemir, and Yasin Sönmez. 2025. “Large Language Models Vs. Human Interpretation: Which Is More Accurate in Text Classification?”. Balkan Journal of Electrical and Computer Engineering 13 (2): 174-82. https://doi.org/10.17694/bajece.1652268.
EndNote
Özkurt AH, Aydemir E, Sönmez Y (June 1, 2025) Large Language Models vs. Human Interpretation: Which is More Accurate in Text Classification? Balkan Journal of Electrical and Computer Engineering 13 2 174–182.
IEEE
[1]A. H. Özkurt, E. Aydemir, and Y. Sönmez, “Large Language Models vs. Human Interpretation: Which is More Accurate in Text Classification?”, Balkan Journal of Electrical and Computer Engineering, vol. 13, no. 2, pp. 174–182, June 2025, doi: 10.17694/bajece.1652268.
ISNAD
Özkurt, Ahmet Hamdi - Aydemir, Emrah - Sönmez, Yasin. “Large Language Models Vs. Human Interpretation: Which Is More Accurate in Text Classification?”. Balkan Journal of Electrical and Computer Engineering 13/2 (June 1, 2025): 174-182. https://doi.org/10.17694/bajece.1652268.
JAMA
1.Özkurt AH, Aydemir E, Sönmez Y. Large Language Models vs. Human Interpretation: Which is More Accurate in Text Classification? Balkan Journal of Electrical and Computer Engineering. 2025;13:174–182.
MLA
Özkurt, Ahmet Hamdi, et al. “Large Language Models Vs. Human Interpretation: Which Is More Accurate in Text Classification?”. Balkan Journal of Electrical and Computer Engineering, vol. 13, no. 2, June 2025, pp. 174-82, doi:10.17694/bajece.1652268.
Vancouver
1.Ahmet Hamdi Özkurt, Emrah Aydemir, Yasin Sönmez. Large Language Models vs. Human Interpretation: Which is More Accurate in Text Classification? Balkan Journal of Electrical and Computer Engineering. 2025 Jun. 1;13(2):174-82. doi:10.17694/bajece.1652268

All articles published by BAJECE are licensed under the Creative Commons Attribution 4.0 International License. This permits anyone to copy, redistribute, remix, transmit and adapt the work provided the original work and source is appropriately cited.Creative Commons Lisansı