Performance analysis of the most downloaded Turkish and English language models on the Hugging-Face platform
Abstract
This study analyzes the performance of the most popularly downloaded language models on the Hugging Face platform. For this purpose, the five most downloaded language models in Turkish and English were used. The analysis was evaluated in three phases. These stages were contextual learning, question and answer, and expert evaluation. ARC, Turkish sentiment analysis, Hellaswag, and MMLU datasets were used for contextual learning. For the question-and-answer test, the models trained with the text file created were asked questions from the text. Finally, six experts evaluated the answers given by the models from the developed mobile application. F1 score was used for context evaluation, Rouge-1, Rouge-2, and Rouge-L metrics were used for question and answer, and Elo and TrueSkill metrics were used for expert evaluations. The correlations of these metrics were calculated, and it was seen that there was a correlation of 0.74 between expert evaluations and question-answer performances. It was also observed that learning in context and question-answering performances were not correlated. When the language models were evaluated in general, the timpal0l/mdeberta-v3-base-squad2 language model performed the best. Turkish and English language models performed best on the sentiment analysis dataset with an F1 score above 0.85.
Keywords
References
- [1] J. Jones, W. Jiang, N. Synovic, G. Thiruvathukal, and J. Davis, “What do we know about Hugging Face? A systematic literature review and quantitative validation of qualitative claims,” in Proc. of the 18th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement. 2024, pp. 13–24.
- [2] A. Ait, J. L. C. Izquierdo and J. Cabot, “HFCommunity: A Tool to Analyze the Hugging Face Hub Community,” in Proc. 2023 IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER). Taipa, Macao, 2023, pp. 728-732, doi:10.1109/SANER56733.2023.00080
- [3] Z. Hussain, M. Binz, R. Mata et al. “A tutorial on open-source large language models for behavioural science,” Behav Res 56, pp. 8214–8237, 2024. doi:10.3758/s13428-024-02455-8
- [4] S. M. Jain, “Introduction to Transformers for NLP: With the Hugging Face Library and Models to Solve Problems,” Apress Media LLC, pp. 51-53, 2022, doi: 10.1007/978-1-4842-8844-3
- [5] F. Pepe, V. Nardone, A. Mastropaolo, G. Bavota, G. Canfora, and M. Di Penta, “How do Hugging Face Models Document Datasets, Bias, and Licenses? An Empirical Study,” in Proc. of the 32nd IEEE/ACM International Conference on Program Comprehension. 2024, pp. 370–381.
- [6] Hugging Face Inc., https://Huggingface.Co/ (accessed August. 13, 2024)
- [7] Y. Shen, K. Song, X. Tan, D. Li, W. Lu and Y. Zhuang, “HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face. Advances in Neural Information,” in Proc. Systems 36, New Orleans, USA, 2023, pp. 38154—38180, doi: 10.48550/arXiv.2303.17580.
- [8] A. Kathikar, A. Nair, B. Lazarine, A. Sachdeva and S. Samtani, “Assessing the Vulnerabilities of the Open-Source Artificial Intelligence (AI) Landscape: A Large-Scale Analysis of the Hugging Face Platform,” in Proc. 2023 IEEE International Conference on Intelligence and Security Informatics, Charlotte, NC, USA, 2023, pp.1-6, doi: 10.1109/ISI58743.2023.10297271.
Details
Primary Language
English
Subjects
Natural Language Processing
Journal Section
Research Article
Publication Date
June 30, 2025
Submission Date
December 11, 2024
Acceptance Date
May 5, 2025
Published in Issue
Year 2025 Number: 061
APA
Cizmeci, İ. H., & Gencer, K. (2025). Performance analysis of the most downloaded Turkish and English language models on the Hugging-Face platform. Journal of Scientific Reports-A, 061, 13-24. https://doi.org/10.59313/jsr-a.1599759
AMA
1.Cizmeci İH, Gencer K. Performance analysis of the most downloaded Turkish and English language models on the Hugging-Face platform. JSR-A. 2025;(061):13-24. doi:10.59313/jsr-a.1599759
Chicago
Cizmeci, İnayet Hakkı, and Kerem Gencer. 2025. “Performance Analysis of the Most Downloaded Turkish and English Language Models on the Hugging-Face Platform”. Journal of Scientific Reports-A, nos. 061: 13-24. https://doi.org/10.59313/jsr-a.1599759.
EndNote
Cizmeci İH, Gencer K (June 1, 2025) Performance analysis of the most downloaded Turkish and English language models on the Hugging-Face platform. Journal of Scientific Reports-A 061 13–24.
IEEE
[1]İ. H. Cizmeci and K. Gencer, “Performance analysis of the most downloaded Turkish and English language models on the Hugging-Face platform”, JSR-A, no. 061, pp. 13–24, June 2025, doi: 10.59313/jsr-a.1599759.
ISNAD
Cizmeci, İnayet Hakkı - Gencer, Kerem. “Performance Analysis of the Most Downloaded Turkish and English Language Models on the Hugging-Face Platform”. Journal of Scientific Reports-A. 061 (June 1, 2025): 13-24. https://doi.org/10.59313/jsr-a.1599759.
JAMA
1.Cizmeci İH, Gencer K. Performance analysis of the most downloaded Turkish and English language models on the Hugging-Face platform. JSR-A. 2025;:13–24.
MLA
Cizmeci, İnayet Hakkı, and Kerem Gencer. “Performance Analysis of the Most Downloaded Turkish and English Language Models on the Hugging-Face Platform”. Journal of Scientific Reports-A, no. 061, June 2025, pp. 13-24, doi:10.59313/jsr-a.1599759.
Vancouver
1.İnayet Hakkı Cizmeci, Kerem Gencer. Performance analysis of the most downloaded Turkish and English language models on the Hugging-Face platform. JSR-A. 2025 Jun. 1;(061):13-24. doi:10.59313/jsr-a.1599759