Research Article

Comparing Human and AI Judgments of Turkish EFL Learners’ Intelligibility

Volume: 39 Number: 1 April 30, 2026
EN TR

Comparing Human and AI Judgments of Turkish EFL Learners’ Intelligibility

Abstract

This study investigates how the intelligibility of Turkish learners of English as a foreign language (EFL) is assessed by both human raters and artificial intelligence (AI), specifically ChatGPT-4, across two different speaking tasks: a controlled read-aloud passage and a spontaneous picture-description task. Drawing on intelligibility-focused pronunciation research, the study aims to explore how task type, rater type, and pronunciation features (segmental and suprasegmental) affect intelligibility ratings. 30 intermediate-level Turkish learners of English completed both tasks, and their recordings were evaluated by three native English-speaking human raters and an AI model. Quantitative results showed that spontaneous speech received higher intelligibility scores than the read-aloud task, despite including more segmental errors. Suprasegmental features such as rhythm, stress, and phrasing played a greater role in determining intelligibility across tasks. While AI ratings closely matched human judgments in most cases, discrepancies emerged, particularly in samples where prosodic nuances were critical. Qualitative analysis further revealed that both rater types frequently flagged vowel distortions, stress misplacement, and a lack of rhythmic cohesion as common intelligibility detractors. The findings underscore the importance of integrating suprasegmental instruction in EFL pronunciation pedagogy and highlight the potential role of AI tools in supporting intelligibility assessment. Nevertheless, when it comes to assessing natural, prosody-rich speech, human judgment is still crucial. By providing empirical evidence from a Turkish EFL environment and linking L2 pronunciation research with emerging technology, this study contributes to applied linguistics.

Keywords

Ethical Statement

Ethics committee approval was obtained for the research at the meeting of Ataturk University Social and Human Sciences Ethics Committee numbered E.88656144-000-2500199482.

References

  1. Aoyama, K., & Guion, S. G. (2007). Prosody in second language acquisition: Acoustic analyses of duration and F0 range. In O.-S. Bohn & M. J. Munro (Eds.), Language experience in second-language speech learning (pp. 282–297). John Benjamins.
  2. Babaeian, A. (2023). Pronunciation assessment: Traditional vs modern modes. Journal of Education For Sustainable Innovation, 1(1), 61-68. https://doi.org/10.56916/jesi.v1i1.530
  3. Bayraktaroğlu, S. (2008). Orthographic interference and the teaching of British pronunciation to Turkish learners. Journal of Language and Linguistic Studies, 4(2), 1-36.
  4. Celce Murcia, M., Brinton, D. M., Goodwin, J. M., & Griner, B. (2010). Teaching pronunciation: A course book and reference guide (2nd ed.). Cambridge University Press.
  5. Chau, T., Huensch, A., Hoang, Y. K., & Chau, H. T. (2022). The effects of L2 pronunciation instruction on EFL learners’ intelligibility and fluency in spontaneous speech. TESL EJ, 25(4), n4.
  6. Demirezen, M. (2005). Rehabilitating a fossilized pronunciation error: The/v/and/w/contrast by using the audio-articulation method in teacher training in Türkiye. Journal of Language and Linguistic Studies, 1(2), 183-192.
  7. Derwing, T. M., & Munro, M. J. (1995). Foreign accent, comprehensibility, and intelligibility in L2 learner speech. Language Learning, 45(1), 73–97. https://doi.org/10.1111/j.1467-1770.1995.tb00963.x
  8. Derwing, T. M., & Munro, M. J. (1997). Accent, intelligibility, and comprehensibility: Evidence from four L1s. Studies in second language acquisition, 19(1), 1-16. https://doi.org/10.1017/S0272263197001010

Details

Primary Language

English

Subjects

English As A Second Language, Applied Linguistics and Educational Linguistics

Journal Section

Research Article

Publication Date

April 30, 2026

Submission Date

June 30, 2025

Acceptance Date

September 26, 2025

Published in Issue

Year 2026 Volume: 39 Number: 1

APA
Deveci, C. (2026). Comparing Human and AI Judgments of Turkish EFL Learners’ Intelligibility. Journal of Uludag University Faculty of Education, 39(1), 33-49. https://doi.org/10.19171/uefad.1728485
AMA
1.Deveci C. Comparing Human and AI Judgments of Turkish EFL Learners’ Intelligibility. Journal of Uludag University Faculty of Education. 2026;39(1):33-49. doi:10.19171/uefad.1728485
Chicago
Deveci, Canan. 2026. “Comparing Human and AI Judgments of Turkish EFL Learners’ Intelligibility”. Journal of Uludag University Faculty of Education 39 (1): 33-49. https://doi.org/10.19171/uefad.1728485.
EndNote
Deveci C (April 1, 2026) Comparing Human and AI Judgments of Turkish EFL Learners’ Intelligibility. Journal of Uludag University Faculty of Education 39 1 33–49.
IEEE
[1]C. Deveci, “Comparing Human and AI Judgments of Turkish EFL Learners’ Intelligibility”, Journal of Uludag University Faculty of Education, vol. 39, no. 1, pp. 33–49, Apr. 2026, doi: 10.19171/uefad.1728485.
ISNAD
Deveci, Canan. “Comparing Human and AI Judgments of Turkish EFL Learners’ Intelligibility”. Journal of Uludag University Faculty of Education 39/1 (April 1, 2026): 33-49. https://doi.org/10.19171/uefad.1728485.
JAMA
1.Deveci C. Comparing Human and AI Judgments of Turkish EFL Learners’ Intelligibility. Journal of Uludag University Faculty of Education. 2026;39:33–49.
MLA
Deveci, Canan. “Comparing Human and AI Judgments of Turkish EFL Learners’ Intelligibility”. Journal of Uludag University Faculty of Education, vol. 39, no. 1, Apr. 2026, pp. 33-49, doi:10.19171/uefad.1728485.
Vancouver
1.Canan Deveci. Comparing Human and AI Judgments of Turkish EFL Learners’ Intelligibility. Journal of Uludag University Faculty of Education. 2026 Apr. 1;39(1):33-49. doi:10.19171/uefad.1728485


Journal of Uludag University Faculty of Education ©2025 by Bursa Uludag University is licensed under CC BY-NC 4.0