Assessing second-language academic writing: AI vs. Human raters
Details
Primary Language: English
Subjects: Instructional Technologies
Journal Section: Research Article
Authors:
Vasfiye Geckin* (ORCID 0000-0001-8532-8627), Türkiye
Ebru Kızıltaş (ORCID 0000-0002-1275-9327), Türkiye
Çağatay Çınar (ORCID 0009-0007-3852-3658), Türkiye
Publication Date: December 31, 2023
Submission Date: August 2, 2023
Acceptance Date: October 21, 2023
Published in Issue: Year 2023, Volume 6, Number 4
Cited By
Leveraging ChatGPT for Second Language Writing Feedback and Assessment
International Journal of Computer-Assisted Language Learning and Teaching
https://doi.org/10.4018/IJCALLT.360382

Evaluating the quality of AI feedback: A comparative study of AI and human essay grading
Innovations in Education and Teaching International
https://doi.org/10.1080/14703297.2024.2437122

Exploring the Landscape of Generative AI (ChatGPT)-Powered Writing Instruction in English as a Foreign Language Education: A Scoping Review
ECNU Review of Education
https://doi.org/10.1177/20965311241310881

A systematic literature review on the application of generative artificial intelligence (GAI) in teaching within higher education: Instructional contexts, process, and strategies
The Internet and Higher Education
https://doi.org/10.1016/j.iheduc.2025.100996

Exploring the potential use of generative ai for learner support in ODL at scale
Journal of Educational Technology and Online Learning
https://doi.org/10.31681/jetol.1559442

ChatGPT: A reliable assistant for the evaluation of students’ written texts?
Education and Information Technologies
https://doi.org/10.1007/s10639-025-13553-1

GenAI in Academic Writing- Empowering Learners or Redefining Traditional Pedagogical Practices?
International Journal of Artificial Intelligence
https://doi.org/10.4018/IJAITL.373582

Investigating a customized generative AI chatbot for automated essay scoring in a disciplinary writing task
Assessing Writing
https://doi.org/10.1016/j.asw.2025.100959

Comparing AI and Human Assessment of Academic Writing Skills: A Kappa Analysis
E3S Web of Conferences
https://doi.org/10.1051/e3sconf/202564506014

Evaluating the scoring system of an AI-integrated app to assess foreign language phonological decoding
Research Methods in Applied Linguistics
https://doi.org/10.1016/j.rmal.2025.100257

Can AI Assess Writing Skills Like a Human? A Reliability Analysis
Kuramsal Eğitimbilim
https://doi.org/10.30831/akukeg.1718511

GenAI and human assessments of L2 Chinese writing: Interrater reliability and rater bias
Assessing Writing
https://doi.org/10.1016/j.asw.2025.100989

AI and human scoring for postgraduate writing: Evaluating score reliability, variability, and rater behaviours
Studies in Educational Evaluation
https://doi.org/10.1016/j.stueduc.2026.101572

Can ChatGPT score ESL writing? A correlation analysis between teacher and GenAI scores
Language Teaching Research
https://doi.org/10.1177/13621688261415584

Human and AI Scoring of EFL Writing: The Influence of Rubrics and Genre on Reliability
Eğitim ve Yeni Yaklaşımlar Dergisi
https://doi.org/10.52974/jena.1785369

Hey AI, Are You Sure? Analyzing Reflective Prompting Attempts in AI-Based Writing Assessment
Sakarya University Journal of Education
https://doi.org/10.19126/suje.1713879

Discrepancies Between ChatGPT and Vietnamese EFL Teachers in Writing Assessment
International Journal of AI in Language Education
https://doi.org/10.54855/ijaile.26311

Educational Measurement with Emerging Technologies: A Systematic Review Through Evidentiary Lens on Granularity and Constructing Measures Theory
Education Sciences
https://doi.org/10.3390/educsci16040661

Evaluating GPT ratings of EFL writing: A scoping review
Assessing Writing
https://doi.org/10.1016/j.asw.2026.101044

Yapay Zekâ Destekli İletişim Araçlarıyla Üretilen Verilerin Akademik Araştırmalarda Kullanımı
SELÇUK ÜNİVERSİTESİ İLETİŞİM FAKÜLTESİ AKADEMİK DERGİSİ
https://doi.org/10.18094/josc.1690737

Educational Approaches of Generative Artificial Intelligence Use in the Creation, Checking and Feedback of Student Written Assignments in University Settings
European Journal of Engineering and Technology Research
https://doi.org/10.24018/ejeng.2025.1.CIE.70024

"It's Safer to Say I Don't Use It": Exploring ESL Students' Self-Perceptions as AI-Assisted Writers
Proceedings of the ACM on Human-Computer Interaction
https://doi.org/10.1145/3799448

Mapping the landscape: generative AI in higher education assessment (2020-2024) - a scoping review
Interactive Learning Environments
https://doi.org/10.1080/10494820.2026.2614079

Implementations Of Generative Artificial Intelligence Tools Within The Contexts Of English Language Teaching And Learning: A Systematic Review
Instructional Technology and Lifelong Learning
https://doi.org/10.52911/itall.1696814

ChatGPT-4o as an automated scoring tool for writing assessment: Strengths and weaknesses
International Journal of Assessment Tools in Education
https://doi.org/10.21449/ijate.1701871