Research Article

The difference between estimated and perceived item difficulty: An empirical study

Volume: 11 Number: 2 June 20, 2024
EN TR

The difference between estimated and perceived item difficulty: An empirical study

Abstract

Test development is a complicated process that demands examining various factors, one of them being writing items of varying difficulty. It is important to use items of a different range of difficulty to ensure that the test results accurately indicate the test-taker's abilities. Therefore, the factors affecting item difficulty should be defined, and item difficulties should be estimated before testing. This study aims to investigate the factors that affect estimated and perceived item difficulty in the High School Entrance Examination in Türkiye and to improve estimation accuracy by giving feedback to the experts. The study started with estimating item difficulty for 40 items belonging to reading comprehension, grammar, and reasoning based on data. Then, the experts' predictions were compared with the estimated item difficulty and feedback was provided to improve the accuracy of their predictions. The study found that some item features (e.g., length and readability) did not affect the estimated difficulty but affected the experts' item difficulty perceptions. Based on these results, the study concludes that providing feedback to experts can improve the factors affecting their item difficulty estimates. So, it can help improve the quality of future tests and provide feedback to experts to improve their ability to estimate item difficulty accurately.

Keywords

Ethical Statement

This research was presented as an oral presentation at the NCME 2023 congress APRIL 12-15, 2023 - CHICAGO, IL, USA.

References

  1. Aljehani, D.K., Pullishery, F., Osman, O., & Abuzenada, B.M. (2020). Relationship of text length of multiple-choice questions on item psychometric properties–A retrospective study. Saudi J Health Sci, 9, 84-87. https://doi.org/10.4103/sjhs.sjhs_76_20
  2. AlKhuzaey, S., Grasso, F., Payne, T.R., & Tamma, V. (2021). A Systematic Review of Data-Driven Approaches to Item Difficulty Prediction. In I. Roll, D. McNamara, S. Sosnovsky, R. Luckin, & V. Dimitrova, Artificial Intelligence in Education Cham. https://doi.org/10.1007/978-3-030-78292-4_3
  3. Allalouf, A., Hambleton, R., & Sireci, S. (1999). Identifying the causes of dif in translated verbal items. Journal of Educational Measurement, 36(3), 185 198. https://doi.org/10.1111/j.1745-3984.1999.tb00553.x
  4. Attali, Y., Saldivia, L., Jackson, C., Schuppan, F., & Wanamaker, W. (2014). Estimating item difficulty with comparative judgments. ETS Research Report Series, 2014(2), 1-8. https://doi.org/10.1002/ets2.12042
  5. Bejar, I.I. (1983). Subject matter experts' assessment of item statistics. Applied Psychological Measurement, 7(3), 303-310. https://doi.org/10.1002/j.2333-8504.1981.tb01274.x
  6. Benton, T. (2020). How Useful Is Comparative Judgement of Item Difficulty for Standard Maintaining? Research Matters, 29, 27-35.
  7. Berenbon, R., & McHugh, B. (2023). Do subject matter experts’ judgments of multiple‐choice format suitability predict item quality?. Educational Measurement Issues and Practice, 42(3), 13-21. https://doi.org/10.1111/emip.12570
  8. Berk, R.A. (1986). A consumer’s guide to setting performance standards on criterion-referenced tests. Review of Educational Research, 56(1), 137 172. https://doi.org/10.3102/00346543056001137

Details

Primary Language

English

Subjects

Measurement Theories and Applications in Education and Psychology

Journal Section

Research Article

Early Pub Date

May 22, 2024

Publication Date

June 20, 2024

Submission Date

October 15, 2023

Acceptance Date

May 2, 2024

Published in Issue

Year 2024 Volume: 11 Number: 2

APA
Sayın, A., & Bulut, O. (2024). The difference between estimated and perceived item difficulty: An empirical study. International Journal of Assessment Tools in Education, 11(2), 368-387. https://doi.org/10.21449/ijate.1376160

Cited By

23823             23825             23824