EN
Rating Performance among Raters of Different Experience Through Multi-Facet Rasch Measurement (MFRM) Model
Abstract
One’s experience can greatly contribute to a diversified rating performance in educational scoring. Heterogeneous ratings can negatively affect examinees’ results. The aim of the study is to examine raters’ rating performance in assessing oral tests among lower secondary school students using Multi-facet Rasch Measurement (MFRM) model indicated by raters’ severity. Respondents are thirty English Language teachers clustered into two groups based on their rating experience in high-stakes assessment. The respondents listened to ten examinees’ recorded answers of three oral test items and provided their ratings. Instruments include items, examinees’ answers, scoring rubric, and scoring sheet used to appraise examinees’ competence in three domains which are vocabulary, grammar, and communicative competence. MFRM analysis showed that raters exhibited diversity in their severity level with chi-square χ2=2.661. Raters’ severity measures ranged from 2.13 to -1.45 logits. Independent t-test indicated that there was a significant difference in ratings provided by the inexperienced and the experienced raters, t-value = -0.96, df = 28, p<0.01. The findings of this study suggest that assessment developers must ensure raters are well versed before they can rate examinees in operational settings gained through assessment practices or rater training. Further research is needed to account for the varying effects of rating experience in other assessment contexts and the effects of interaction between facets on estimates of examinees’ measures. The present study provides additional evidence with respect to the role of rating experience in inspiring raters to provide accurate ratings.
Keywords
References
- Ahmadi Shirazi, M. (2019). For a greater good: Bias analysis in writing assessment. SAGE Open, 9(1), 1–14. https://doi.org/10.1177/2158244018822377
- Albano, A. D., & Rodrigues, M. (2018). Item development research and practice. Handbook of Accessible Instruction and Testing Practices: Issues, Innovations, and Applications, 181–198. https://doi.org/10.1007/978-3-319-71126-3_12
- Alp, P., Epner, A., & Pajupuu, H. (2018). The influence of rater empathy, age and experience on writing performance assessment. Linguistics Beyond And Within, 3(2017), 7–19. Retrieved from https://www.ceeol.com/search/article-detail?id=716601
- Anthony, L., & Miriam, S. (2019). Drill in English Skills Practice: CEFR-Alligned Curriculum. Selangor: Oxford Fajar
- Attali, Y. (2016). A comparison of newly-trained and experienced raters on a standardized writing assessment. Language Testing, 33(1), 99–115. https://doi.org/10.1177/0265532215582283
- Bond, T. G., & Fox, C. M. (2015). Applying the Rasch Model Fundamental Measurement in the Human Sciences. New Jersey: Lawrence Erlbaum Associates.
- Creswell, J. W., & Creswell, J. D. (2018). Research Design: Qualitative, Quantitative, And Mixed Methods Approaches (5th ed.). California: Sage Publications.
- Davis, L. (2016). The influence of training and experience on rater performance in scoring spoken language. Language Testing, 33(1), 117–135. https://doi.org/10.1177/0265532215582282
Details
Primary Language
English
Subjects
-
Journal Section
Research Article
Authors
Publication Date
June 13, 2020
Submission Date
December 25, 2019
Acceptance Date
May 9, 2020
Published in Issue
Year 2020 Volume: 11 Number: 2
APA
Mohd Noh, M. F., & Mohd Matore, M. E. E. (2020). Rating Performance among Raters of Different Experience Through Multi-Facet Rasch Measurement (MFRM) Model. Journal of Measurement and Evaluation in Education and Psychology, 11(2), 147-162. https://doi.org/10.21031/epod.662964
AMA
1.Mohd Noh MF, Mohd Matore MEE. Rating Performance among Raters of Different Experience Through Multi-Facet Rasch Measurement (MFRM) Model. JMEEP. 2020;11(2):147-162. doi:10.21031/epod.662964
Chicago
Mohd Noh, Muhamad Firdaus, and Mohd Effendi Ewan Mohd Matore. 2020. “Rating Performance Among Raters of Different Experience Through Multi-Facet Rasch Measurement (MFRM) Model”. Journal of Measurement and Evaluation in Education and Psychology 11 (2): 147-62. https://doi.org/10.21031/epod.662964.
EndNote
Mohd Noh MF, Mohd Matore MEE (June 1, 2020) Rating Performance among Raters of Different Experience Through Multi-Facet Rasch Measurement (MFRM) Model. Journal of Measurement and Evaluation in Education and Psychology 11 2 147–162.
IEEE
[1]M. F. Mohd Noh and M. E. E. Mohd Matore, “Rating Performance among Raters of Different Experience Through Multi-Facet Rasch Measurement (MFRM) Model”, JMEEP, vol. 11, no. 2, pp. 147–162, June 2020, doi: 10.21031/epod.662964.
ISNAD
Mohd Noh, Muhamad Firdaus - Mohd Matore, Mohd Effendi Ewan. “Rating Performance Among Raters of Different Experience Through Multi-Facet Rasch Measurement (MFRM) Model”. Journal of Measurement and Evaluation in Education and Psychology 11/2 (June 1, 2020): 147-162. https://doi.org/10.21031/epod.662964.
JAMA
1.Mohd Noh MF, Mohd Matore MEE. Rating Performance among Raters of Different Experience Through Multi-Facet Rasch Measurement (MFRM) Model. JMEEP. 2020;11:147–162.
MLA
Mohd Noh, Muhamad Firdaus, and Mohd Effendi Ewan Mohd Matore. “Rating Performance Among Raters of Different Experience Through Multi-Facet Rasch Measurement (MFRM) Model”. Journal of Measurement and Evaluation in Education and Psychology, vol. 11, no. 2, June 2020, pp. 147-62, doi:10.21031/epod.662964.
Vancouver
1.Muhamad Firdaus Mohd Noh, Mohd Effendi Ewan Mohd Matore. Rating Performance among Raters of Different Experience Through Multi-Facet Rasch Measurement (MFRM) Model. JMEEP. 2020 Jun. 1;11(2):147-62. doi:10.21031/epod.662964
Cited By
A cross-age odyssey of cognitive reading attributes: a scoping review
Frontiers in Psychology
https://doi.org/10.3389/fpsyg.2025.1605898Charting the trajectory of cognitive diagnostic assessment in language assessment: A systematic literature review
International Journal of Assessment Tools in Education
https://doi.org/10.21449/ijate.1533622