Research Article

Rating Performance among Raters of Different Experience Through Multi-Facet Rasch Measurement (MFRM) Model

Volume: 11 Number: 2 June 13, 2020
EN

Rating Performance among Raters of Different Experience Through Multi-Facet Rasch Measurement (MFRM) Model

Abstract

One’s experience can greatly contribute to a diversified rating performance in educational scoring. Heterogeneous ratings can negatively affect examinees’ results. The aim of the study is to examine raters’ rating performance in assessing oral tests among lower secondary school students using Multi-facet Rasch Measurement (MFRM) model indicated by raters’ severity. Respondents are thirty English Language teachers clustered into two groups based on their rating experience in high-stakes assessment. The respondents listened to ten examinees’ recorded answers of three oral test items and provided their ratings. Instruments include items, examinees’ answers, scoring rubric, and scoring sheet used to appraise examinees’ competence in three domains which are vocabulary, grammar, and communicative competence. MFRM analysis showed that raters exhibited diversity in their severity level with chi-square χ2=2.661. Raters’ severity measures ranged from 2.13 to -1.45 logits. Independent t-test indicated that there was a significant difference in ratings provided by the inexperienced and the experienced raters, t-value = -0.96, df = 28, p<0.01. The findings of this study suggest that assessment developers must ensure raters are well versed before they can rate examinees in operational settings gained through assessment practices or rater training. Further research is needed to account for the varying effects of rating experience in other assessment contexts and the effects of interaction between facets on estimates of examinees’ measures. The present study provides additional evidence with respect to the role of rating experience in inspiring raters to provide accurate ratings.

Keywords

References

  1. Ahmadi Shirazi, M. (2019). For a greater good: Bias analysis in writing assessment. SAGE Open, 9(1), 1–14. https://doi.org/10.1177/2158244018822377
  2. Albano, A. D., & Rodrigues, M. (2018). Item development research and practice. Handbook of Accessible Instruction and Testing Practices: Issues, Innovations, and Applications, 181–198. https://doi.org/10.1007/978-3-319-71126-3_12
  3. Alp, P., Epner, A., & Pajupuu, H. (2018). The influence of rater empathy, age and experience on writing performance assessment. Linguistics Beyond And Within, 3(2017), 7–19. Retrieved from https://www.ceeol.com/search/article-detail?id=716601
  4. Anthony, L., & Miriam, S. (2019). Drill in English Skills Practice: CEFR-Alligned Curriculum. Selangor: Oxford Fajar
  5. Attali, Y. (2016). A comparison of newly-trained and experienced raters on a standardized writing assessment. Language Testing, 33(1), 99–115. https://doi.org/10.1177/0265532215582283
  6. Bond, T. G., & Fox, C. M. (2015). Applying the Rasch Model Fundamental Measurement in the Human Sciences. New Jersey: Lawrence Erlbaum Associates.
  7. Creswell, J. W., & Creswell, J. D. (2018). Research Design: Qualitative, Quantitative, And Mixed Methods Approaches (5th ed.). California: Sage Publications.
  8. Davis, L. (2016). The influence of training and experience on rater performance in scoring spoken language. Language Testing, 33(1), 117–135. https://doi.org/10.1177/0265532215582282

Details

Primary Language

English

Subjects

-

Journal Section

Research Article

Publication Date

June 13, 2020

Submission Date

December 25, 2019

Acceptance Date

May 9, 2020

Published in Issue

Year 2020 Volume: 11 Number: 2

APA
Mohd Noh, M. F., & Mohd Matore, M. E. E. (2020). Rating Performance among Raters of Different Experience Through Multi-Facet Rasch Measurement (MFRM) Model. Journal of Measurement and Evaluation in Education and Psychology, 11(2), 147-162. https://doi.org/10.21031/epod.662964
AMA
1.Mohd Noh MF, Mohd Matore MEE. Rating Performance among Raters of Different Experience Through Multi-Facet Rasch Measurement (MFRM) Model. JMEEP. 2020;11(2):147-162. doi:10.21031/epod.662964
Chicago
Mohd Noh, Muhamad Firdaus, and Mohd Effendi Ewan Mohd Matore. 2020. “Rating Performance Among Raters of Different Experience Through Multi-Facet Rasch Measurement (MFRM) Model”. Journal of Measurement and Evaluation in Education and Psychology 11 (2): 147-62. https://doi.org/10.21031/epod.662964.
EndNote
Mohd Noh MF, Mohd Matore MEE (June 1, 2020) Rating Performance among Raters of Different Experience Through Multi-Facet Rasch Measurement (MFRM) Model. Journal of Measurement and Evaluation in Education and Psychology 11 2 147–162.
IEEE
[1]M. F. Mohd Noh and M. E. E. Mohd Matore, “Rating Performance among Raters of Different Experience Through Multi-Facet Rasch Measurement (MFRM) Model”, JMEEP, vol. 11, no. 2, pp. 147–162, June 2020, doi: 10.21031/epod.662964.
ISNAD
Mohd Noh, Muhamad Firdaus - Mohd Matore, Mohd Effendi Ewan. “Rating Performance Among Raters of Different Experience Through Multi-Facet Rasch Measurement (MFRM) Model”. Journal of Measurement and Evaluation in Education and Psychology 11/2 (June 1, 2020): 147-162. https://doi.org/10.21031/epod.662964.
JAMA
1.Mohd Noh MF, Mohd Matore MEE. Rating Performance among Raters of Different Experience Through Multi-Facet Rasch Measurement (MFRM) Model. JMEEP. 2020;11:147–162.
MLA
Mohd Noh, Muhamad Firdaus, and Mohd Effendi Ewan Mohd Matore. “Rating Performance Among Raters of Different Experience Through Multi-Facet Rasch Measurement (MFRM) Model”. Journal of Measurement and Evaluation in Education and Psychology, vol. 11, no. 2, June 2020, pp. 147-62, doi:10.21031/epod.662964.
Vancouver
1.Muhamad Firdaus Mohd Noh, Mohd Effendi Ewan Mohd Matore. Rating Performance among Raters of Different Experience Through Multi-Facet Rasch Measurement (MFRM) Model. JMEEP. 2020 Jun. 1;11(2):147-62. doi:10.21031/epod.662964

Cited By