Research Article

An Empirical Study for the Statistical Adjustment of Rater Bias

Volume: 6 Number: 2 July 15, 2019
EN TR

An Empirical Study for the Statistical Adjustment of Rater Bias

Abstract

This study investigated the effectiveness of statistical adjustments applied to rater bias in many-facet Rasch analysis. Some changes were first made in the dataset that did not include rater × examinee bias to cause to have rater × examinee bias. Later, bias adjustment was applied to rater bias included in the data file, and the effectiveness of the statistical adjustment was further examined. The outcomes pertaining to the datasets with and without bias, and to which the bias adjustment was applied, were compared. It was concluded that diversities created by rater × examinee bias in examinees’ ability estimation, item difficulty indices and measures of rater severity and leniency were, to a large extent, eliminated by bias adjustment. This result indicates that the bias adjustment using many-facet Rasch analysis is a viable way to control rater bias.

Keywords

References

  1. Aubin, A. S., St-Onge, C., & Renaud, J. S. (2018). Detecting rater bias using a person-fit statistic: A Monte Carlo simulation study. Perspectives on Medical Education, 7(2), 83-92. http://dx.doi.org/10.1007/s40037-017-0391-8
  2. Bailey, K. (1994). Methods of social research. New York: The Free.
  3. Bennett, R. E. (1991). On the meanings of constructed response. ETS Research Report Series, 2, 1-46. http://dx.doi.org/10.1002/j.2333-8504.1991.tb01429.x
  4. Bennett, R. E., Ward, W. C., Rock, D. A., & LaHart, C. (1990). Toward a framework for constructed response items. ETS Research Report Series, 1, 1 - 29. http://dx.doi.org/10.1002/j.2333-8504.1990.tb01348.x
  5. Connaway, L. S., & Powell, R. R. (2010). Basic research methods for librarians. Santa Barbara, CA: Libraries Unlimited.
  6. DeMars, C. (2010). Item response theory. Oxford, UK: Oxford University.
  7. Eckes, T. (2005). Examining rater effects in TestDaF writing and speaking performance assessments: A many-facet Rasch analysis. Language Assessment Quarterly, 2(3), 197-221. http://dx.doi.org/10.1207/s15434311laq0203_2
  8. Fahim, M., & Bijani, H. (2011). The effects of rater training on raters’ severity and bias in second language writing assessment. Iranian Journal of Language Testing, 1(1), 1-16. Retrieved from http://www.ijlt.ir/portal/files/401-2011-01-01.pdf

Details

Primary Language

English

Subjects

Studies on Education

Journal Section

Research Article

Publication Date

July 15, 2019

Submission Date

February 28, 2019

Acceptance Date

April 27, 2019

Published in Issue

Year 2019 Volume: 6 Number: 2

APA
İlhan, M. (2019). An Empirical Study for the Statistical Adjustment of Rater Bias. International Journal of Assessment Tools in Education, 6(2), 193-201. https://doi.org/10.21449/ijate.533517

Cited By

23823             23825             23824