Research Article

A Comparison of the efficacies of differential item functioning detection methods

Volume: 10 Number: 1 March 20, 2023
EN TR

A Comparison of the efficacies of differential item functioning detection methods

Abstract

To ensure the validity of the tests is to check that all items have similar results across different groups of individuals. However, differential item functioning (DIF) occurs when the results of individuals with equal ability levels from different groups differ from each other on the same test item. Based on Item Response Theory and Classic Test Theory, there are some methods, with different advantages and limitations to identify items that show DIF. This study aims to compare the performances of five methods for detecting DIF. The efficacies of Mantel-Haenszel (MH), Logistic Regression (LR), Crossing simultaneous item bias test (CSIBTEST), Lord's chi-square (LORD), and Raju's area measure (RAJU) methods are examined considering conditions of the sample size, DIF ratio, and test length. In this study, to compare the detection methods, power and Type I error rates are evaluated using a simulation study with 100 replications conducted for each condition. Results show that LR and MH have the lowest Type I error and the highest power rate in detecting uniform DIF. In addition, CSIBTEST has a similar power rate to MH and LR. Under DIF conditions, sample size, DIF ratio, test length and their interactions affect Type I error and power rates.

Keywords

References

  1. Apinyapibal, S., Lawthong, N., & Kanjanawasee, S. (2015). A comparative analysis of the efficacy of differential item functioning detection for dichotomously scored items among logistic regression, SIBTEST and raschtree methods. Procedia-Social and Behavioral Sciences, 191, 21-25. https://doi.org/10.1016/j.sbspro.2015.04.664
  2. Atalay Kabasakal, K., Arsan, N., Gök, B., & Kelecioğlu, H. (2014). Comparing performances (type I error and power) of IRT likelihood ratio SIBTEST and Mantel-Haenszel methods in the determination of differential item functioning, Educational Sciences: Theory and Practice, 14(6), 2175-2193. https://doi.org/10.12738/estp.2014.6.2165
  3. Atar, B. (2007). Differential item functioning analyses for mixed response data using IRT likelihood-ratio test, logistic regression, and GLLAMM procedures [Unpublished doctoral dissertation]. University of Florida State.
  4. Ayva Yörü, F.G., & Atar, H.Y. (2019). Determination of differential item functioning (DIF) according to SIBTEST, Lord's [Chi-squared], Raju's area measurement and Breslow-Day Methods. Journal of Pedagogical Research, 3(3), 139 150. https://doi.org/10.33902/jpr.v3i3.137
  5. Camilli, G., & Shepard, L.A. (1994). Methods for identifying biased test items. Sage Publications.
  6. Camilli, G. (2006). Test fairness. In R.L. Brennan (Ed), Educational Measurement (4th ed., pp. 221–257). Rowman & Littlefield.
  7. De Ayala, R.J. (2009). The theory and practice of item response theory. The Guilford Press.
  8. DeMars, C.E. (2009). Modification of the Mantel-Haenszel and logistic regression DIF procedures to incorporate the SIBTEST regression correction. Journal of Educational and Behavioral Statistics, 34, 149-170. https://doi.org/10.3102/1076998607313923

Details

Primary Language

English

Subjects

Other Fields of Education

Journal Section

Research Article

Publication Date

March 20, 2023

Submission Date

June 24, 2022

Acceptance Date

March 2, 2023

Published in Issue

Year 2023 Volume: 10 Number: 1

APA
Başman, M. (2023). A Comparison of the efficacies of differential item functioning detection methods. International Journal of Assessment Tools in Education, 10(1), 145-159. https://doi.org/10.21449/ijate.1135368

Cited By

23823             23825             23824