Research Article

Type I error and power rates: A comparative analysis of techniques in differential item functioning

Volume: 10 Number: 4 December 23, 2023
EN TR

Type I error and power rates: A comparative analysis of techniques in differential item functioning

Abstract

The main purpose of this study is to examine the Type I error and statistical power ratios of Differential Item Functioning (DIF) techniques based on different theories under different conditions. For this purpose, a simulation study was conducted by using Mantel-Haenszel (MH), Logistic Regression (LR), Lord’s χ2, and Raju’s Areas Measures techniques. In the simulation-based research model, the two-parameter item response model, group’s ability distribution, and DIF type were the fixed conditions while sample size (1800, 3000), rates of sample size (0.50, 1), test length (20, 80) and DIF- containing item rate (0, 0.05, 0.10) were manipulated conditions. The total number of conditions is 24 (2x2x2x3), and statistical analysis was performed in the R software. The current study found that the Type I error rates in all conditions were higher than the nominal error level. It was also demonstrated that MH had the highest error rate while Raju’s Areas Measures had the lowest error rate. Also, MH produced the highest statistical power rates. The analysis of the findings of Type 1 error and statistical power rates illustrated that techniques based on both of the theories performed better in the 1800 sample size. Furthermore, the increase in the sample size affected techniques based on CTT rather than IRT. Also, the findings demonstrated that the techniques’ Type 1 error rates were lower while their statistical power rates were higher under conditions where the test length was 80, and the sample sizes were not equal.

Keywords

References

  1. Ankenmann, R.D., Witt, E.A., & Dunbar, S.B. (1999). An investigation of the power of the likelihood ratio goodness-of-fit statistics in detecting differential item functioning. Journal of Educational Measurement, 36(4), 277–300. https://doi.org/10.1111/j.1745-3984.1999.tb00558.x
  2. Atalay Kabasakal, K., Arsan, N., Gök, B., & Kelecioğlu, H. (2014). Değişen madde fonksiyonunun belirlenmesinde MTK olabilirlik oranı, SIBTEST ve mantel- haenszel yöntemlerinin performanslarının (I. Tip hata ve güç) karşılaştırılması [Comparison of the performance (Type I error and power) of the IRT likelihood ratio, SIBTEST, and mantel-haenszel techniques in determining the differential item functioning]. Educational Sciences: Theory & Practice, 14(6), 2175- 2193.
  3. Atar, B. (2007). Differential item functioning analyses for mixed response data using IRT likelihood-ratio test, logistic regression, and GLLAMM procedures (FSU_migr_etd-0248) [Doctoral dissertation, Florida State University]. http://purl.flvc.org/fsu/fd/FSU_migr_etd-0248.
  4. Atar, B., & Kamata, A. (2011). Comparison of IRT likelihood ratio test and logistic regression DIF detection procedures. Hacettepe University Journal of Education, (41), 36–47.
  5. Basman, M. (2023). A comparison of the efficacies of differential item functioning detection methods. International Journal of Assessment Tools in Education, 10(1), 145-159. https://doi.org/10.21449/ijate.1135368
  6. Bradley, J.V. (1978). Robustness. British Journal of Mathematical and Statistical Psychology, 31(2), 144–152. http://dx.doi.org/10.1111/j.2044-8317.1978.tb00581.x
  7. Camilli, G., & Shepard, L.A. (1994). Methods for identifying biased test items. Sage Publications.
  8. Clauser, B.E., & Mazor, K.M. (1998). Using statistical procedure to identify differential item functioning test items. Educational Measurement: Issues and Practice, 17(1), 31‐44. https://doi.org/10.1111/j.1745-3992.1998.tb00619.x

Details

Primary Language

English

Subjects

Measurement Theories and Applications in Education and Psychology , Similation Study

Journal Section

Research Article

Publication Date

December 23, 2023

Submission Date

September 29, 2023

Acceptance Date

December 9, 2023

Published in Issue

Year 2023 Volume: 10 Number: 4

APA
Bilicioğlu Güneş, A., & Bıçak, B. (2023). Type I error and power rates: A comparative analysis of techniques in differential item functioning. International Journal of Assessment Tools in Education, 10(4), 781-795. https://doi.org/10.21449/ijate.1368341

23823             23825             23824