Research Article

Comparison of Test Equating Methods Based on Classical Test Theory and Item Response Theory

Volume: 36 Number: 3 December 15, 2023

Abstract

This study aims to identify the equating method with the least equating error among the equating methods of Classical Test Theory (CTT) and Item Response Theory (IRT). Data from booklets 1 and 3 of the PISA (Programme for International Student Assessment) 2012 mathematics test were used, drawn from Turkey, Indonesia, Shanghai/China, and Finland, all of which participated in PISA 2012. Equating was carried out under the non-equivalent groups design. The CTT methods comprised linear equating methods [Tucker (w1 = 1 and w1 = 0.5), Levine observed score (w1 = 1 and w1 = 0.5), Levine true score, classical congeneric, and Braun-Holland] and equipercentile equating methods [presmoothing with a polynomial of degree C = 6, beta4 presmoothing, postsmoothing with a cubic function at S = 0.05, and frequency estimation (w1 = 1 and w1 = 0.5)]. Within CTT, the least error was obtained from the frequency estimation method with a synthetic universe weight of w1 = 0.5. For IRT, the calibration method was decided first, and the Stocking-Lord method was chosen. After the scale transformation was carried out with the Stocking-Lord method, equated scores were calculated with the IRT true score and IRT observed score equating methods. Within IRT, the least error was obtained from true score equating. Equating error coefficients were calculated with the Newton-Raphson delta method and with the bootstrap method. When the error coefficients (delta and bootstrap) of the equating methods in both theories were compared, the IRT-based methods showed less error than the CTT methods, and the method with the least equating error overall was IRT true score equating. Within CTT, frequency estimation (w1 = 0.5) produced the least equating error and Levine true score equating produced the most.
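
As a rough illustration of the IRT true score equating named in the abstract, the sketch below maps a true score on one form to the equivalent true score on another form through the ability scale. It is not the authors' implementation. It assumes a two-parameter logistic (2PL) model without guessing, item parameters that are already on a common scale (for example after a Stocking-Lord transformation), and made-up item parameters; every function name is hypothetical. Newton-Raphson is used here only to find the ability value that corresponds to a given true score.

```python
import math

def p_2pl(theta, a, b):
    """Probability of a correct response under the 2PL model."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def true_score(theta, items):
    """Test characteristic curve: expected number-correct score at theta."""
    return sum(p_2pl(theta, a, b) for a, b in items)

def true_score_slope(theta, items):
    """Derivative of the test characteristic curve with respect to theta."""
    total = 0.0
    for a, b in items:
        p = p_2pl(theta, a, b)
        total += a * p * (1.0 - p)
    return total

def theta_for_true_score(score, items, theta=0.0, tol=1e-8, max_iter=100):
    """Newton-Raphson search for the theta whose true score equals `score`."""
    if not 0.0 < score < len(items):
        raise ValueError("score must lie strictly between 0 and the number of items")
    for _ in range(max_iter):
        diff = true_score(theta, items) - score
        if abs(diff) < tol:
            return theta
        step = diff / true_score_slope(theta, items)
        # Damp large steps so the search stays in a sensible theta range.
        theta -= max(-1.0, min(1.0, step))
    raise RuntimeError("Newton-Raphson did not converge")

def irt_true_score_equate(score_x, items_x, items_y):
    """Form Y true score equivalent of a Form X true score."""
    theta = theta_for_true_score(score_x, items_x)
    return true_score(theta, items_y)

# Hypothetical (a, b) item parameters, assumed to be on a common scale.
form_x = [(1.0, -1.0), (1.2, 0.0), (0.8, 1.0)]
form_y = [(a, b + 0.5) for a, b in form_x]  # a uniformly harder form

equated = irt_true_score_equate(1.5, form_x, form_y)
print(f"Form X true score 1.5 corresponds to Form Y true score {equated:.3f}")
```

Because Form Y is harder in this toy setup, a given Form X true score maps to a lower Form Y true score. With the 3PL model, true scores below the sum of the guessing parameters have no corresponding ability value and would need separate handling.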

Details

Primary Language

English

Subjects

Measurement Theories and Applications in Education and Psychology

Journal Section

Research Article

Early Pub Date

October 30, 2023

Publication Date

December 15, 2023

Submission Date

July 10, 2023

Acceptance Date

September 22, 2023

Published in Issue

Year 2023 Volume: 36 Number: 3

APA
Mutluer, C., & Çakan, M. (2023). Comparison of Test Equating Methods Based on Classical Test Theory and Item Response Theory. Journal of Uludag University Faculty of Education, 36(3), 866-906. https://doi.org/10.19171/uefad.1325587

Journal of Uludag University Faculty of Education ©2025 by Bursa Uludag University is licensed under CC BY-NC 4.0