Research Article

A Comparison of IRT Vertical Scaling Methods in Determining of the Increase in Achievement of Science Education

Volume: 8 Number: 1 April 3, 2017
EN TR

A Comparison of IRT Vertical Scaling Methods in Determining of the Increase in Achievement of Science Education

Abstract

This study is based on a vertical scaling implemented with reference to the Item Response Theory, and involves a comparison of vertical scaling results obtained through the application of proficiency estimation methods and calibration methods. The vertical scales thus developed were assessed with reference to the criteria of grade-to-grade growth, grade-to-grade variability, and the separation of grade distributions. The data used in the study pertains to a dataset composed of a total of 1500 students from twelve primary schools in the province of Ankara, characterized by different levels of socio-economic cultural development. The comparison of the findings pertaining to the first and second sub-problems reveals that the mean differences found through separate calibration were lower than those applicable to concurrent calibration, while the standard deviation found in the case of separate calibration were again lower than the values established through concurrent calibration. Furthermore, the scale of impact in the case of separate calibration was again lower than the values applicable to concurrent calibration. The results reached for all three criteria, using the concurrent calibration method were ranked in the order ML < MAP < EAP, with ML leading to the lowest value while EAP producing the highest one. In case of separate calibration, on the other hand, the ranking of results was found to vary with reference to the criteria applied.

Keywords

References

  1. Boughton, K. A., Lorie, W. & Yao, L. (2005). A multidimensional multi-group irt models for vertical scales with complex test structure: An empirical evaluation of student growth using real data. Paper presented at the annual meeting of the National Council on Measurement in Education, Monreal, Canada.
  2. Creswell, J. W. (2013). Research design: Qualitative, quantitative and mixed methods approaches (4th edition). University of Nebraska, Lincoln: Sage.
  3. Çetin, E. (2009). Dikey ölçeklemede klasik test ve madde tepki kuramına dayalı yöntemlerin karşılaştırılması. Unpublished Doctoral Thesis, Ankara: Hacettepe University.
  4. Dongyang, L. (2009). Developing a common scale for testlet model parameter estimates under the common-item nonequivalent groups design. Unpublished Doctoral Thesis, University of Maryland.
  5. Hambleton, R. K., & Swaminathan, H. (1985). Item response theory: principles and applications. Boston: Kluwer.
  6. Hambleton, R. K., Swaminathan, H., & Rogers, H. J. (1991). Fundamentals of item response theory. Newbury Park, CA: Sage.
  7. Hanson, B. A., & Béguin, A. A. (2002). Obtaining a common scale for item response theory item parameters using separate versus concurrent estimation in the common- item equating design. Applied Psychological Measurement, 26, 3-24.
  8. Hanson, B. A., Zeng, L., & Chien, Y. (2004). ST: A Computer Program for IRT Scale Transformation [Computer software]. Retrieved January 24, 2005, from http://www.education.uiowa.edu/casma

Details

Primary Language

English

Subjects

-

Journal Section

Research Article

Authors

Aylin Albayrak Sarı
HACETTEPE UNIV
Türkiye

Hülya Kelecioğlu
HACETTEPE UNIV
Türkiye

Publication Date

April 3, 2017

Submission Date

January 17, 2017

Acceptance Date

March 3, 2017

Published in Issue

Year 2017 Volume: 8 Number: 1

APA
Albayrak Sarı, A., & Kelecioğlu, H. (2017). A Comparison of IRT Vertical Scaling Methods in Determining of the Increase in Achievement of Science Education. Journal of Measurement and Evaluation in Education and Psychology, 8(1), 98-111. https://doi.org/10.21031/epod.286221
AMA
1.Albayrak Sarı A, Kelecioğlu H. A Comparison of IRT Vertical Scaling Methods in Determining of the Increase in Achievement of Science Education. JMEEP. 2017;8(1):98-111. doi:10.21031/epod.286221
Chicago
Albayrak Sarı, Aylin, and Hülya Kelecioğlu. 2017. “A Comparison of IRT Vertical Scaling Methods in Determining of the Increase in Achievement of Science Education”. Journal of Measurement and Evaluation in Education and Psychology 8 (1): 98-111. https://doi.org/10.21031/epod.286221.
EndNote
Albayrak Sarı A, Kelecioğlu H (March 1, 2017) A Comparison of IRT Vertical Scaling Methods in Determining of the Increase in Achievement of Science Education. Journal of Measurement and Evaluation in Education and Psychology 8 1 98–111.
IEEE
[1]A. Albayrak Sarı and H. Kelecioğlu, “A Comparison of IRT Vertical Scaling Methods in Determining of the Increase in Achievement of Science Education”, JMEEP, vol. 8, no. 1, pp. 98–111, Mar. 2017, doi: 10.21031/epod.286221.
ISNAD
Albayrak Sarı, Aylin - Kelecioğlu, Hülya. “A Comparison of IRT Vertical Scaling Methods in Determining of the Increase in Achievement of Science Education”. Journal of Measurement and Evaluation in Education and Psychology 8/1 (March 1, 2017): 98-111. https://doi.org/10.21031/epod.286221.
JAMA
1.Albayrak Sarı A, Kelecioğlu H. A Comparison of IRT Vertical Scaling Methods in Determining of the Increase in Achievement of Science Education. JMEEP. 2017;8:98–111.
MLA
Albayrak Sarı, Aylin, and Hülya Kelecioğlu. “A Comparison of IRT Vertical Scaling Methods in Determining of the Increase in Achievement of Science Education”. Journal of Measurement and Evaluation in Education and Psychology, vol. 8, no. 1, Mar. 2017, pp. 98-111, doi:10.21031/epod.286221.
Vancouver
1.Aylin Albayrak Sarı, Hülya Kelecioğlu. A Comparison of IRT Vertical Scaling Methods in Determining of the Increase in Achievement of Science Education. JMEEP. 2017 Mar. 1;8(1):98-111. doi:10.21031/epod.286221

Cited By