Research Article

A Comparison of Different Designs in Scoring of PISA 2009 Reading Open Ended Items According to Generalizability Theory

Volume: 14 Number: 2 June 30, 2023
EN

A Comparison of Different Designs in Scoring of PISA 2009 Reading Open Ended Items According to Generalizability Theory

Abstract

This study compares the different designs obtained through four raters’ scoring the open-ended items used in PISA 2009 reading literacy altogether or alternately according to the Generalizability Theory. The sample of the research was composed of 362 students (out of 4996 students participating in PISA 2009) who responded to the items of reading skills and who were scored by more than one rater. Two designs were created so as to be used in generalizability theory in the study. One of them was the crossed design symbolized as “s x i x r” (student x item x rater), in which students are scored by each rater in terms of the same skills. The second was the nested design symbolized as “(r:s) x i”, where each rater scored only a group of students and raters are nested in students and the items were crossed with these variables. On comparing the s x i x r design with (r:s) x i design, it was found that the relative and absolute error variances estimated for (r:s) x i design were smaller than those for s x i x r design and that therefore the G and Phi coefficients took on bigger values. On increasing the number of raters in both designs, the G and Phi coefficients also increased in the D study. While acceptable values of G and Phi coefficients were reached on reducing the number of raters by half in Booklet 2, raising the number of raters seemed more appropriate in Booklet 8.

Keywords

References

  1. Atılgan, H. (2008). Using generalizability theory to assess the score realibility of the special ability selection examinations for music education programmes in higher education. International Journal of Research and Method Education, 31(1), 63-76. https://doi.org/10.1080/17437270801919925.
  2. Atılgan, H., Kan, A. & Doğan, N. (2011). Eğitimde ölçme ve değerlendirme. (5. Baskı). Anı Yayıncılık.
  3. Balbağ, M., Leblebicier, K., Karaer G., Sarıkahya E. & Erkan Ö. (2016). Türkiye'de fen eğitimi ve öğretimi sorunları. Eğitim ve Öğretim Araştırmaları Dergisi, 5(3), 1-12. http://www.jret.org/FileUpload/ks281142/File/02.m._zafer_balbag.pdf
  4. Baykul, Y. (2000). Eğitimde ve psikolojide ölçme: Klasik test teorisi ve uygulaması. ÖSYM
  5. Bernardin, H. J. & Villanova, P. (2005). Research streams in rater self-efficacy. Group and Organizational Management, 30, 61-88. https://doi.org/10.1177/1059601104267675
  6. Biemer, L. (1993). Trends-social studies /authentic assessment. Educational Leadership, 50 (8). https://www.ascd.org/el/articles/-authentic-assessment
  7. Brennan, R. L. (2001). Generalizability theory. Springer-Verlag Publishing. https://doi.org/10.1007/978-1-4757-3456-0
  8. Demir, E. (2010). Uluslararası öğrenci değerlendirme programı (PISA) bilişsel alan testlerinde yer alan soru tiplerine göre Türkiye’de öğrenci başarıları (Yayınlanmamış yüksek lisans tezi). Hacettepe Üniversitesi.

Details

Primary Language

English

Subjects

Test Theories

Journal Section

Research Article

Publication Date

June 30, 2023

Submission Date

November 28, 2022

Acceptance Date

June 12, 2023

Published in Issue

Year 2023 Volume: 14 Number: 2

APA
Alkan, M., & Doğan, N. (2023). A Comparison of Different Designs in Scoring of PISA 2009 Reading Open Ended Items According to Generalizability Theory. Journal of Measurement and Evaluation in Education and Psychology, 14(2), 106-117. https://doi.org/10.21031/epod.1210917
AMA
1.Alkan M, Doğan N. A Comparison of Different Designs in Scoring of PISA 2009 Reading Open Ended Items According to Generalizability Theory. JMEEP. 2023;14(2):106-117. doi:10.21031/epod.1210917
Chicago
Alkan, Meral, and Nuri Doğan. 2023. “A Comparison of Different Designs in Scoring of PISA 2009 Reading Open Ended Items According to Generalizability Theory”. Journal of Measurement and Evaluation in Education and Psychology 14 (2): 106-17. https://doi.org/10.21031/epod.1210917.
EndNote
Alkan M, Doğan N (June 1, 2023) A Comparison of Different Designs in Scoring of PISA 2009 Reading Open Ended Items According to Generalizability Theory. Journal of Measurement and Evaluation in Education and Psychology 14 2 106–117.
IEEE
[1]M. Alkan and N. Doğan, “A Comparison of Different Designs in Scoring of PISA 2009 Reading Open Ended Items According to Generalizability Theory”, JMEEP, vol. 14, no. 2, pp. 106–117, June 2023, doi: 10.21031/epod.1210917.
ISNAD
Alkan, Meral - Doğan, Nuri. “A Comparison of Different Designs in Scoring of PISA 2009 Reading Open Ended Items According to Generalizability Theory”. Journal of Measurement and Evaluation in Education and Psychology 14/2 (June 1, 2023): 106-117. https://doi.org/10.21031/epod.1210917.
JAMA
1.Alkan M, Doğan N. A Comparison of Different Designs in Scoring of PISA 2009 Reading Open Ended Items According to Generalizability Theory. JMEEP. 2023;14:106–117.
MLA
Alkan, Meral, and Nuri Doğan. “A Comparison of Different Designs in Scoring of PISA 2009 Reading Open Ended Items According to Generalizability Theory”. Journal of Measurement and Evaluation in Education and Psychology, vol. 14, no. 2, June 2023, pp. 106-17, doi:10.21031/epod.1210917.
Vancouver
1.Meral Alkan, Nuri Doğan. A Comparison of Different Designs in Scoring of PISA 2009 Reading Open Ended Items According to Generalizability Theory. JMEEP. 2023 Jun. 1;14(2):106-17. doi:10.21031/epod.1210917