Comparison of Different Ability Estimation Methods Based on 3 and 4PL Item Response Theory
Öz
This research analyzed the two-category Item
Response Theory (IRT) models as part of different ability estimation methods.
The research was carried out in consideration of responses to 20 items under
the Mathematics subtest of TEOG (National Transition from Primary to Secondary
Education) exam by the 8th-grade students in 2015-2016. The study group
consisted of 400 students who were randomly selected from the students
participated in the TEOG exam. Ability estimations and standard error values
for these estimations were calculated based on the data. These estimations were
compared by two-way analysis of variance (ANOVA) for repeated measurements
According to the research findings; it was revealed that the four-parameter
logistic (4PL) item model fit better. In terms of ability estimation methods,
the accuracy of Weighted Likelihood Estimation (WLE) was higher than Maximum A
Posteriori (MAP) and Expected A Posteriori (EAP). WLE and MAP ability estimation
model gave lower standard error values compared to the 4PL and 3PL model,
respectively. The highest marginal reliability coefficient value for the 3PL
model was calculated using estimations made according to MAP while estimations
made according to WLE were used for the 4PL model. According to the research
findings, it was concluded that the accuracy of ability scores obtained by the
WLE estimation method under the 4PL model was higher
Anahtar Kelimeler
Kaynakça
- Baker, F. B. (1992). Item Response Theory: Parameter Estimation Technique. New York: Marcel Dekker.
- Bar-Hillel, M., Budescu, D., & Attali, Y. (2005). Scoring and keying multiple choice tests: A case study in irrationality. Mind & Society, 4, 3-12. http://doi.org/cp7ddc
- Barton, M. A., & Lord, F. M. (1981). An upper asymptote for the three-parameter logistic item-response model. Research Bulletin, 81-20. Princeton, NJ: Educational Testing Service.
- Baykul, Y. (1979). Örtük özellikler ve klasik test kuramları üzerine bir karşılaştırma (Unpublished Doctoral thesis). Hacettepe University, Graduate School of Social Sciences, Ankara.
- Berberoğlu, G. (1988). Seçme amacıyla kullanılan testlerde Rasch modelinin katkıları (Unpublished Doctoral thesis). Hacettepe University, Graduate School of Social Sciences, Ankara.
- Birnbaum, A. (1968). Some latent trait models and their use in inferring an examinee’s ability. F. M. Lord & M. R. Novick(Ed), Statistical theories of mental test scores içinde (pp. 397-472). Reading MA: Addison-Wesley.
- Borgatto, A. F., Azevedo, C. L. N., Pinheiro, A., & Andrade, D. F. (2015). Comparison of ability estimation methods using irt for test with different degrees of difficulty. Communications in Statistics-Simulation and Computation, 44(2), 474-488.
- Ching-Fung, B. S. (2002). Ability estimation under different item parametrization and scoring models (Unpublished Doctoral thesis). North Teksas University, Teksas.
- Can, S. (2003). The analyses of secondary education institutions student selection and placement test’s verbal section with respect to item response theory models (Unpublished Master's thesis). Middle East Technical University, Graduate School of Social Sciences, Ankara.
- Chalmers R. P. (2013). mirt: Multidimensional Item Response Theory. R package version 0.9.0, [Çevirim içi: http://CRAN.R-project.org/package=mirt].