JMETRIK: Classical Test Theory and Item Response Theory Data Analysis Software
Abstract
The aim of this study is to introduce the jMetric program which is one of the open source programs that can be used in the context of Item Response Theory and Classical Test Theory. In this context, the interface of the program, importing data to the program, a sample analysis, installing the jmetrik and support for the program are discussed. In sample analysis, the answers given by a total of 500 students from state and private schools, to a 10-item math test were analyzed to see whether they shows differentiating item functioning according to the type of school they attend. As a result of the analysis, it was found that two items were showing medium-level Differential Item Functioning (DIF). As a result of the study, it was found that the jMetric program, which is capable of performing Item Response Theory (IRT) analysis for two-category and multi-category items, is open to innovations, especially because it is open-source, and that researchers can easily add the suggested codes to the program and thus the program can be improved. In addition, an advantage of the program is producing visual results related to the analysis through the item characteristic curves.
Keywords
References
- Aksu, G., Reyhanlıoğlu, Ç., Eser M. T. (2017). Examining the two categorical datas by jMetrik, Bilog-MG and IRTPRO with application of mathematics exam. European Scientific Journal, 13(33).
- Crocker, L., & Algina, J. (1986). Introduction to classical &modern test theory. Orlando, FL: Holt, Rinehart & Winston.
- Embretson, S. E., & Reise, S. P. (2000). Item response theory for psychologists. Lawrence Erlbaum Associate, Inc.
- Haebara, T. (1980). Equating logistic ability scales by a weighted least squares method. Japanese Psychological Research, 22(3), 144–149.
- Hambleton, R. K., & Swaminathan, H. (1985). Item response theory principles and applications. Boston-USA: Kluwer-Nijhoff Publishing.
- Kim, S. & Kolen, M. J. (2007). Effects of scale linking on different definitions of criterion functions for the IRT characteristic curve methods. Journal of Educational and Behavioral Statistics, 32(4), 371–397.
- Lord, F.M. & Novick, M.R. (1968) Statistical Theories of Mental Test Scores. Addison-Wesley, Menlo Park.
- Loyd, B. H. & Hoover, H. D. (1980). Vertical equating using the Rasch model. Journal of Educational Measurement, 17(3), 179–193.
Details
Primary Language
English
Subjects
-
Journal Section
Research Article
Authors
Gökhan Aksu
0000-0003-2563-6112
Türkiye
Publication Date
June 28, 2019
Submission Date
November 15, 2018
Acceptance Date
March 12, 2019
Published in Issue
Year 2019 Volume: 10 Number: 2
Cited By
The ProQol-20, a restructured version of the professional quality of life scale (ProQOL)
Current Psychology
https://doi.org/10.1007/s12144-022-02755-2Study on the Academic Competency Assessment of Herbology Test using Rasch Model
Journal of Korean Medicine
https://doi.org/10.13048/jkm.22017A scoping review of Rasch analysis and item response theory in otolaryngology: Implications and future possibilities
Laryngoscope Investigative Otolaryngology
https://doi.org/10.1002/lio2.1208