EN
Investigation of Classification Accuracy, Test Length and Measurement Precision at Computerized Adaptive Classification Tests
Abstract
This study aims to compare Sequential Probability Ratio Test (SPRT) and Confidence Interval (CI) classification criteria, Maximum Fisher Information method on the basis of estimated-ability (MFI-EB) and Cut-Point (MFI-CB) item selection methods while ability estimation method is Weighted Likelihood Estimation (WLE) in Computerized Adaptive Classification Testing (CACT), according to the Average Classification Accuracy (ACA), Average Test Length (ATL), and measurement precision under content balancing (Constrained Computerized Adaptive Testing: CCAT and Modified Multinomial Model: MMM) and item exposure control (Sympson-Hetter Method: SH and Item Eligibility Method: IE) when the classification is done based on two, three, or four categories for a unidimensional pool of dichotomous items. Forty-eight conditions are created in Monte Carlo (MC) simulation for the data, generated in R software, including 500 items and 5000 examinees, and the results are calculated over 30 replications. As a result of the study, it was observed that CI performs better in terms of ATL, and SPRT performs better in ACA and correlation, bias, Root Mean Squared Error (RMSE), and Mean Absolute Error (MAE) values, sequentially; MFI-EB is more useful than MFI-CB. It was also seen that MMM is more successful in content balancing, whereas CCAT is better in terms of test efficiency (ATL and ACA), and IE is superior in terms of item exposure control though SH is more beneficial in test efficiency. Besides, increasing the number of classification categories increases ATL but decreases ACA, and it gives better results in terms of the correlation, bias, RMSE, and MAE values.
Keywords
References
- Bao, Y., Shen, Y., Wang, S., & Bradshaw, L. (2021). Flexible computerized adaptive tests to detect misconceptions and estimate ability simultaneously. Applied Psychological Measurement, 45(1), 3-21. doi: 10.1177/0146621620965730
- Dooley, K. (2002). Simulation research methods. In J. Baum (Ed.), Companion to organizations (pp. 829-848). London: Blackwell.
- Eggen, T. J. H. M. (1999). Item selection in adaptive testing with the sequential probability ratio test. Applied Psychological Measurement, 23(3), 249-261. doi: 10.1177/01466219922031365
- Eggen, T. J. H. M., & Straetmans, G. J. J. M. (2000). Computerized adaptive testing for classifying examinees into three categories. Educational and Psychological Measurement, 60(5), 713-734. doi: 10.1177/00131640021970862
- Fan, Z., Wang, C., Chang, H., & Douglas, J. (2012). Utilizing response time distributions for item selection in CAT. Journal of Educational and Behavioral Statistics, 37(5), 655-670. doi: 10.3102/1076998611422912
- Finkelman, M. (2008). On using stochastic curtailment to shorten the SPRT in sequential mastery testing. Journal of Educational and Behavioral Statistics, 33(4), 442-463. doi: 10.3102/1076998607308573
- Gündeğer, C., & Doğan, N. (2018a). A comparison of computerized adaptive classification test criteria in terms of test efficiency and measurement precision. Journal of Measurement and Evaluation in Education and Psychology, 9(2), 161-177. doi: 10.21031/epod.401077
- Gündeğer, C., & Doğan, N. (2018b). The effects of item pool characteristics on test length and classification accuracy in computerized adaptive classification testings. Hacettepe University Journal of Education, 33(4), 888-896. doi: 10.16986/HUJE.2016024284
Details
Primary Language
English
Subjects
-
Journal Section
Research Article
Publication Date
March 31, 2021
Submission Date
August 30, 2020
Acceptance Date
February 21, 2021
Published in Issue
Year 2021 Volume: 12 Number: 1
APA
Demir, S., & Atar, B. (2021). Investigation of Classification Accuracy, Test Length and Measurement Precision at Computerized Adaptive Classification Tests. Journal of Measurement and Evaluation in Education and Psychology, 12(1), 15-27. https://doi.org/10.21031/epod.787865
AMA
1.Demir S, Atar B. Investigation of Classification Accuracy, Test Length and Measurement Precision at Computerized Adaptive Classification Tests. JMEEP. 2021;12(1):15-27. doi:10.21031/epod.787865
Chicago
Demir, Seda, and Burcu Atar. 2021. “Investigation of Classification Accuracy, Test Length and Measurement Precision at Computerized Adaptive Classification Tests”. Journal of Measurement and Evaluation in Education and Psychology 12 (1): 15-27. https://doi.org/10.21031/epod.787865.
EndNote
Demir S, Atar B (March 1, 2021) Investigation of Classification Accuracy, Test Length and Measurement Precision at Computerized Adaptive Classification Tests. Journal of Measurement and Evaluation in Education and Psychology 12 1 15–27.
IEEE
[1]S. Demir and B. Atar, “Investigation of Classification Accuracy, Test Length and Measurement Precision at Computerized Adaptive Classification Tests”, JMEEP, vol. 12, no. 1, pp. 15–27, Mar. 2021, doi: 10.21031/epod.787865.
ISNAD
Demir, Seda - Atar, Burcu. “Investigation of Classification Accuracy, Test Length and Measurement Precision at Computerized Adaptive Classification Tests”. Journal of Measurement and Evaluation in Education and Psychology 12/1 (March 1, 2021): 15-27. https://doi.org/10.21031/epod.787865.
JAMA
1.Demir S, Atar B. Investigation of Classification Accuracy, Test Length and Measurement Precision at Computerized Adaptive Classification Tests. JMEEP. 2021;12:15–27.
MLA
Demir, Seda, and Burcu Atar. “Investigation of Classification Accuracy, Test Length and Measurement Precision at Computerized Adaptive Classification Tests”. Journal of Measurement and Evaluation in Education and Psychology, vol. 12, no. 1, Mar. 2021, pp. 15-27, doi:10.21031/epod.787865.
Vancouver
1.Seda Demir, Burcu Atar. Investigation of Classification Accuracy, Test Length and Measurement Precision at Computerized Adaptive Classification Tests. JMEEP. 2021 Mar. 1;12(1):15-27. doi:10.21031/epod.787865
Cited By
Comparison of Different Computerized Adaptive Testing Approaches with Shadow Test Under Different Test Length and Ability Estimation Method Conditions
Eğitimde ve Psikolojide Ölçme ve Değerlendirme Dergisi
https://doi.org/10.21031/epod.1202599