The Effects of Sample Size and Missing Data Rates on Generalizability Coefficients
Abstract
Purpose of the Study: Missing data are a common problem when administering measurement instruments. Yet the extent to which missing data affect the reliability, validity, average discrimination, and average difficulty of test results has received little study. Because missing data inevitably affect the psychometric properties of measurement instruments, this topic was considered worth investigating. In response to this need, a simulation study was conducted on the effects of missing data on reliability. Reliability estimates were examined within the framework of generalizability theory (G theory).
Research Methods: In line with the research questions, complete data sets with different sample sizes (100, 200, 400, 1000) were generated under a normal distribution for weak and strong one-dimensional structures. Missing data sets were then created by randomly deleting data from the complete sets at different rates (5%, 10%, 20%, 30%).
Findings and Results: When the estimates obtained from the missing and complete data sets were compared, the G and phi coefficients were found to be significantly affected in the weak one-dimensional design when the missingness rate was 20% or more. In the strong one-dimensional design, however, the coefficients were negligibly affected even at 30% missingness. Moreover, the estimates obtained when missing responses were coded as incorrect were lower than the estimates obtained from the missing data matrix, particularly for the weak one-dimensional data. The error statistics for the weak one-dimensional data with missing responses coded as incorrect were also significantly higher than those for their strong one-dimensional counterparts, especially at 20% and 30% missingness.
Implications for Research and Practice: Coding missing responses as incorrect is therefore not recommended as a missing data treatment in reliability estimation. Instead, generalizability theory, which allows analysis of data matrices that contain missing values, might be recommended.
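The simulation design described above can be illustrated with a minimal Python sketch. It is not the authors' procedure: the number of items (20), the one-factor threshold model used to generate dichotomous scores, the loading value, and the function names simulate_scores, delete_at_random, and g_and_phi are all assumptions made for illustration. The sketch estimates G and phi for a crossed persons x items design from a complete matrix, then repeats the estimate after randomly deleted responses are coded as incorrect. Estimating variance components directly from the incomplete matrix requires unbalanced-design estimators that are not shown here.

```python
import numpy as np

def simulate_scores(n_persons, n_items, loading, rng):
    """Dichotomous scores from an assumed one-factor model with normal latent traits.
    A higher loading gives a stronger one-dimensional structure."""
    theta = rng.standard_normal((n_persons, 1))
    noise = rng.standard_normal((n_persons, n_items))
    latent = loading * theta + np.sqrt(1.0 - loading**2) * noise
    return (latent > 0).astype(float)

def delete_at_random(scores, rate, rng):
    """Set each cell to NaN independently with probability `rate`."""
    out = scores.copy()
    out[rng.random(scores.shape) < rate] = np.nan
    return out

def g_and_phi(scores):
    """G and phi for a complete persons x items matrix, using ANOVA
    estimates of the person, item, and residual variance components."""
    n_p, n_i = scores.shape
    grand = scores.mean()
    person_means = scores.mean(axis=1)
    item_means = scores.mean(axis=0)
    ms_p = n_i * np.sum((person_means - grand) ** 2) / (n_p - 1)
    ms_i = n_p * np.sum((item_means - grand) ** 2) / (n_i - 1)
    resid = scores - person_means[:, None] - item_means[None, :] + grand
    ms_pi = np.sum(resid ** 2) / ((n_p - 1) * (n_i - 1))
    var_p = max((ms_p - ms_pi) / n_i, 0.0)   # negative estimates set to zero
    var_i = max((ms_i - ms_pi) / n_p, 0.0)
    var_pi = ms_pi
    g = var_p / (var_p + var_pi / n_i)               # relative decisions
    phi = var_p / (var_p + (var_i + var_pi) / n_i)   # absolute decisions
    return g, phi

rng = np.random.default_rng(0)
complete = simulate_scores(400, 20, 0.7, rng)   # assumed size and loading
g_full, phi_full = g_and_phi(complete)
print(f"complete: G={g_full:.3f} phi={phi_full:.3f}")
for rate in (0.05, 0.10, 0.20, 0.30):
    incomplete = delete_at_random(complete, rate, rng)
    coded_incorrect = np.nan_to_num(incomplete, nan=0.0)  # missing scored as 0
    g, phi = g_and_phi(coded_incorrect)
    print(f"rate={rate:.2f}: G={g:.3f} phi={phi:.3f}")
```

Repeating the loop over many replications, sample sizes, and loading values would be needed before comparing coefficients across conditions; a single run like this one only shows the mechanics.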
Details
Primary Language
English
Subjects
-
Journal Section
Research Article
Publication Date
May 20, 2018
Submission Date
May 20, 2018
Acceptance Date
-
Published in Issue
Year 2018 Volume: 18 Number: 75