<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.4 20241031//EN"
        "https://jats.nlm.nih.gov/publishing/1.4/JATS-journalpublishing1-4.dtd">
<article  article-type="research-article"        dtd-version="1.4">
            <front>

                <journal-meta>
                                                                <journal-id>int. j. assess. tools educ.</journal-id>
            <journal-title-group>
                                                                                    <journal-title>International Journal of Assessment Tools in Education</journal-title>
            </journal-title-group>
                                        <issn pub-type="epub">2148-7456</issn>
                                                                                            <publisher>
                    <publisher-name>İzzet KARA</publisher-name>
                </publisher>
                    </journal-meta>
                <article-meta>
                                        <article-id pub-id-type="doi">10.21449/ijate.330885</article-id>
                                                                <article-categories>
                                            <subj-group  xml:lang="en">
                                                            <subject>Studies on Education</subject>
                                                    </subj-group>
                                            <subj-group  xml:lang="tr">
                                                            <subject>Eğitim Üzerine Çalışmalar</subject>
                                                    </subj-group>
                                    </article-categories>
                                                                                                                                                        <title-group>
                                                                                                                                                            <article-title>Investigating the Impact of Missing Data Handling Methods on the Detection of Differential Item Functioning</article-title>
                                                                                                    </title-group>
            
                                                    <contrib-group content-type="authors">
                                                                        <contrib contrib-type="author">
                                                                <name>
                                    <surname>Selvi</surname>
                                    <given-names>Hüseyin</given-names>
                                </name>
                                                                    <aff>Mersin University</aff>
                                                            </contrib>
                                                    <contrib contrib-type="author">
                                                                <name>
                                    <surname>Özdemir Alıcı</surname>
                                    <given-names>Devrim</given-names>
                                </name>
                                                                    <aff>Mersin University</aff>
                                                            </contrib>
                                                                                </contrib-group>
                        
                                        <pub-date pub-type="pub" iso-8601-date="20180101">
                    <day>01</day>
                    <month>01</month>
                    <year>2018</year>
                </pub-date>
                                        <volume>5</volume>
                                        <issue>1</issue>
                                        <fpage>1</fpage>
                                        <lpage>14</lpage>
                        
                        <history>
                                    <date date-type="received" iso-8601-date="20170725">
                        <day>07</day>
                        <month>25</month>
                        <year>2017</year>
                    </date>
                                                    <date date-type="accepted" iso-8601-date="20170723">
                        <day>07</day>
                        <month>23</month>
                        <year>2017</year>
                    </date>
                            </history>
                                        <permissions>
                    <copyright-statement>Copyright © 2014, International Journal of Assessment Tools in Education</copyright-statement>
                    <copyright-year>2014</copyright-year>
                    <copyright-holder>International Journal of Assessment Tools in Education</copyright-holder>
                </permissions>
            
                                                                                                                        <abstract><p>In this study, it is aimed to investigate theimpact of different missing data handling methods on the detection ofDifferential Item Functioning methods (Mantel Haenszel and Standardizationmethods based on Classical Test Theory and Likelihood Ratio Test method basedon Item Response Theory). In this regard, on the data acquired from 1046candidates who entered to Foreign National Student Exam (FNSE) held in year2016 by Mersin University (MEU) and answered Basic Skills subtest, usingdifferent missing data handling methods, differential item functioning analyseswith Mantel Haenszel, Standardization and Likelihood Ratio Test methods areperformed. Basic Skills test consists of 80 multiple choice items. The itemsare all binary scored (1-0) items. Among the participants 523 are female and523 are male. The findings showed that the number of items flagged as DIF haschanged with the used missing data handling methods. The DIF detection methodsbased on Classical Test Theory are more consistent within themselves comparedto DIF detection method based on Item Response Theory, whereas the used missingdata handling methods differentiate the DIF detected items and this differencereaches a significant level for Mantel Haenszel method</p></abstract>
                                                            
            
                                                                                        <kwd-group>
                                                    <kwd>Differential Item Functioning DIF</kwd>
                                                    <kwd>  Test and Item Bias</kwd>
                                                    <kwd>  Missing Values; Imputation of Missing Data</kwd>
                                                    <kwd>  Mantel Haenszel</kwd>
                                                    <kwd>  Likelihood Ratio Test</kwd>
                                            </kwd-group>
                            
                                                                                                                                                    </article-meta>
    </front>
    <back>
                            <ref-list>
                                    <ref id="ref1">
                        <label>1</label>
                        <mixed-citation publication-type="journal">Abedlazeez, N. (2010). Exploring DIF: Comparison of CTT and IRT methods.  International Journal of Sustainable Development, 7(1), 11-46.</mixed-citation>
                    </ref>
                                    <ref id="ref2">
                        <label>2</label>
                        <mixed-citation publication-type="journal">Allison, P. D. (2002). Missing data. California: Sage Publication Inc.</mixed-citation>
                    </ref>
                                    <ref id="ref3">
                        <label>3</label>
                        <mixed-citation publication-type="journal">Alpar, R. (2011). Uygulamalı çok değişkenli istatistiksel yöntemler. Ankara: Detay Yayıncılık.</mixed-citation>
                    </ref>
                                    <ref id="ref4">
                        <label>4</label>
                        <mixed-citation publication-type="journal">Angoff, W.H. (1993). Perspectives on differential item functioning methodology. In Holland &amp; Wainer (Ed.), Differential Item Functioning. New Jersey: Lawrence Erlbaum Associates Publishers.</mixed-citation>
                    </ref>
                                    <ref id="ref5">
                        <label>5</label>
                        <mixed-citation publication-type="journal">Banks, K., &amp; Walker, C. (2006). Performance of SIBTEST when focal group examinees have missing data. San Francisco: National Council of Measurement in Education.</mixed-citation>
                    </ref>
                                    <ref id="ref6">
                        <label>6</label>
                        <mixed-citation publication-type="journal">Banks, K. (2015). An introduction to missing data in the context of differential item                         functioning. Practical Assessment, Research &amp; Evaluation. 20(12).</mixed-citation>
                    </ref>
                                    <ref id="ref7">
                        <label>7</label>
                        <mixed-citation publication-type="journal">Bennett, D. A. (2001). How can I deal with missing data in my study? Australian and New Zealand Journal of Public Health, 25, 464–469.</mixed-citation>
                    </ref>
                                    <ref id="ref8">
                        <label>8</label>
                        <mixed-citation publication-type="journal">Bernhard, J., Celia, D.F., &amp;Coates, A.S. (1998). Missing quality of life data in cancer clinical trials: Serious problems and challenges. Statistics in Medicine, 17, 517-532.</mixed-citation>
                    </ref>
                                    <ref id="ref9">
                        <label>9</label>
                        <mixed-citation publication-type="journal">Camili, G., &amp; Shepard, L.A. (1994). Methods for identifying biased test items. London: Sage Publication.</mixed-citation>
                    </ref>
                                    <ref id="ref10">
                        <label>10</label>
                        <mixed-citation publication-type="journal">Demir, E., &amp; Parlak, B. (2012). Türkiye’de eğitim araştırmalarında kayıp veri sorunu. Journal of Measurement and Evaluation in Education and Psychology 3(1), 230-241.</mixed-citation>
                    </ref>
                                    <ref id="ref11">
                        <label>11</label>
                        <mixed-citation publication-type="journal">Demir, E. (2013). Kayıp verilerin varlığında çoktan seçmeli testlerde madde ve test parametrelerinin kestirilmesi: SBS örneği [Item and test parameters estimations for multiple choice tests in the presence of missing data: The case of SBS]. Journal of Educational Sciences Research, 3(2), 47–68.</mixed-citation>
                    </ref>
                                    <ref id="ref12">
                        <label>12</label>
                        <mixed-citation publication-type="journal">Dişçi, R. (2012). Temel ve klinik biyoistatistik. İstanbul: Tıp Kitapevi.</mixed-citation>
                    </ref>
                                    <ref id="ref13">
                        <label>13</label>
                        <mixed-citation publication-type="journal">Doğan, N., &amp; Öğretmen, T. (2008). Değişen Madde Fonksiyonunu belirlemede Mantel–Haenszel, Ki-Kare ve Lojistik Regresyon tekniklerinin karşılaştırılması. Education and Science, 33(148).</mixed-citation>
                    </ref>
                                    <ref id="ref14">
                        <label>14</label>
                        <mixed-citation publication-type="journal">Embretson, S.E., &amp; Reise, S.P. (2000). Item response theory for psychologists. London: Lawrence Erlbaum Associates.</mixed-citation>
                    </ref>
                                    <ref id="ref15">
                        <label>15</label>
                        <mixed-citation publication-type="journal">Emenogu, B. C., Falenchuck, O., &amp; Childs, R. A. (2010). The effect of missing data treatment on Mantel-Haenszel DIF detection. The Alberta Journal of Educational Research, 56(4), 459-469.</mixed-citation>
                    </ref>
                                    <ref id="ref16">
                        <label>16</label>
                        <mixed-citation publication-type="journal">Falenchuk, O., &amp; Herbert, M. (2009). Investigation of differential non-response as a factor affecting the results of Mantel-Haenszel DIF detection California: American Educational Research Association.</mixed-citation>
                    </ref>
                                    <ref id="ref17">
                        <label>17</label>
                        <mixed-citation publication-type="journal">Finch, W.H. (2011). The impact of missing data on the detection of nonuniform differential ıtem functioning. Educational and Psychological Measurement, 71(4) 663–683.</mixed-citation>
                    </ref>
                                    <ref id="ref18">
                        <label>18</label>
                        <mixed-citation publication-type="journal">Garrett, P. L. (2009). A monte carlo study investigating missing data, differential item functioning, and effect size. Georgia State University, Unpublished doctoral dissertation.</mixed-citation>
                    </ref>
                                    <ref id="ref19">
                        <label>19</label>
                        <mixed-citation publication-type="journal">Gelin, M.N. &amp; Zumbo, B.D. (2003). Differential item functioning results may change depending on how an item is scored: an illustration with the center for epidemiologic studies depression scale. Educational and Psychological Measurement, X(X) DOI: 10.1177/0013164402239317.</mixed-citation>
                    </ref>
                                    <ref id="ref20">
                        <label>20</label>
                        <mixed-citation publication-type="journal">Gierl, M.J., Jodoin, M.G., &amp; Ackerman, T.A. (2000). Performance of Mantel-Haenszel, Simultaneous Item Bias Test, and Logistic Regression when the proportion of DIF items is large. American Educational Research Association.</mixed-citation>
                    </ref>
                                    <ref id="ref21">
                        <label>21</label>
                        <mixed-citation publication-type="journal">Gonzales, A., Padilla, J.L., Dolores, H., Gomez-Benito, J., &amp; Benitez, I. (2010).  EASY-DIF: Software for analyzing differential item functioning using the Mantel-Haenszel and Standardization procedures. Applied Psychological Measurement. doi:10.1177/0146621610381489.</mixed-citation>
                    </ref>
                                    <ref id="ref22">
                        <label>22</label>
                        <mixed-citation publication-type="journal">Graham, J.W. (2009). Missing Data Analysis: Making it work in the real world. Annual Review of Psychology, 60(4), 549-576.</mixed-citation>
                    </ref>
                                    <ref id="ref23">
                        <label>23</label>
                        <mixed-citation publication-type="journal">Groves, R. M. (2006). Nonresponse rates and nonresponse bias in household surveys. Public
Opinion Quarterly, 70(5), 646-675.</mixed-citation>
                    </ref>
                                    <ref id="ref24">
                        <label>24</label>
                        <mixed-citation publication-type="journal">Gözen Çıtak, G. (2007). Klasik test ve madde-tepki kuramlarına göre çoktan seçmeli testlerde farklı puanlama yöntemlerinin karşılaştırılması. Doktora Tezi, Ankara Üniversitesi, Ankara</mixed-citation>
                    </ref>
                                    <ref id="ref25">
                        <label>25</label>
                        <mixed-citation publication-type="journal">Hambletton, R.K. &amp; Swaminathan, H. (1985). Item Response Theory: Principles and applications. Boston: Kluwer-Nijhoff Publishing.</mixed-citation>
                    </ref>
                                    <ref id="ref26">
                        <label>26</label>
                        <mixed-citation publication-type="journal">Harwell, M. Stone, C. A., Hsu, T.C., &amp; Kirisci, L. (1996). Monte carlo studies in item response theory. Applied Psychological Measurement, 20, 101-125.</mixed-citation>
                    </ref>
                                    <ref id="ref27">
                        <label>27</label>
                        <mixed-citation publication-type="journal">Hohensinn, C. &amp; Kubinger K. D. (2011). On the impact of missing values on item fit and the model validness of the Rasch model. Psychological Test and Assessment Modeling, 53, 380-393.</mixed-citation>
                    </ref>
                                    <ref id="ref28">
                        <label>28</label>
                        <mixed-citation publication-type="journal">Kan, A., Sünbül, Ö., Ömür, S. (2013). 6.- 8. sınıf seviye belirleme sınavları alt testlerinin çeşitli yöntemlere göre değişen madde fonksiyonlarının incelenmesi. Mersin University Journal of the Faculty of Education, 9(2), 207-222.</mixed-citation>
                    </ref>
                                    <ref id="ref29">
                        <label>29</label>
                        <mixed-citation publication-type="journal">Kothari, C.R. (2004). Research methodology: Methods and techniques (Second Revised Edition). New Delhi: New Age Int. Ltd.</mixed-citation>
                    </ref>
                                    <ref id="ref30">
                        <label>30</label>
                        <mixed-citation publication-type="journal">Kristanjansonn E., R. Aylesworth, I. McDowell &amp; B.D. Zumbo (2005). A Comparison of four methods for detecting differential item functioning in ordered response model. Educational and Psychological Measurement. 65(6), 935-953.</mixed-citation>
                    </ref>
                                    <ref id="ref31">
                        <label>31</label>
                        <mixed-citation publication-type="journal">Little, R. J. A &amp; Rubin, D. B. (1987). Statistical analysis with missing data (2nd ed.). New York: John Wiley &amp; Sons, Inc.</mixed-citation>
                    </ref>
                                    <ref id="ref32">
                        <label>32</label>
                        <mixed-citation publication-type="journal">Lord, F. M. (1974). Estimation of latent ability and item parameters when there are omitted responses. Psychometrika, 39, 247-264.</mixed-citation>
                    </ref>
                                    <ref id="ref33">
                        <label>33</label>
                        <mixed-citation publication-type="journal">Lord, F. M. (1980). Applications of item response theory to practical testing problems. New Jersey: Lawrence Erlbaum Associates.</mixed-citation>
                    </ref>
                                    <ref id="ref34">
                        <label>34</label>
                        <mixed-citation publication-type="journal">Molenberghs, G., &amp; Kenward, M.G. (2007). Missing data in clinical studie (1 st ed.). England: John Wiley&amp;Sons.</mixed-citation>
                    </ref>
                                    <ref id="ref35">
                        <label>35</label>
                        <mixed-citation publication-type="journal">Narayanan, P., &amp; Swaminathan, H. (1994). Performance of the Mantel-Haenszel and Simultaneous Item Bias procedures for detecting differential ıtem functioning, Applied Psychological Measurement, 18(4).</mixed-citation>
                    </ref>
                                    <ref id="ref36">
                        <label>36</label>
                        <mixed-citation publication-type="journal">Osterlind, S.J. (1983). Test item bias. London: Sage Publication.</mixed-citation>
                    </ref>
                                    <ref id="ref37">
                        <label>37</label>
                        <mixed-citation publication-type="journal">Padilla, J.L., Hidalgo, J.L., Benitez, I., &amp; Gomez-Benito, J. (2012). Comparison of three software programs for evaluating DIF by means of the Mantel-Haenszel procedure; EASY DIF, DIFAS and EZDIF, Psicologica, 33,135-156.</mixed-citation>
                    </ref>
                                    <ref id="ref38">
                        <label>38</label>
                        <mixed-citation publication-type="journal">Peng, C.Y.J., Harwell, M., Liou, S.M., &amp; Ehman, L. H. (2006). Advances in missing data methods and implications for educational research. In S. Sawilowsky (Ed.), Greenwich: Real data analysis.</mixed-citation>
                    </ref>
                                    <ref id="ref39">
                        <label>39</label>
                        <mixed-citation publication-type="journal">Peng, C. J., &amp; Zhu, J. (2008). Comparison of two approaches for handling missing covariates in logistic regression. Educational and Psychological Measurement, 68(1), 58-77.</mixed-citation>
                    </ref>
                                    <ref id="ref40">
                        <label>40</label>
                        <mixed-citation publication-type="journal">Pigott, T.D. (2001). A review of methods for missing data. Educational Research and Evaluation, 7(4); 353-383.</mixed-citation>
                    </ref>
                                    <ref id="ref41">
                        <label>41</label>
                        <mixed-citation publication-type="journal">Robitzsch, A, &amp; Rupp, A.A. (2009). Impact of missing data on the detection of differential item functioning the case of mantel-haenszel and logistic regression analysis. Educational and Psychological Measurement, 69(1): 18-34.</mixed-citation>
                    </ref>
                                    <ref id="ref42">
                        <label>42</label>
                        <mixed-citation publication-type="journal">Rousseau, M., Bertrand, R., &amp; Boiteau, N. (2006, April). Impact of missing data treatment on the efficiency of DIF methods. California: National Council on Measurement in Education.</mixed-citation>
                    </ref>
                                    <ref id="ref43">
                        <label>43</label>
                        <mixed-citation publication-type="journal">Royce, S., Straits, B.C., &amp; Straits, M.M. (1993). Approaches to social research (2nd ed.). New York: Oxford University Press.</mixed-citation>
                    </ref>
                                    <ref id="ref44">
                        <label>44</label>
                        <mixed-citation publication-type="journal">Rubin, D. B. (1976). Inference and missing data. Biometrika, 63(3), 581-592.</mixed-citation>
                    </ref>
                                    <ref id="ref45">
                        <label>45</label>
                        <mixed-citation publication-type="journal">Schafer, J. L. (1999). Multiple imputation: A primer. Statistical Methods in Medical Research, (8), 3-15.</mixed-citation>
                    </ref>
                                    <ref id="ref46">
                        <label>46</label>
                        <mixed-citation publication-type="journal">Sedivy, S. K., Zhang, B., &amp; Traxel, N. M. (2006). Detection of differential item functioning with polytomous items in the presence of missing data. California: National Council of Measurement in Education.</mixed-citation>
                    </ref>
                                    <ref id="ref47">
                        <label>47</label>
                        <mixed-citation publication-type="journal">Selvi, H. (2013). Klasik test ve madde tepki kuramlarına dayalı değişen madde fonksiyonu belirleme tekniklerinin farklı puanlama durumlarında incelenmesi. Yayınlanmamış Doktora Tezi. Mersin Üniversitesi Eğitim Bilimleri Enstitüsü.</mixed-citation>
                    </ref>
                                    <ref id="ref48">
                        <label>48</label>
                        <mixed-citation publication-type="journal">Singh, Y.K. (2006). Fundamental of research methodology and statistics. New Delhi: New Age Int. Ltd.</mixed-citation>
                    </ref>
                                    <ref id="ref49">
                        <label>49</label>
                        <mixed-citation publication-type="journal">Spray, J., &amp; Miller, T. (1994). Identifying nonuniform DIF in polytomously scored test items. American College Testing Research Report Series 94-1. Iowa City, IA: American College Testing Program.</mixed-citation>
                    </ref>
                                    <ref id="ref50">
                        <label>50</label>
                        <mixed-citation publication-type="journal">Ward, W.C., &amp; Bennett, R.E. (2012). Construction versus choice in cognitive measurement: issues in constructed response, performance testing, and portfolio assessment. London and New York: Routledge, Taylor &amp; Francis Group.</mixed-citation>
                    </ref>
                                    <ref id="ref51">
                        <label>51</label>
                        <mixed-citation publication-type="journal">Woodward, M., Smith, W.C., &amp; Tunstall Pedoe H. (1991). Bias from missing values: Sex differences in implication of failed venepuncture for the Scottish Health Study.Int J. Epidemiol.</mixed-citation>
                    </ref>
                                    <ref id="ref52">
                        <label>52</label>
                        <mixed-citation publication-type="journal">Wu, A. D., Li, Z., &amp; Zumbo, B. D. (2007). Decoding the meaning of factorial invariance and updating the practice of multi-group confirmatory factor analysis: A demonstration with TIMSS data. Practical Assessment, Research &amp; Evaluation, 12(3), 1-26.</mixed-citation>
                    </ref>
                                    <ref id="ref53">
                        <label>53</label>
                        <mixed-citation publication-type="journal">Zumbo, B. D. (1999). A Handbook on the theory and methods of Differential Item Functioning (DIF): Logistic Regression modeling as a unitary framework for binary and likert-type (ordinal) item scores. Ottawa ON: Directorate of Human Resources Research and Evaluation, Department of National Defense.</mixed-citation>
                    </ref>
                            </ref-list>
                    </back>
    </article>
