<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.4 20241031//EN"
        "https://jats.nlm.nih.gov/publishing/1.4/JATS-journalpublishing1-4.dtd">
<article  article-type="research-article"        dtd-version="1.4">
            <front>

                <journal-meta>
                                                                <journal-id>puje</journal-id>
            <journal-title-group>
                                                                                    <journal-title>Pamukkale Üniversitesi Eğitim Fakültesi Dergisi</journal-title>
            </journal-title-group>
                            <issn pub-type="ppub">1301-0085</issn>
                                        <issn pub-type="epub">1309-0275</issn>
                                                                                            <publisher>
                    <publisher-name>Pamukkale University</publisher-name>
                </publisher>
                    </journal-meta>
                <article-meta>
                                        <article-id pub-id-type="doi">10.9779/pauefd.585774</article-id>
                                                                                                                                                                                            <title-group>
                                                                                                                        <trans-title-group xml:lang="tr">
                                    <trans-title>3 ve 4PL Madde Tepki Kuramı Modellerine Göre Farklı Yetenek Kestirim Yöntemlerinin Karşılaştırılması</trans-title>
                                </trans-title-group>
                                                                                                                                                                                                <article-title>Comparison of Different Ability Estimation Methods Based on 3 and 4PL Item Response Theory</article-title>
                                                                                                    </title-group>
            
                                                    <contrib-group content-type="authors">
                                                                        <contrib contrib-type="author">
                                                                    <contrib-id contrib-id-type="orcid">
                                        https://orcid.org/0000-0001-6572-274X</contrib-id>
                                                                <name>
                                    <surname>Doğruöz</surname>
                                    <given-names>Ebru</given-names>
                                </name>
                                                                    <aff>ÇANKIRI KARATEKİN ÜNİVERSİTESİ</aff>
                                                            </contrib>
                                                    <contrib contrib-type="author">
                                                                    <contrib-id contrib-id-type="orcid">
                                        https://orcid.org/0000-0001-5255-8792</contrib-id>
                                                                <name>
                                    <surname>Akın Arıkan</surname>
                                    <given-names>Çiğdem</given-names>
                                </name>
                                                                    <aff>ORDU ÜNİVERSİTESİ</aff>
                                                            </contrib>
                                                                                </contrib-group>
                        
                                        <pub-date pub-type="pub" iso-8601-date="20200901">
                    <day>09</day>
                    <month>01</month>
                    <year>2020</year>
                </pub-date>
                                                    <issue>50</issue>
                                        <fpage>50</fpage>
                                        <lpage>69</lpage>
                        
                        <history>
                                    <date date-type="received" iso-8601-date="20190702">
                        <day>07</day>
                        <month>02</month>
                        <year>2019</year>
                    </date>
                                                    <date date-type="accepted" iso-8601-date="20200205">
                        <day>02</day>
                        <month>05</month>
                        <year>2020</year>
                    </date>
                            </history>
                                        <permissions>
                    <copyright-statement>Copyright © 1996, Pamukkale University Journal of Education</copyright-statement>
                    <copyright-year>1996</copyright-year>
                    <copyright-holder>Pamukkale University Journal of Education</copyright-holder>
                </permissions>
            
                                                                                                <trans-abstract xml:lang="tr">
                            <p>Bu araştırmada iki kategorili MaddeTepki Kuramı modelleri, farklı yetenek kestirim yöntemleri bağlamındaincelenmiştir. Araştırma 2015-2016 yılında 8. Sınıf öğrencilerin TEOG sınavınınmatematik alt testinde yer alan 20 maddeye verdikleri yanıtlar ışığındagerçekleştirilmiştir. Bu verilerden seçkisiz olarak seçilen 4000 yanıtlayıcı,çalışma grubunu oluşturmaktadır. Veriler üzerinden yetenek kestirimleri ve bukestirimlere ait standart hata değerleri hesaplanmıştır. Bu kestirimlertekrarlı ölçümler için iki faktörlü varyans analizi (ANOVA) kullanılarakkarşılaştırılmıştır. Araştırma bulguları 4PL modelin daha iyi uyum gösterdiğiniortaya çıkartmıştır. WLE yetenek kestirim yönteminin doğruluğu MAP ve EAPyetenek kestirim yönteminin doğruluğundan daha yüksektir. 4PL modele göre WLE,3PL modele göre MAP yetenek kestirim modelinin standart hata değeri dahadüşüktür. En yüksek marjinal güvenirlik katsayı değeri 3PL model için MAP, 4PLmodel için WLE yöntemine göre gerçekleştirilen kestirimlerden hesaplanmıştır.Araştırma bulgularına dayalı olarak 4 PL model altında WLE kestirim yönteminegöre gerçekleştirilen yetenek puanlarının doğruluğunun yüksek olduğu sonucunaulaşılmıştır.</p></trans-abstract>
                                                                                                                                    <abstract><p>This research analyzed the two-category ItemResponse Theory (IRT) models as part of different ability estimation methods.The research was carried out in consideration of responses to 20 items underthe Mathematics subtest of TEOG (National Transition from Primary to SecondaryEducation) exam by the 8th-grade students in 2015-2016. The study groupconsisted of 400 students who were randomly selected from the studentsparticipated in the TEOG exam. Ability estimations and standard error valuesfor these estimations were calculated based on the data. These estimations werecompared by two-way analysis of variance (ANOVA) for repeated measurementsAccording to the research findings; it was revealed that the four-parameterlogistic (4PL) item model fit better. In terms of ability estimation methods,the accuracy of Weighted Likelihood Estimation (WLE) was higher than Maximum APosteriori (MAP) and Expected A Posteriori (EAP). WLE and MAP ability estimationmodel gave lower standard error values compared to the 4PL and 3PL model,respectively. The highest marginal reliability coefficient value for the 3PLmodel was calculated using estimations made according to MAP while estimationsmade according to WLE were used for the 4PL model. According to the researchfindings, it was concluded that the accuracy of ability scores obtained by theWLE estimation method under the 4PL model was higher</p></abstract>
                                                            
            
                                                                                        <kwd-group>
                                                    <kwd>ability estimation methods</kwd>
                                                    <kwd>  item response theory</kwd>
                                                    <kwd>  3 PLM</kwd>
                                                    <kwd>  4PLM</kwd>
                                            </kwd-group>
                            
                                                <kwd-group xml:lang="tr">
                                                    <kwd>yetenek kestirim yöntemleri</kwd>
                                                    <kwd>  madde tepki kuramı</kwd>
                                                    <kwd>  3PLM</kwd>
                                                    <kwd>  4PLM</kwd>
                                            </kwd-group>
                                                                                                                                        </article-meta>
    </front>
    <back>
                            <ref-list>
                                    <ref id="ref1">
                        <label>1</label>
                        <mixed-citation publication-type="journal">Baker, F. B. (1992). Item Response Theory: Parameter Estimation Technique. New York: Marcel Dekker.</mixed-citation>
                    </ref>
                                    <ref id="ref2">
                        <label>2</label>
                        <mixed-citation publication-type="journal">Bar-Hillel, M., Budescu, D., &amp; Attali, Y. (2005). Scoring and keying multiple choice tests: A case study in irrationality. Mind &amp; Society, 4, 3-12. http://doi.org/cp7ddc</mixed-citation>
                    </ref>
                                    <ref id="ref3">
                        <label>3</label>
                        <mixed-citation publication-type="journal">Barton, M. A., &amp; Lord, F. M. (1981). An upper asymptote for the three-parameter logistic item-response model. Research Bulletin, 81-20. Princeton, NJ: Educational Testing Service.</mixed-citation>
                    </ref>
                                    <ref id="ref4">
                        <label>4</label>
                        <mixed-citation publication-type="journal">Baykul, Y. (1979). Örtük özellikler ve klasik test kuramları üzerine bir karşılaştırma (Unpublished Doctoral thesis). Hacettepe University, Graduate School of Social Sciences, Ankara.</mixed-citation>
                    </ref>
                                    <ref id="ref5">
                        <label>5</label>
                        <mixed-citation publication-type="journal">Berberoğlu, G. (1988). Seçme amacıyla kullanılan testlerde Rasch modelinin katkıları (Unpublished Doctoral thesis). Hacettepe University, Graduate School of Social Sciences, Ankara.</mixed-citation>
                    </ref>
                                    <ref id="ref6">
                        <label>6</label>
                        <mixed-citation publication-type="journal">Birnbaum, A. (1968). Some latent trait models and their use in inferring an examinee’s ability. F. M. Lord &amp; M. R. Novick(Ed), Statistical theories of mental test scores içinde (pp. 397-472). Reading MA: Addison-Wesley.</mixed-citation>
                    </ref>
                                    <ref id="ref7">
                        <label>7</label>
                        <mixed-citation publication-type="journal">Borgatto, A. F., Azevedo, C. L. N., Pinheiro, A., &amp; Andrade, D. F. (2015). Comparison of ability estimation methods using irt for test with different degrees of difficulty. Communications in Statistics-Simulation and Computation, 44(2), 474-488.</mixed-citation>
                    </ref>
                                    <ref id="ref8">
                        <label>8</label>
                        <mixed-citation publication-type="journal">Ching-Fung, B. S. (2002). Ability estimation under different item parametrization and scoring models (Unpublished Doctoral thesis). North Teksas University, Teksas.</mixed-citation>
                    </ref>
                                    <ref id="ref9">
                        <label>9</label>
                        <mixed-citation publication-type="journal">Can, S. (2003). The analyses of secondary education institutions student selection and placement test’s verbal section with respect to item response theory models (Unpublished Master&#039;s thesis). Middle East Technical University, Graduate School of Social Sciences, Ankara.</mixed-citation>
                    </ref>
                                    <ref id="ref10">
                        <label>10</label>
                        <mixed-citation publication-type="journal">Chalmers R. P. (2013). mirt: Multidimensional Item Response Theory. R package version 0.9.0, [Çevirim içi: http://CRAN.R-project.org/package=mirt].</mixed-citation>
                    </ref>
                                    <ref id="ref11">
                        <label>11</label>
                        <mixed-citation publication-type="journal">Cole, D. A. (1987). Utility of confirmatory factor analysis in test validation research. Journal of Consulting and Clinical Psychology, 55, 584-594.</mixed-citation>
                    </ref>
                                    <ref id="ref12">
                        <label>12</label>
                        <mixed-citation publication-type="journal">Çelik, D. (2001). The Fit of one, two and three-parameter models of item response theory (IRT) to the ministry of National Education secondary school institutions student selection and placement test data (Unpublished Master’s thesis). Middle East Technical University, Graduate School of Social Sciences, Ankara.</mixed-citation>
                    </ref>
                                    <ref id="ref13">
                        <label>13</label>
                        <mixed-citation publication-type="journal">Çetin, B. ve Çelikten, S. (2016). Nominal response model altında yetenek kestirim yöntemlerinin karşılaştırılması. International Engineering, Science and Education Conference, 01-03 December 2016, Diyarbakır.</mixed-citation>
                    </ref>
                                    <ref id="ref14">
                        <label>14</label>
                        <mixed-citation publication-type="journal">DeMars, C. (2010). Item response theory. New York: Oxford University Press.</mixed-citation>
                    </ref>
                                    <ref id="ref15">
                        <label>15</label>
                        <mixed-citation publication-type="journal">De Ayala, R. J. (2009). The Theory and Practice of Item Response Theory. U. S. A. Erdemir, A. (2015). Bir, iki, üç ve dört parametreli lojistik madde tepki kuramı modellerinin karşılaştırılması (Comparison of 1PL, 2PL, 3PL and 4PL item response theory models) (Unpublished Master&#039;s thesis). Gazi University, Graduate School of Educational Sciences, Ankara.</mixed-citation>
                    </ref>
                                    <ref id="ref16">
                        <label>16</label>
                        <mixed-citation publication-type="journal">Finch, W. Holmes, &amp; French, Brian F. (2012). Parameter Estimation with Mixture Item Response Theory Models: A Monte Carlo Comparison of Maximum Likelihood and Bayesian Methods. Journal of Modern Applied Statistical Methods, 11(1), Article 14. DOI: 10.22237/jmasm/1335845580.</mixed-citation>
                    </ref>
                                    <ref id="ref17">
                        <label>17</label>
                        <mixed-citation publication-type="journal">Gardner-Medwin, A. R., &amp; Gahan, M. (2003). Formative and summative confidence-based assessment. In J. Christie (Ed.), Proceedings of the 7th International Computer-Aided Assessment Conference (pp.147-155). Loughborough, UK: Loughborough University.</mixed-citation>
                    </ref>
                                    <ref id="ref18">
                        <label>18</label>
                        <mixed-citation publication-type="journal">Hambleton, R. K., &amp; Swaminathan, H. (1985). Item Response Theory: Principles and Applications. Boston: Kluwer Nijhoff.</mixed-citation>
                    </ref>
                                    <ref id="ref19">
                        <label>19</label>
                        <mixed-citation publication-type="journal">Hockemeyer, C. (2002). A comparison of non-deterministic procedures for the adaptive assessment of knowledge. Psychologische Beiträge, 44, 495-503.</mixed-citation>
                    </ref>
                                    <ref id="ref20">
                        <label>20</label>
                        <mixed-citation publication-type="journal">Kılıç, İ. (1999). The fit of one- two- and three- parameter models of item response theory to the student selection test of the student selection and placement center (Unpublished Doctoral thesis). Middle East Technical University, Graduate School of Social Sciences, Ankara.</mixed-citation>
                    </ref>
                                    <ref id="ref21">
                        <label>21</label>
                        <mixed-citation publication-type="journal">Kline, R. B. (2005). Principles and practices of structural equation modeling. New York: The Guildord.</mixed-citation>
                    </ref>
                                    <ref id="ref22">
                        <label>22</label>
                        <mixed-citation publication-type="journal">Liao, W., Ho, R., &amp; Yen, Y. (2012). The Four-Parameter Logistic Item Response Theory Model as a Robust Method of Estimating Ability Despite Aberrant Responses. Social Behavior and Personality, 40(10), 1679-1694.</mixed-citation>
                    </ref>
                                    <ref id="ref23">
                        <label>23</label>
                        <mixed-citation publication-type="journal">Loken, E., &amp; Rulison, K. L. (2010). Estimation of a four-parameter item response theory model. British Journal of Mathematical and Statistical Psychology, 63, 509-525.</mixed-citation>
                    </ref>
                                    <ref id="ref24">
                        <label>24</label>
                        <mixed-citation publication-type="journal">Önder, İ. (2007). An investigation of goodness of model data fit. Hacettepe University Journal of Education, 32, 210-220.</mixed-citation>
                    </ref>
                                    <ref id="ref25">
                        <label>25</label>
                        <mixed-citation publication-type="journal">Reise, S. P., &amp; Waller, N. G. (2003). How many IRT parameters does it take to model psychopathology items? Psychological Methods 8(2), 164-184.</mixed-citation>
                    </ref>
                                    <ref id="ref26">
                        <label>26</label>
                        <mixed-citation publication-type="journal">Reynolds, T., Perkins, K., &amp; Brutten, S. 1994. A comparative item analysis study of a language testing instrument. Language Testing, 11, 1-14.</mixed-citation>
                    </ref>
                                    <ref id="ref27">
                        <label>27</label>
                        <mixed-citation publication-type="journal">Rose, N. (2010). Maximum Likelihood and Bayes Modal Ability Estimation in Two-Parametric IRT Models: Derivations and Implementation.</mixed-citation>
                    </ref>
                                    <ref id="ref28">
                        <label>28</label>
                        <mixed-citation publication-type="journal">Rulison, K., &amp; Loken, E. (2009). I’ve fallen and I can’t get up: Can high-ability students recover from early mistakes in CAT? Applied Psychological Measurement, 33, 83-101. http://doi.org/dtqjq8</mixed-citation>
                    </ref>
                                    <ref id="ref29">
                        <label>29</label>
                        <mixed-citation publication-type="journal">Samejima, E. (1969). Estimation of latent ability using a response pattern of graded scores. Psychometrika Monograph, 17.</mixed-citation>
                    </ref>
                                    <ref id="ref30">
                        <label>30</label>
                        <mixed-citation publication-type="journal">Seong, T. J., Kim, S. H., &amp; Cohen, A. S. (1997). A comparison of procedures for ability estimation under the graded response model. Paper presented at the Annual Meeting of the American Educational Research Association, Chicago.</mixed-citation>
                    </ref>
                                    <ref id="ref31">
                        <label>31</label>
                        <mixed-citation publication-type="journal">Taşdelen Teker, G., Kelecioğlu, H. ve Eroğlu, M. G. (2013). An investigation of goodness of model data fit. 4. International Conference on New Horizons in Education, June, 25-27, 2013, Roma, Italia.</mixed-citation>
                    </ref>
                                    <ref id="ref32">
                        <label>32</label>
                        <mixed-citation publication-type="journal">Wainer, H., &amp; Thissen, D. (1987). Estimating ability with the wrong model. Journal of Educational Statistics, 12, 339-368.</mixed-citation>
                    </ref>
                                    <ref id="ref33">
                        <label>33</label>
                        <mixed-citation publication-type="journal">Wang, T., &amp; Vispoel, W. P. (1998). Properties of ability estimation methods in computerized adaptive testing. Journal of Education Measurement, 35, 109-135.</mixed-citation>
                    </ref>
                                    <ref id="ref34">
                        <label>34</label>
                        <mixed-citation publication-type="journal">Wang, S., &amp; Wang, T. (2001). Precision of Warm’s weighted likelihood estimates for a polytomous model in computarized adaptive testing. Applied Psychological Measurement, 25, 317-331.</mixed-citation>
                    </ref>
                                    <ref id="ref35">
                        <label>35</label>
                        <mixed-citation publication-type="journal">Warm, T. A. (1989).Weighted likelihood estimation of ability in item response theory. Psychometrika 54, 427-450.</mixed-citation>
                    </ref>
                                    <ref id="ref36">
                        <label>36</label>
                        <mixed-citation publication-type="journal">Yalçın, S. (2018). Data Fit Comparison of Mixture Item Response Theory Models and Traditional Models. International Journal of Assessment Tools in Education, 5(2), 301-313 DOI:10.21449/ijate.402806.</mixed-citation>
                    </ref>
                                    <ref id="ref37">
                        <label>37</label>
                        <mixed-citation publication-type="journal">Yapar, T. (2003). A study of the predictive validity of the Başkent a study of the predictive validity of the Başkent University English proficiency exam through the use of the two-parameter irt model’s abiliıty estimates (Unpublished Master’s thesis). Middle East Technical University, Graduate School of Social Sciences, Ankara.</mixed-citation>
                    </ref>
                                    <ref id="ref38">
                        <label>38</label>
                        <mixed-citation publication-type="journal">Yeğin, O. P. (2003). The predictive validity of Başkent University proficiency exam (buepe) through the use of the three-parameter irt model’s ability estimates (Unpublished Master’s thesis). Middle East Technical University, Graduate School of Social Sciences, Ankara.</mixed-citation>
                    </ref>
                                    <ref id="ref39">
                        <label>39</label>
                        <mixed-citation publication-type="journal">Yen, Y.-C., Ho, R.-G., Chen, L.-J., Chou, K.-Y., &amp; Chen, Y.-L. (2010). Development and evaluation of a confidence-weighting computerized adaptive testing. Educational Technology &amp; Society, 13, 163-176.</mixed-citation>
                    </ref>
                                    <ref id="ref40">
                        <label>40</label>
                        <mixed-citation publication-type="journal">Yen, Y., Ho, R., Liao, W., &amp; Chen, L. (2012). Reducing the Impact of Inappropriate Items on Reviewable Computerized Adaptive Testing. Educational Technology &amp; Society, 15, 231–243.</mixed-citation>
                    </ref>
                                    <ref id="ref41">
                        <label>41</label>
                        <mixed-citation publication-type="journal">Zwinderman, A. H., &amp; van den Wollenberg, A. L. (1990). Robustness of marginal maximum likelihood estimation in the rasch model. Applied Psychological Measurement, 14(1), 73–81.</mixed-citation>
                    </ref>
                            </ref-list>
                    </back>
    </article>
