<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.4 20241031//EN"
        "https://jats.nlm.nih.gov/publishing/1.4/JATS-journalpublishing1-4.dtd">
<article  article-type="research-article"        dtd-version="1.4">
            <front>

                <journal-meta>
                                                                <journal-id>jmeep</journal-id>
            <journal-title-group>
                                                                                    <journal-title>Journal of Measurement and Evaluation in Education and Psychology</journal-title>
            </journal-title-group>
                                        <issn pub-type="epub">1309-6575</issn>
                                                                                            <publisher>
                    <publisher-name>Association for Measurement and Evaluation in Education and Psychology</publisher-name>
                </publisher>
                    </journal-meta>
                <article-meta>
                                        <article-id/>
                                                                <article-categories>
                                            <subj-group  xml:lang="en">
                                                            <subject>Statistical Analysis Methods</subject>
                                                            <subject>Modelling</subject>
                                                    </subj-group>
                                            <subj-group  xml:lang="tr">
                                                            <subject>İstatistiksel Analiz Teknikleri</subject>
                                                            <subject>Modelleme</subject>
                                                    </subj-group>
                                    </article-categories>
                                                                                                                                                        <title-group>
                                                                                                                                                            <article-title>An Empirical Demonstration of Selecting Predictors for Multilevel Models</article-title>
                                                                                                    </title-group>
            
                                                    <contrib-group content-type="authors">
                                                                        <contrib contrib-type="author">
                                                                    <contrib-id contrib-id-type="orcid">
                                        https://orcid.org/0000-0001-9796-6749</contrib-id>
                                                                <name>
                                    <surname>Mumba</surname>
                                    <given-names>Brian</given-names>
                                </name>
                                                                    <aff>EGE UNIVERSITY</aff>
                                                            </contrib>
                                                    <contrib contrib-type="author">
                                                                    <contrib-id contrib-id-type="orcid">
                                        https://orcid.org/0000-0003-4462-1784</contrib-id>
                                                                <name>
                                    <surname>Aydın</surname>
                                    <given-names>Burak</given-names>
                                </name>
                                                                    <aff>EGE UNIVERSITY</aff>
                                                            </contrib>
                                                                                </contrib-group>
                        
                                        <pub-date pub-type="pub" iso-8601-date="20260401">
                    <day>04</day>
                    <month>01</month>
                    <year>2026</year>
                </pub-date>
                                        <volume>17</volume>
                                        <issue>1</issue>
                                        <fpage>42</fpage>
                                        <lpage>62</lpage>
                        
                        <history>
                                    <date date-type="received" iso-8601-date="20251030">
                        <day>10</day>
                        <month>30</month>
                        <year>2025</year>
                    </date>
                                                    <date date-type="accepted" iso-8601-date="20260330">
                        <day>03</day>
                        <month>30</month>
                        <year>2026</year>
                    </date>
                            </history>
                                        <permissions>
                    <copyright-statement>Copyright © 2010, Journal of Measurement and Evaluation in Education and Psychology</copyright-statement>
                    <copyright-year>2010</copyright-year>
                    <copyright-holder>Journal of Measurement and Evaluation in Education and Psychology</copyright-holder>
                </permissions>
            
                                                                                                                        <abstract><p>This paper presents a comparative demonstration of variable selection based on the Akaike Information Criterion (AIC), the Bayesian Information Criterion (BIC), and deviance for identifying the best-fitting two-level model using Early Grade Mathematics Assessment (EGMA) data collected in Zambia. A model that included all available predictor variables as fixed effects with random intercepts was run in R using the lme4 package, followed by an input model for comparison using a custom R function introduced by Nimon (2018). The analysis generated 108 models: 99 valid and nine invalid. The study determined a final model as the best fitting based on the principle of parsimony. The final model revealed that fixed effects of students&#039; group-mean-centered reading ability, home reading status, gender, school reading average score, and number of pupils, and the random effect of group-mean-centered reading ability predicted the early-grade mathematics ability. The results from the retained model, compared with the null model, showed substantial improvements in model fit indices with a pseudo-R² value of 0.309. Overall, this study provides an empirical demonstration of selecting predictors among numerous variables for multilevel models, a crucial practical issue that is common in educational research due to the increasing availability of large datasets.</p></abstract>
                                                            
            
                                                                                        <kwd-group>
                                                    <kwd>predictor selection strategies</kwd>
                                                    <kwd>  multilevel modelling</kwd>
                                                    <kwd>  AIC</kwd>
                                                    <kwd>  BIC</kwd>
                                                    <kwd>  EGMA</kwd>
                                            </kwd-group>
                            
                                                                                                                                                    </article-meta>
    </front>
    <back>
                            <ref-list>
                                    <ref id="ref1">
                        <label>1</label>
                        <mixed-citation publication-type="journal">Aho, K., Derryberry, D., &amp; Peterson, T. (2014). Model selection for ecologists: The worldviews of AIC and BIC. Ecology, 95(3), 631–636. https://doi.org/10.1890/13-1452.1</mixed-citation>
                    </ref>
                                    <ref id="ref2">
                        <label>2</label>
                        <mixed-citation publication-type="journal">Akaike, H. (1974). A new look at the statistical model identification. IEEE Transactions on Automatic Control, 19(6), 716–723. https://doi.org/10.1109/TAC.1974.1100705</mixed-citation>
                    </ref>
                                    <ref id="ref3">
                        <label>3</label>
                        <mixed-citation publication-type="journal">Aydın, B. (2016). Çok düzeyli modeller: Sürekli değişken ile iki düzeyli model örneği ve R programı ile analizi [Multilevel models: Example with continuous variable and two-level model analyzed in R program]. Ege Eğitim Dergisi, 17(2), 567–596. https://doi.org/10.12984/egeefd.280758</mixed-citation>
                    </ref>
                                    <ref id="ref4">
                        <label>4</label>
                        <mixed-citation publication-type="journal">Baskin, A. W. (2023). Campus-level teacher turnover in Texas public elementary schools: An examination of the impact of leadership factors and school demographics using hierarchical linear modeling [Doctoral dissertation, University of Texas]. University of Texas Repository.</mixed-citation>
                    </ref>
                                    <ref id="ref5">
                        <label>5</label>
                        <mixed-citation publication-type="journal">Bates, D. (2005). Fitting linear mixed models in R. R News, 5(1), 27–30.</mixed-citation>
                    </ref>
                                    <ref id="ref6">
                        <label>6</label>
                        <mixed-citation publication-type="journal">Bates, D., Mächler, M., Bolker, B., &amp; Walker, S. (2015). Fitting linear mixed-effects models using lme4. Journal of Statistical Software, 67(1), 1–48. https://doi.org/10.18637/jss.v067.i01</mixed-citation>
                    </ref>
                                    <ref id="ref7">
                        <label>7</label>
                        <mixed-citation publication-type="journal">Blatchford, P., Bassett, P., &amp; Brown, P. (2011). Examining the effect of class size on classroom engagement and teacher–pupil interaction: Differences in relation to pupil prior attainment and primary vs. secondary schools. Learning and Instruction, 21(6), 715–730. https://doi.org/10.1016/j.learninstruc.2011.04.001</mixed-citation>
                    </ref>
                                    <ref id="ref8">
                        <label>8</label>
                        <mixed-citation publication-type="journal">Burnham, K. P., &amp; Anderson, D. R. (2004). Multimodel inference: Understanding AIC and BIC in model selection. Sociological Methods &amp; Research, 33(2), 261–304. https://doi.org/10.1177/0049124104268644</mixed-citation>
                    </ref>
                                    <ref id="ref9">
                        <label>9</label>
                        <mixed-citation publication-type="journal">Buscemi, S., &amp; Plaia, A. (2020). Model selection in linear mixed-effect models. AStA Advances in statistical Analysis, 104(4), 529–575. https://doi.org/10.1007/s10182-019-00359-z</mixed-citation>
                    </ref>
                                    <ref id="ref10">
                        <label>10</label>
                        <mixed-citation publication-type="journal">Diego, V. P., Manusov, E. G., Mao, X., Curran, J. E., Göring, H., Almeida, M., Mahaney, M. C., Peralta, J. M., Blangero, J., &amp; Williams-Blangero, S. (2023). Genotype-by-socioeconomic status interaction influences heart disease risk scores and carotid artery thickness in Mexican Americans: The predominant role of education in comparison to household income and socioeconomic index. Frontiers in Genetics, 14, 1132110. https://doi.org/10.3389/fgene.2023.1132110</mixed-citation>
                    </ref>
                                    <ref id="ref11">
                        <label>11</label>
                        <mixed-citation publication-type="journal">Fokkema, M., Smits, N., Zeileis, A., Hothorn, T., &amp; Kelderman, H. (2017). Detecting treatment-subgroup interactions in clustered data with generalized linear mixed-effects model trees. Behavior Research Methods, 50, 2016–2034. https://doi.org/10.3758/s13428-017-0971x</mixed-citation>
                    </ref>
                                    <ref id="ref12">
                        <label>12</label>
                        <mixed-citation publication-type="journal">Gelman, A., &amp; Hill, J. (2007). Data analysis using regression and multilevel/hierarchical models. Cambridge University Press</mixed-citation>
                    </ref>
                                    <ref id="ref13">
                        <label>13</label>
                        <mixed-citation publication-type="journal">Gelman, A., Vehtari, A., Simpson, D., Margossian, C. C., Carpenter, B., Yao, Y., ... &amp; Modrák, M. (2020). Bayesian workflow. arXiv preprint arXiv:2011.01808. https://arxiv.org/abs/2011.01808</mixed-citation>
                    </ref>
                                    <ref id="ref14">
                        <label>14</label>
                        <mixed-citation publication-type="journal">Goldstein, H. (2011). Multilevel statistical models (4th ed.). John Wiley &amp; Sons.</mixed-citation>
                    </ref>
                                    <ref id="ref15">
                        <label>15</label>
                        <mixed-citation publication-type="journal">Gove, A., Habib, S., Piper, B., &amp; Ralaingita, W. (2013). Classroom-up policy change: Early reading and math assessments at work. Research in Comparative and International Education, 8(3), 373–386. https://doi.org/10.2304/rcie.2013.8.3.373</mixed-citation>
                    </ref>
                                    <ref id="ref16">
                        <label>16</label>
                        <mixed-citation publication-type="journal">Hamaker, E. L. (2023). The within-between dispute in cross-lagged panel research and how to move forward. Psychological Methods, 28(1), 1–15. https://doi.org/10.1037/met0000473</mixed-citation>
                    </ref>
                                    <ref id="ref17">
                        <label>17</label>
                        <mixed-citation publication-type="journal">Hamaker, E. L., Kuiper, R. M., &amp; Grasman, R. P. (2015). A critique of the cross-lagged panel model. Psychological Methods, 20(1), 102–116. https://doi.org/10.1037/a0038889</mixed-citation>
                    </ref>
                                    <ref id="ref18">
                        <label>18</label>
                        <mixed-citation publication-type="journal">Harrison, X. A., Donaldson, L., Correa-Cano, M. E., Evans, J., Fisher, D. N., Goodwin, C. E., Robinson, B. S., Hodgson, D. J., &amp; Inger, R. (2018). A brief introduction to mixed effects modelling and multimodel inference in ecology. PeerJ, 6, e4794. https://doi.org/10.7717/peerj.4794</mixed-citation>
                    </ref>
                                    <ref id="ref19">
                        <label>19</label>
                        <mixed-citation publication-type="journal">Johnson, J. B., &amp; Omland, K. S. (2004). Model selection in ecology and evolution. Trends in Ecology &amp; Evolution, 19(2), 101–108. https://doi.org/10.1016/j.tree.2003.10.013</mixed-citation>
                    </ref>
                                    <ref id="ref20">
                        <label>20</label>
                        <mixed-citation publication-type="journal">Kuha, J. (2004). AIC and BIC: Comparisons of assumptions and performance. Sociological Methods &amp; Research, 33(2), 188–229. https://doi.org/10.1177/0049124103262065</mixed-citation>
                    </ref>
                                    <ref id="ref21">
                        <label>21</label>
                        <mixed-citation publication-type="journal">Kuznetsova, A., Brockhoff, P. B., &amp; Christensen, R. H. B. (2017). lmerTest package: Tests in linear mixed effects models. Journal of Statistical Software, 82(13), 1–26. https://doi.org/10.18637/jss.v082.i13</mixed-citation>
                    </ref>
                                    <ref id="ref22">
                        <label>22</label>
                        <mixed-citation publication-type="journal">Lorah, J., &amp; Womack, A. (2019). Value of sample size for computation of the Bayesian information criterion (BIC) in multilevel modeling. Behavior Research Methods, 51(1), 440–450. https://doi.org/10.3758/s13428-018-1188-3</mixed-citation>
                    </ref>
                                    <ref id="ref23">
                        <label>23</label>
                        <mixed-citation publication-type="journal">Luke, D. A. (2020). Multilevel Modeling (2nd ed.). Sage Publications.</mixed-citation>
                    </ref>
                                    <ref id="ref24">
                        <label>24</label>
                        <mixed-citation publication-type="journal">McNeish, D. (2017). Small sample methods for multilevel modeling: A colloquial elucidation of REML and the Kenward-Roger correction. Multivariate Behavioral Research, 52(5), 661–670. https://doi.org/10.1080/00273171.2017.1344538</mixed-citation>
                    </ref>
                                    <ref id="ref25">
                        <label>25</label>
                        <mixed-citation publication-type="journal">McNeish, D., &amp; Stapleton, L. M. (2016). The effect of small sample size on two-level model estimates: A review and illustration. Educational Psychology Review, 28(2), 295–314. https://doi.org/10.1007/s10648-014-9287-x</mixed-citation>
                    </ref>
                                    <ref id="ref26">
                        <label>26</label>
                        <mixed-citation publication-type="journal">Nimon, K. (2018). apsl2lme: A model-selection diagnostic tool for hierarchical linear models. General Linear Model Journal, 44(1), 1–9. https://doi.org/10.31523/glmj.044002.002</mixed-citation>
                    </ref>
                                    <ref id="ref27">
                        <label>27</label>
                        <mixed-citation publication-type="journal">Ortega, A., &amp; Navarrete, G. (2017). Bayesian hypothesis testing: An alternative to null hypothesis significance testing (NHST) in psychology and social sciences. Frontiers in Psychology, 8, 608. https://doi.org/10.3389/fpsyg.2017.00608</mixed-citation>
                    </ref>
                                    <ref id="ref28">
                        <label>28</label>
                        <mixed-citation publication-type="journal">Pinheiro, J., Bates, D., DebRoy, S., Sarkar, D., &amp; R Core Team. (2023). nlme: Linear and nonlinear mixed effects models (R package version 3.1-163) [Computer software]. https://CRAN.R-project.org/package=nlme</mixed-citation>
                    </ref>
                                    <ref id="ref29">
                        <label>29</label>
                        <mixed-citation publication-type="journal">Platas, L. M., Ketterlin-Geller, L. R., &amp; Sitabkhan, Y. (2016). Using an assessment of early mathematical knowledge and skills to inform policy and practice: Examples from the Early Grade Mathematics Assessment. International Journal of Education in Mathematics, Science and Technology, 4(3), 163–173. https://doi.org/10.18404/ijemst.15920</mixed-citation>
                    </ref>
                                    <ref id="ref30">
                        <label>30</label>
                        <mixed-citation publication-type="journal">Preacher, K. J., &amp; Yaremych, H. E. (2023). Model selection in structural equation modeling. In R. H. Hoyle (Ed.), Handbook of structural equation modeling (2nd ed., pp. 206–222). Guilford Press.</mixed-citation>
                    </ref>
                                    <ref id="ref31">
                        <label>31</label>
                        <mixed-citation publication-type="journal">Raudenbush, S. W., &amp; Bryk, A. S. (2002). Hierarchical linear models: Applications and data analysis methods (2nd ed.). Sage</mixed-citation>
                    </ref>
                                    <ref id="ref32">
                        <label>32</label>
                        <mixed-citation publication-type="journal">R Core Team. (2023). R: A language and environment for statistical computing [Computer software]. R Foundation for Statistical Computing. https://www.R-project.org/</mixed-citation>
                    </ref>
                                    <ref id="ref33">
                        <label>33</label>
                        <mixed-citation publication-type="journal">RTI International. (2014). Early grade mathematics assessment (EGMA) toolkit (Rev. ed.). Research Triangle Institute. https://pdf.usaid.gov/pdf_docs/PA00JP65.pdf</mixed-citation>
                    </ref>
                                    <ref id="ref34">
                        <label>34</label>
                        <mixed-citation publication-type="journal">Schwarz, G. (1978). Estimating the dimension of a model. The Annals of Statistics, 6(2), 461–464. https://doi.org/10.1214/aos/1176344136</mixed-citation>
                    </ref>
                                    <ref id="ref35">
                        <label>35</label>
                        <mixed-citation publication-type="journal">Silhavy, R., Silhavy, P., &amp; Prokopova, Z. (2017). Analysis and selection of a regression model for the use case points method using a stepwise approach. Journal of Systems and Software, 125, 1–14. https://doi.org/10.1016/j.jss.2016.11.037</mixed-citation>
                    </ref>
                                    <ref id="ref36">
                        <label>36</label>
                        <mixed-citation publication-type="journal">Snijders, T. A. B., &amp; Bosker, R. J. (2012). Multilevel analysis: An introduction to basic and advanced multilevel modeling (2nd ed.). Sage.</mixed-citation>
                    </ref>
                                    <ref id="ref37">
                        <label>37</label>
                        <mixed-citation publication-type="journal">Stylianou, C., Pickles, A., &amp; Roberts, S. A. (2013). Using Bonferroni, BIC, and AIC to assess evidence for alternative biological pathways: Covariate selection for the multilevel Embryo-Uterus model. BMC Medical Research Methodology, 13(1), 73. https://doi.org/10.1186/1471-2288-13-73</mixed-citation>
                    </ref>
                                    <ref id="ref38">
                        <label>38</label>
                        <mixed-citation publication-type="journal">Sweller, J., Zhang, L., Ashman, G., Cobern, W., &amp; Kirschner, P. A. (2024). Response to De Jong et al.&#039;s (2023) paper &quot;Let us talk evidence—The case for combining inquiry-based and direct instruction.&quot; Educational Research Review, 42, 100584. https://doi.org/10.1016/j.edurev.2024.100584</mixed-citation>
                    </ref>
                                    <ref id="ref39">
                        <label>39</label>
                        <mixed-citation publication-type="journal">Vallejo, G., Tuero-Herrero, E., Núñez, J. C., &amp; Rosário, P. (2014). Performance evaluation of recent information criteria for selecting multilevel models in behavioral and social sciences. International Journal of Clinical and Health Psychology, 14(1), 48–57. https://doi.org/10.1016/s1697-2600(14)70036-5</mixed-citation>
                    </ref>
                                    <ref id="ref40">
                        <label>40</label>
                        <mixed-citation publication-type="journal">Vrieze, S. I. (2012). Model selection and psychological theory: A discussion of the differences between the Akaike information criterion (AIC) and the Bayesian information criterion (BIC). Psychological Methods, 17(2), 228–243. https://doi.org/10.1037/a0027127</mixed-citation>
                    </ref>
                                    <ref id="ref41">
                        <label>41</label>
                        <mixed-citation publication-type="journal">West, B. T., Welch, K. B., &amp; Galecki, A. T. (2022). Linear mixed models: A practical guide using statistical software (2nd ed.). Chapman and Hall/CRC.</mixed-citation>
                    </ref>
                            </ref-list>
                    </back>
    </article>
