Research Article

From Prediction to Explanation: Integrating Ensemble Machine Learning and Structural Equation Modeling in Reading Literacy

Volume: 17 Number: 2 July 2, 2026
TR EN

From Prediction to Explanation: Integrating Ensemble Machine Learning and Structural Equation Modeling in Reading Literacy

Abstract

This study aimed to (1) examine the performance and variable importance rankings of bagging, random forests, and gradient boosting algorithms in predicting reading comprehension skills in the PIRLS 2021 cycle, and (2) investigate the causal relationships of influential predictor variables identified by these algorithms using Structural Equation Modeling (SEM). Data from Finland, Portugal, and Türkiye, representing different reading achievement levels, were analyzed. Results showed that all algorithms exhibited similar and moderate predictive performance across countries. Variable importance rankings were consistent across algorithms within each country, and the key predictors were largely similar across countries. In Finland, the most influential predictors were students’ reading confidence, early literacy task performance before primary school, and home learning resources. In Portugal, reading confidence and home resources were most influential, whereas in Türkiye, home resources and reading confidence were dominant. Partial dependence analyses indicated that these variables had positive effects across all countries. Notably, reading confidence had a dominant effect in Finland and Portugal, which have higher reading achievement, while home resources were particularly influential in Türkiye, with comparatively lower reading achievement. SEM results supported these findings. The SEM models showed acceptable fit for Türkiye and Portugal, and good fit for Finland. According to the models, contextual and familial factors were more prominent in Türkiye, individual factors were influential in Portugal, and early literacy tasks had a noteworthy impact in Finland. These findings highlight the value of combining machine learning algorithms and SEM to investigate reading literacy. The integration of predictive modeling and causal analysis can provide a comprehensive understanding of reading literacy performance and its underlying factors across different educational contexts.

Keywords

Ethical Statement

Since secondary data were used in this study, ethical committee approval was not required.

References

  1. Austin, P. C., Lee, D. S., Steyerberg, E. W., & Tu, J. V. (2012). Regression trees for predicting mortality in patients with cardiovascular disease: What improvement is achieved by using ensemble‐based methods?. Biometrical journal, 54(5), 657-673. https://doi.org/10.1002/bimj.201100251
  2. Bandura, A. (1997). Self-efficacy: The exercise of control. New York: Freeman.
  3. Breiman, L. (1996). Bagging predictors. Machine Learning, 24, 123–140. https://doi.org/10.1007/BF00058655
  4. Breiman, L. (2001). Random forests. Machine Learning, 45, 5–32.https://doi.org/10.1023/A:1010933404324https://doi.org/10.1023/A:1010933404324
  5. Bronfenbrenner, U. (1979). The ecology of human development: Experiments by nature and design. Harvard University Press.
  6. Cho, E., Kim, E. H., Ju, U., & Lee, G. A. (2021). Motivational predictors of reading comprehension in middle school: Role of self-efficacy and growth mindsets. Reading and Writing: An Interdisciplinary Journal, 34(9), 2337–2355. https://doi.org/10.1007/s11145-021-10146-5
  7. Claes, R., Laga, J., Denies, K., Bleukx, N., Dockx, J., Van Keer, H., & Aesaert, K. (2024). The interplay between students’ home literacy environment, reading attitudes and comprehension: a serial mediation analysis using PIRLS 2021-data. Large-scale Assessments in Education, 12(1). https://doi.org/10.1186/s40536-024-00233-8
  8. Duncan, G. J., & Magnuson, K. (2013). Investing in preschool programs. Journal of economic perspectives, 27(2), 109-132. https://dio.org/10.1257/jep.27.2.109

Details

Primary Language

English

Subjects

Testing, Assessment and Psychometrics (Other)

Journal Section

Research Article

Publication Date

July 2, 2026

Submission Date

March 3, 2026

Acceptance Date

June 1, 2026

Published in Issue

Year 2026 Volume: 17 Number: 2

APA
Aydın, F. N., & Atalay Kabasakal, K. (2026). From Prediction to Explanation: Integrating Ensemble Machine Learning and Structural Equation Modeling in Reading Literacy. Journal of Measurement and Evaluation in Education and Psychology, 17(2), 98-114. https://doi.org/10.21031/epod.1901793
AMA
1.Aydın FN, Atalay Kabasakal K. From Prediction to Explanation: Integrating Ensemble Machine Learning and Structural Equation Modeling in Reading Literacy. JMEEP. 2026;17(2):98-114. doi:10.21031/epod.1901793
Chicago
Aydın, Fatma Nur, and Kübra Atalay Kabasakal. 2026. “From Prediction to Explanation: Integrating Ensemble Machine Learning and Structural Equation Modeling in Reading Literacy”. Journal of Measurement and Evaluation in Education and Psychology 17 (2): 98-114. https://doi.org/10.21031/epod.1901793.
EndNote
Aydın FN, Atalay Kabasakal K (July 1, 2026) From Prediction to Explanation: Integrating Ensemble Machine Learning and Structural Equation Modeling in Reading Literacy. Journal of Measurement and Evaluation in Education and Psychology 17 2 98–114.
IEEE
[1]F. N. Aydın and K. Atalay Kabasakal, “From Prediction to Explanation: Integrating Ensemble Machine Learning and Structural Equation Modeling in Reading Literacy”, JMEEP, vol. 17, no. 2, pp. 98–114, July 2026, doi: 10.21031/epod.1901793.
ISNAD
Aydın, Fatma Nur - Atalay Kabasakal, Kübra. “From Prediction to Explanation: Integrating Ensemble Machine Learning and Structural Equation Modeling in Reading Literacy”. Journal of Measurement and Evaluation in Education and Psychology 17/2 (July 1, 2026): 98-114. https://doi.org/10.21031/epod.1901793.
JAMA
1.Aydın FN, Atalay Kabasakal K. From Prediction to Explanation: Integrating Ensemble Machine Learning and Structural Equation Modeling in Reading Literacy. JMEEP. 2026;17:98–114.
MLA
Aydın, Fatma Nur, and Kübra Atalay Kabasakal. “From Prediction to Explanation: Integrating Ensemble Machine Learning and Structural Equation Modeling in Reading Literacy”. Journal of Measurement and Evaluation in Education and Psychology, vol. 17, no. 2, July 2026, pp. 98-114, doi:10.21031/epod.1901793.
Vancouver
1.Fatma Nur Aydın, Kübra Atalay Kabasakal. From Prediction to Explanation: Integrating Ensemble Machine Learning and Structural Equation Modeling in Reading Literacy. JMEEP. 2026 Jul. 1;17(2):98-114. doi:10.21031/epod.1901793