Predicting Student Academic Performance Using Machine Learning: A Case Study On Educational Data From Bangladesh Towards Sustainable Education (SDG 4)
Öz
Educational sustainability is a global priority. Because of the rapid digital transformation that we are seeing, it is necessary to integrate modern technologies to achieve inclusive, equitable, and high-quality education. Despite this, there is still a limitation in applying Machine Learning (ML), as one of the modern technologies, to educational data within institutions. This research addresses this gap by providing practical evidence that predicting academic performance via ML is both achievable and highly effective. Two main objectives were generated from the main goal. The first one is to compare four ML models, while the second is to determine the most important factors affecting academic performance. To achieve the previous goals, the study used data from more than 8,000 secondary school students from Bangladesh. Based on the given dataset, students’ academic performance was represented by the average grades of five common courses, where the study tried to predict it without relying on the existence of previous grades. The study depends on the contextual, behavioral, and other variables that are available in that dataset. Three of the models used were shallow: Multiple Linear Regression (MLR), Decision Tree (DT), Random Forest (RF), while one was deep: Feedforward Deep Neural Network (DNN). The whole process was implemented in the R environment, and the results revealed some interesting points. The shallow RF model was more accurate than the deep one with a small margin (R² = 93.3 vs. R² = 93.0). The most important factor affecting student performance is the student group (arts, commerce, or science), which dominated the predictive power of the models. Study time and attendance can also not be ignored as important behavioral modifiable factors. In addition to the results presented, this study contributes to educational sustainability by developing two Early Warning Systems (EWS): a simplified system for quick group screening and a full EWS for individual predictions. The results and the contributions of this study will help to find the most at-risk subgroups, particularly in the arts track, and allocate resources to provide proactive support based on modifiable behaviors or other factors. Ultimately, this research provides a scalable framework to enhance equity and efficiency in education. It is aligned with UN Sustainable Development Goal 4 (SDG 4): Quality Education.
Anahtar Kelimeler
Destekleyen Kurum
Etik Beyan
Teşekkür
Kaynakça
- Lewis N D. Machine learning made easy with R: An intuitive step-by-step blueprint for beginners. 1st ed. AusCov; 2017
- Huang, J., Zhong, Y., Chen, X., 2025. Adaptive and personalized learning in STEM education using high-performance computing and artificial intelligence. Journal of Supercomputing, 81(1), 981–1004. https://doi.org/10.1007/s11227-025-07481-7
- Romero, C., Ventura, S., 2020. Educational data mining and learning analytics: An updated survey. WIREs Data Mining and Knowledge Discovery, 10(3), e1355. https://doi.org/10.1002/widm.1355
- Ersozlu, Z., Taheri, S., Koch, I., 2024. A review of machine learning methods used for educational data. Education and Information Technologies, 29, 22125–22145. https://doi.org/10.1007/s10639-024-12704-0
- Albreiki, B., Zaki, N., Alashwal, H., 2021. A systematic literature review of students’ performance prediction using machine learning techniques. Education Sciences, 11(9), 552. https://doi.org/10.3390/educsci11090552
- Ouhaddou, C., Retbi, A., Bennani, S., 2025. Predicting student academic path using machine learning: Systematic review. 2025 5th International Conference on Innovative Research in Applied Science, Engineering and Technology (IRASET), IEEE, 1–8. https://doi.org/10.1109/IRASET64571.2025.11008352
- Cortez, P., Silva, A., 2008. Using data mining to predict secondary school student performance. Proceedings of the 5th Future Business Technology Conference (FUBUTEC), Porto, Portugal, 5–12
- Gull, H., Saqib, M., Iqbal, S. Z., Saeed, S., 2020. Improving learning experience of students by early prediction of student performance using machine learning. 2020 IEEE International Conference for Innovation in Technology (INOCON), IEEE, 1–4. https://doi.org/10.1109/INOCON50539.2020.9298266
Ayrıntılar
Birincil Dil
İngilizce
Konular
Yazılım Mühendisliği (Diğer), İstatistik (Diğer)
Bölüm
Araştırma Makalesi
Yayımlanma Tarihi
26 Mart 2026
Gönderilme Tarihi
3 Şubat 2026
Kabul Tarihi
17 Mart 2026
Yayımlandığı Sayı
Yıl 2026 Cilt: 4 Sayı: 1