Predicting Student Success in Distance Education Utilizing Soft Matrix-Based Machine Learning via Moodle and Student Information Systems Data
Abstract
Machine learning has become an important tool for predicting student performance. This paper aims to create a dataset of the participation of some students who took Turkish Language, Atatürk’s Principles and History of Revolution, and English joint courses given via distance education at Istanbul Arel University to synchronous and asynchronous course activities for 14 weeks, and to predict the students’ success by employing fuzzy parameterized fuzzy soft k-nearest neighbor (FPFS-kNN) and the dataset. First, anonymized participation data from a 14-week lecture period is collected. Later, these data are processed to be used in machine learning. Two data sets are obtained from each raw dataset, whose class labels consist of two classes (pass/fail) and multi-class (letter grades). Then, FPFS-kNN and well-known/state-of-the-art machine learning algorithms are applied to the datasets. The performance results are compared using accuracy (Acc), precision (Pre), recall (Rec), macro F1-score (MacF1), and micro F1-score (MicF1) performance metrics. The results show that FPFS-kNN outperforms the other algorithms in binary pass–fail classification, achieving the highest accuracy with (ING1), (ATA1), and (TDE1), while maintaining competitive F1-scores (up to on TDE1). In the letter-grades datasets, performance decreased overall, with Boosted Tree reaching the best MicF1 ( on TDE2), yet FPFS-kNN still produced strong and stable results ( on TDE2, on ATA2). These findings indicate that FPFS-kNN is highly effective in binary classification and competitive in multi-class problems. Finally, a discussion of performance results and the use of machine learning in predicting student achievement is provided.Machine learning has become an important tool for predicting student performance. This paper aims to create a dataset of the participation of some students who took Turkish Language, Atatürk’s Principles and History of Revolution, and English joint courses given via distance education at Istanbul Arel University to synchronous and asynchronous course activities for 14 weeks, and to predict the students’ success by employing fuzzy parameterized fuzzy soft k-nearest neighbor (FPFS-kNN) and the dataset. First, anonymized participation data from a 14-week lecture period is collected. Later, these data are processed to be used in machine learning. Two data sets are obtained from each raw dataset, whose class labels consist of two classes (pass/fail) and multi-class (letter grades). Then, FPFS-kNN and well-known/state-of-the-art machine learning algorithms are applied to the datasets. The performance results are compared using accuracy (Acc), precision (Pre), recall (Rec), macro F1-score (MacF1), and micro F1-score (MicF1) performance metrics. The results show that FPFS-kNN outperforms the other algorithms in binary pass–fail classification, achieving the highest accuracy with (ING1), (ATA1), and (TDE1), while maintaining competitive F1-scores (up to on TDE1). In the letter-grades datasets, performance decreased overall, with Boosted Tree reaching the best MicF1 ( on TDE2), yet FPFS-kNN still produced strong and stable results ( on TDE2, on ATA2). These findings indicate that FPFS-kNN is highly effective in binary classification and competitive in multi-class problems. Finally, a discussion of performance results and the use of machine learning in predicting student achievement is provided.
Keywords
Ethical Statement
Thanks
References
- [1] İşman, A., “Distance Education”, Pegem Academy Publishing, Ankara, Türkiye, (2011).
- [2] Odabaş, H., “Internet–based distance education and departments of information and records management”, Turkish Librarianship, 17(1): 22–36, (2003).
- [3] Ergin, İ., and Akseki, B., “Student information system used in graduate education”, Journal of Research in Education and Teaching, 1(2): 364–380, (2012). DOI: https://jret.elapublishing.net/makale/6068
- [4] Gürkut, C., and Nat, M., “Important factors affecting student information system quality and satisfaction”, Eurasia Journal of Mathematics, Science and Technology Education, 14(3): 923–932, (2018). DOI: https://doi.org/10.12973/ejmste/81147
- [5] Mohri, M., Rostamizadeh, A., and Talwalkar, A., “Foundations of Machine Learning”, London: The MIT Press, (2018).
- [6] Moussawi, A., and Ibrahim, P., “Using machine learning to enhance ‘students’ assessment’ Moodle application”, Master’s Thesis, Arts, Sciences & Technology University, Beirut, (2020).
- [7] Kliksoft, “Moodle Online Education”, https://www.kliksoft.net/moodle-lms/. Access date: 04.04.2023.
- [8] Moodle, “The Moodle Story-Moodle-Online Education for Everyone”, https://moodle.com/about/the-moodle-story/. Access date: 24.04.2023.
Details
Primary Language
English
Subjects
Machine Learning (Other)
Journal Section
Research Article
Early Pub Date
April 10, 2026
Publication Date
June 1, 2026
Submission Date
December 7, 2024
Acceptance Date
February 24, 2026
Published in Issue
Year 2026 Volume: 39 Number: 2