Research Article

Educational Data Mining: Construction of a Tree-based Model to Predict Students’ Performance

Volume: 14 Number: 1 January 29, 2025
EN TR

Educational Data Mining: Construction of a Tree-based Model to Predict Students’ Performance

Abstract

Educational data mining is a research field that probes undercover patterns in educational data. In this paper, machine learning algorithms have been applied to the dataset that consists of major features so as to predict students’ final grade performances. Thus, the most significant features and the highest-performance machine learning algorithm have been also tried to be detected. To this end, univariate feature selection, tree-based feature selection, and L1-based feature selection methods have been used for the feature selection process. Classification and regression trees, k-nearest neighbors, naive Bayes, random forest, and support vector machines have been employed to build the learning models. The L1-based feature selection and classification and regression trees have delivered the best performance for the feature selection and the model creation processes, respectively. The experimental results demonstrate that the proposed model reached a classification accuracy of 0.7700 and an F1-score of 0.7888 on average. The L1-based feature selection method has selected only 4 features: these are scholarship type, total salary, transportation to the university, and cumulative grade point average in the last semester. In consequence, there exist lots of indicators that impact students' academic successes, the success or failure that emerges after the measurement process can be estimated by regarding these features in advance. Such a task will enable the relationship mechanism between the educational inputs and outputs to be understandable and eliminate shortcomings concerning the education process.

Keywords

References

  1. Acar, E. (2022). Comparison of the Performances of OECD Countries in the Perspective of Socio-Economic Global Indices: CRITIC-Based Cocoso Method. Dumlupınar Üniversitesi Sosyal Bilimler Dergisi, 73, 256–277. https://doi.org/10.51290/dpusbe.1122650
  2. Akdamar, E., & Kızılkaya, Y. M. (2022). Üniversite Öğrencilerinin Akademik Erteleme Eğilimleri ile Umutsuzluk Seviyeleri ve Akademik Başarıları Arasındaki İlişkinin İncelenmesi. Kahramanmaraş Sütçü İmam Üniversitesi Sosyal Bilimler Dergisi, 19(1), 212–221. https://doi.org/10.33437/ksusbd.844605
  3. Aslanargun, E., Bozkurt, S., & Sarıoğlu, S. (2016). Sosyo Ekonomik Değişkenlerin Öğrencilerin Akademik Başarısı Üzerine Etkileri. Uşak Üniversitesi Sosyal Bilimler Dergisi, 9(27/3), 201–234.
  4. Aziz, Y., & Memon, K. H. (2023). Fast geometrical extraction of nearest neighbors from multi-dimensional data. Pattern Recognition, 136, 109183. https://doi.org/10.1016/j.patcog.2022.109183
  5. Baker, Ryan S. (2014). Educational Data Mining: An Advance for Intelligent Systems in Education. IEEE Intelligent Systems, 29(3), 78–82. https://doi.org/10.1109/MIS.2014.42
  6. Baker, Ryan Shaun, & Inventado, P. S. (2014). Educational Data Mining and Learning Analytics. In Learning Analytics (pp. 61–75). Springer New York. https://doi.org/10.1007/978-1-4614-3305-7_4
  7. Baudat, G., & Anouar, F. (2000). Generalized Discriminant Analysis Using a Kernel Approach. Neural Computation, 12(10), 2385–2404. https://doi.org/10.1162/089976600300014980
  8. Bermingham, M. L., Pong-Wong, R., Spiliopoulou, A., Hayward, C., Rudan, I., Campbell, H., Wright, A. F., Wilson, J. F., Agakov, F., Navarro, P., & Haley, C. S. (2015). Application of high-dimensional feature selection: evaluation for genomic prediction in man. Scientific Reports, 5(1), 10312. https://doi.org/10.1038/srep10312

Details

Primary Language

English

Subjects

Higher Education Studies (Other)

Journal Section

Research Article

Publication Date

January 29, 2025

Submission Date

November 13, 2023

Acceptance Date

October 13, 2024

Published in Issue

Year 2025 Volume: 14 Number: 1

APA
Aydın, F. (2025). Educational Data Mining: Construction of a Tree-based Model to Predict Students’ Performance. Bartın University Journal of Faculty of Education, 14(1), 181-195. https://doi.org/10.14686/buefad.1390209
AMA
1.Aydın F. Educational Data Mining: Construction of a Tree-based Model to Predict Students’ Performance. BUEFAD. 2025;14(1):181-195. doi:10.14686/buefad.1390209
Chicago
Aydın, Furkan. 2025. “Educational Data Mining: Construction of a Tree-Based Model to Predict Students’ Performance”. Bartın University Journal of Faculty of Education 14 (1): 181-95. https://doi.org/10.14686/buefad.1390209.
EndNote
Aydın F (January 1, 2025) Educational Data Mining: Construction of a Tree-based Model to Predict Students’ Performance. Bartın University Journal of Faculty of Education 14 1 181–195.
IEEE
[1]F. Aydın, “Educational Data Mining: Construction of a Tree-based Model to Predict Students’ Performance”, BUEFAD, vol. 14, no. 1, pp. 181–195, Jan. 2025, doi: 10.14686/buefad.1390209.
ISNAD
Aydın, Furkan. “Educational Data Mining: Construction of a Tree-Based Model to Predict Students’ Performance”. Bartın University Journal of Faculty of Education 14/1 (January 1, 2025): 181-195. https://doi.org/10.14686/buefad.1390209.
JAMA
1.Aydın F. Educational Data Mining: Construction of a Tree-based Model to Predict Students’ Performance. BUEFAD. 2025;14:181–195.
MLA
Aydın, Furkan. “Educational Data Mining: Construction of a Tree-Based Model to Predict Students’ Performance”. Bartın University Journal of Faculty of Education, vol. 14, no. 1, Jan. 2025, pp. 181-95, doi:10.14686/buefad.1390209.
Vancouver
1.Furkan Aydın. Educational Data Mining: Construction of a Tree-based Model to Predict Students’ Performance. BUEFAD. 2025 Jan. 1;14(1):181-95. doi:10.14686/buefad.1390209

All the articles published in the journal are open access and distributed under the conditions of CommonsAttribution-NonCommercial 4.0 International License 

88x31.png


Bartın University Journal of Faculty of Education