Research Article

Educational data mining: A tutorial for the rattle package in R

Volume: 6 Number: 5 December 30, 2019
EN TR

Educational data mining: A tutorial for the rattle package in R

Abstract

Educational data mining (EDM) has been a rapidly growing research field over the last decade and enabled researchers to discover patterns and trends in education with more sophisticated methods. EDM offers promising solutions to complex educational problems. Given the rapid increase in the availability of big data in education and software programs to analyze big data, the demand for user-friendly, free software programs to implement EDM methods also continues to increase. The R programming language has become a popular environment for data mining due to its availability and flexibility. The rattle package in R contains a set of functions to implement data mining with a graphical user interface. This study demonstrates three widely used data mining algorithms (classification and regression tree, random forest, and support vector machine) in EDM using real data from the 2015 administration of the Programme for International Student Assessment (PISA). First, a brief introduction to EDM is provided along with the description of the selected data mining algorithms. Then, how to perform data mining analysis using the rattle’s graphical user interface is demonstrated. The study concludes by comparing the results of the selected data mining algorithms and highlighting how those algorithms can be utilized in the context of educational research.

Keywords

References

  1. Agarwal, S., Pandey, G. N., & Tiwari, M. D. (2012). Data mining in education: Data classification and decision tree approach. International Journal of e-Education, e-Business, e-Management and e-Learning, 2(2), 140.
  2. Aldowah, H., Al-Samarraie, H., & Fauzy, W. M. (2019). Educational Data Mining and Learning Analytics for 21st century higher education: A Review and Synthesis. Telematics and Informatics, 37, 13-49.
  3. Aulck, L., Velagapudi, N., Blumenstock, J., & West, J. (2016). Predicting student dropout in higher education. arXiv preprint arXiv:1606.06364.
  4. Baker, R. S., Martin, T., & Rossi, L. M. (2017). Educational data mining and learning analytics. In A. A. Rupp & J. P. Leighton (Eds.), The handbook of cognition and assessment: Frameworks, methodologies, and applications (pp. 379-396). Oxford, UK: John Wiley & Sons, Inc.
  5. Berland, M., Baker, R. S., & Blikstein, P. (2014). Educational data mining and learning analytics: Applications to constructionist research. Technology, Knowledge and Learning, 19(1-2), 205-220.
  6. Breiman, L. (2001). Random forest. Machine Learning, 45(1), 5–32.
  7. Breiman, L., Friedman, J. H., Olshen, R. A., & Stone, C. J. (1984). Classification and regression trees. Belmont, CA: Wadsworth.
  8. Cortes, C., & Vapnik, V. (1995). Support-vector networks. Machine learning, 20(3), 273-297.

Details

Primary Language

English

Subjects

Studies on Education

Journal Section

Research Article

Publication Date

December 30, 2019

Submission Date

October 1, 2019

Acceptance Date

December 5, 2019

Published in Issue

Year 2019 Volume: 6 Number: 5

APA
Bulut, O., & Yavuz, H. C. (2019). Educational data mining: A tutorial for the rattle package in R. International Journal of Assessment Tools in Education, 6(5), 20-36. https://doi.org/10.21449/ijate.627361

Cited By

23823             23825             23824