Comparison of Clustering Methods for Mixed Data: A Case Study on Hypothetical Student Scholarship Data
Abstract
Keywords
Clustering mixed data, K-Means, K-Prototypes, Latent Class Analysis, Factor Analysis with Mixed Data
References
- Ahmad, A., & Khan, S. S. (2019). Survey of state-of-the-art mixed data clustering algorithms. IEEE Access, 7, 31883-31902. https://doi.org/10.1109/ACCESS.2019.2903568
- Bektas, A., & Schumann, R. (2019, June). How to optimize Gower distance weights for the k-medoids clustering algorithm to obtain mobility profiles of the Swiss population. In 2019 6th Swiss Conference on Data Science (SDS) (pp. 51-56). IEEE. https://doi.org/10.1109/SDS.2019.000-8.
- Costa, E., Papatsouma, I., & Markos, A. (2023). Benchmarking distance-based partitioning methods for mixed-type data. Advances in Data Analysis and Classification, 17(3), 701-724. https://doi.org/10.1007/s11634-022-00521-7
- Dutt, A., Ismail, M. A., Herawan, T., & Targio, I. A. (2024). Partition-Based Clustering Algorithms Applied to Mixed Data for Educational Data Mining: A Survey From 1971 to 2024. IEEE Access 12, 172923- 172942. https://doi.org/10.1109/ACCESS.2024.3496929
- Hadzi-Pavlovic, D. (2010). Finding patterns and groupings: II. Introduction to latent profile analysis and finite mixture models. Acta Neuropsychiatrica, 22(1), 40-42.https://doi.org/10.1111/j.1601-5215.2009.00442.x
- Hair, J. F., Black, W. C., Babin, B. J., & Anderson, R. E. (2019). Multivariate data analysis (8th ed.). Cengage Learning.
- Hunt, L., & Jorgensen, M. (2011). Clustering mixed data. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 1(4), 352-361. https://doi.org/10.1002/widm.33
- Kim, B. (2017). A fast K-prototypes algorithm using partial distance computation. Symmetry, 9(4), 58-68. https://doi.org/10.3390/sym9040058
- MacQueen, J. (1967, January). Some methods for classification and analysis of multivariate observations. In Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, Volume 1: Statistics (Vol. 5, pp. 281-298). University of California press.
- Romero, C., & Ventura, S. (2020). Educational data mining and learning analytics: An updated survey. Wiley interdisciplinary reviews: Data mining and knowledge discovery, 10(3), 1-21. https://doi.org/10.1002/widm.1355