Research Article

Comparison of Hard and Fuzzy Clustering Techniques and Selection of Optimal Fuzzifier Parameter: An Application on Household Characteristics and Health Expenditures

Volume: 11 Number: 1 January 8, 2024
TR EN

Comparison of Hard and Fuzzy Clustering Techniques and Selection of Optimal Fuzzifier Parameter: An Application on Household Characteristics and Health Expenditures

Abstract

It is a challenging task for decision makers for finding the optimal classification pattern for the dataset obtained from national accounts, such as household budget survey (HBS) data. Fuzzy c-means (FCM) clustering, a fuzzy logic-based clustering algorithm, can be used effectively to find the proper cluster structure of given data sets under uncertainty. In this study, crisp (k-means) and fuzzy (FCM) clustering performances on grouping of households are compared while changing fuzzifier parameter for FCM. The results of the study reveal that FCM clustering performs better when compared with k-means clustering. It is found out that the optimal number of household groups is 5 and further, high cluster validity index scores are obtained when fuzzifier value is 1.5 in FCM clustering. High cluster validity index scores obtained from fuzzy Silhouette is compared to the crisp cluster validity index. The experimental results proved that fuzzy clustering superior grouping ability and it has better validity measures for grouping of households in a national dataset. It is observed that smaller fuzzifier value is a better choice to enhance fitness of fuzzy clustering. It is hoped that future experiments will compare the clustering abilities of FCM using datasets with different sizes and variables under the uncertainty conditions to determine the class boundary.

Keywords

fuzzycmeans, fuzzifier parameter, kmeans, silhouette

References

  1. Askari, S. (2021). Fuzzy C-means clustering algorithm for data with unequal cluster sizes and contaminated with noise and outliers: review and development. Expert Systems with Applications, 165(113856), 1-27.
  2. Bezdek J.C. (1981). Pattern recognition with fuzzy objective algorithms. Plenum Press. New York.
  3. Bezdek, J. C., Ehrlich, R., & Full, W. (1984). FCM: the fuzzy c-means clustering algorithm. Computers & Geosciences, 10(2-3), 191-203.
  4. Bonis, T., & Oudot, S. (2018). A fuzzy clustering algorithm for the mode-seeking framework. Pattern Recognition Letters, 102, 43-73.
  5. Campello, R. J., & Hruschka, E. R. (2006). A fuzzy extension of the silhouette width criterion for cluster analysis. Fuzzy Sets Systems, 157(2), 2858-2875.
  6. Chan, K. P., & Cheung, Y. S. (1992). Clustering of clusters. Pattern Recognition, 25(2), 211-217.
  7. De Carvalho, F. D. A., Lechevallier, Y., & De Melo, F. M. (2021). Partitioning hard clustering algorithms based on multiple dissimilarity matrices. Pattern Recognition, 45(1), 447-464.
  8. Di Martino, F., & Sessa, S. (2022). A novel quantum inspired genetic algorithm to initialize cluster centers in fuzzy C-means. Expert Systems with Applications, 191(116340), 1-10.
  9. Dunn J.C. (1974). A fuzzy relative ISODATA process and its use in detecting compact well-separated clusters. Journal of Cybernetics, 3(3), 32-57.
  10. Ferreira, M. R., de Carvalho, F. D. A., & Simões, E. C. (2016). Kernel based hard clustering methods with kernelization of the metric and automatic weighting of the variables. Pattern Recognition, 51, 310-321.
APA
Çınaroğlu, S. (2024). Comparison of Hard and Fuzzy Clustering Techniques and Selection of Optimal Fuzzifier Parameter: An Application on Household Characteristics and Health Expenditures. Optimum Ekonomi Ve Yönetim Bilimleri Dergisi, 11(1), 17-31. https://doi.org/10.17541/optimum.1269918
AMA
1.Çınaroğlu S. Comparison of Hard and Fuzzy Clustering Techniques and Selection of Optimal Fuzzifier Parameter: An Application on Household Characteristics and Health Expenditures. OJEMS. 2024;11(1):17-31. doi:10.17541/optimum.1269918
Chicago
Çınaroğlu, Songül. 2024. “Comparison of Hard and Fuzzy Clustering Techniques and Selection of Optimal Fuzzifier Parameter: An Application on Household Characteristics and Health Expenditures”. Optimum Ekonomi Ve Yönetim Bilimleri Dergisi 11 (1): 17-31. https://doi.org/10.17541/optimum.1269918.
EndNote
Çınaroğlu S (January 1, 2024) Comparison of Hard and Fuzzy Clustering Techniques and Selection of Optimal Fuzzifier Parameter: An Application on Household Characteristics and Health Expenditures. Optimum Ekonomi ve Yönetim Bilimleri Dergisi 11 1 17–31.
IEEE
[1]S. Çınaroğlu, “Comparison of Hard and Fuzzy Clustering Techniques and Selection of Optimal Fuzzifier Parameter: An Application on Household Characteristics and Health Expenditures”, OJEMS, vol. 11, no. 1, pp. 17–31, Jan. 2024, doi: 10.17541/optimum.1269918.
ISNAD
Çınaroğlu, Songül. “Comparison of Hard and Fuzzy Clustering Techniques and Selection of Optimal Fuzzifier Parameter: An Application on Household Characteristics and Health Expenditures”. Optimum Ekonomi ve Yönetim Bilimleri Dergisi 11/1 (January 1, 2024): 17-31. https://doi.org/10.17541/optimum.1269918.
JAMA
1.Çınaroğlu S. Comparison of Hard and Fuzzy Clustering Techniques and Selection of Optimal Fuzzifier Parameter: An Application on Household Characteristics and Health Expenditures. OJEMS. 2024;11:17–31.
MLA
Çınaroğlu, Songül. “Comparison of Hard and Fuzzy Clustering Techniques and Selection of Optimal Fuzzifier Parameter: An Application on Household Characteristics and Health Expenditures”. Optimum Ekonomi Ve Yönetim Bilimleri Dergisi, vol. 11, no. 1, Jan. 2024, pp. 17-31, doi:10.17541/optimum.1269918.
Vancouver
1.Songül Çınaroğlu. Comparison of Hard and Fuzzy Clustering Techniques and Selection of Optimal Fuzzifier Parameter: An Application on Household Characteristics and Health Expenditures. OJEMS. 2024 Jan. 1;11(1):17-31. doi:10.17541/optimum.1269918