Araştırma Makalesi

A New Method for Determining the Number of Clusters Without Clustering

Cilt: 30 Sayı: 2 31 Ağustos 2025
PDF İndir
TR EN

A New Method for Determining the Number of Clusters Without Clustering

Öz

Clustering methods are essential for identifying patterns in data, and the number of clusters significantly impacts the quality of results. Determining the optimal number of clusters is challenging, particularly for large datasets, as traditional methods can be computationally expensive. Developing efficient techniques to determine the number of clusters is crucial for improving both the accuracy and scalability of clustering, especially in large-scale applications. In this study, a new approach for determining the number of clusters is presented. The proposed method aims to find the number of clusters based solely on the distances between data points, without performing clustering. Similar to the Elbow method, the elbow point is found for the distances between data points, and the number of clusters is determined using this elbow point. The proposed algorithm was compared with the Elbow method using 11 real-world datasets and 4 performance metrics. The results demonstrate that the proposed method is particularly advantageous in terms of time complexity, especially as the dataset size increases.

Anahtar Kelimeler

Clustering, Comparative analysis, Cluster validity index

Kaynakça

  1. Akogul, S., & Erisoglu, M. (2017). An approach for determining the number of clusters in a model-based cluster analysis. Entropy, 19(9), 452. https://doi.org/10.3390/e19090452
  2. Arthur, D., & Vassilvitskii, S. (2007). k-means++: The advantages of careful seeding. Proceedings of the Eighteenth Annual ACM-SIAM Symposium on Discrete Algorithms, in New Orleans, Louisiana.
  3. Calinski, T., & Harabasz, J. (1974). A dendrite method for cluster analysis. Communications in Statistics - Theory and Methods, 3(1), 1-27. https://doi.org/10.1080/03610927408827101
  4. Davies, D. L., & Bouldin, D. W. (1979). A cluster separation measure. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1(2), 224-227. https://doi.org/10.1109/TPAMI.1979.4766909
  5. Doan, H., & Nguyen, D. (2018). A method for finding the appropriate number of clusters. Int. Arab J. Inf. Technol., 15(4), 675-682.
  6. Dua, D., & Graff, C. (2019). UCI machine learning repository. Access date: 10.12.2024 https://archive.ics.uci.edu/datasets
  7. Halkidi, M., & Vazirgiannis, M. (2001, November). Clustering validity assessment: Finding the optimal partitioning of a data set. In Proceedings 2001 IEEE international conference on data mining (pp. 187-194). IEEE. https://doi.org/10.1109/ICDM.2001.989517
  8. Hartigan, J. A., & Wong, M. A. (1979). Algorithm AS 136: A k-means clustering algorithm. Journal of the Royal Statistical Society: Series C (Applied Statistics), 28(1), 100–108. https://doi.org/10.2307/2346830
  9. Jain, A. K. (2010). Data clustering: 50 years beyond K-means. Pattern Recognition Letters, 31(8), 651-666. https://doi.org/10.1016/j.patrec.2009.09.011
  10. Jung, Y., Park, H., Du, D. Z., & Drake, B. L. (2003). A decision criterion for the optimal number of clusters in hierarchical clustering. Journal of Global Optimization, 25(1), 91-111. https://doi.org/10.1023/A:1021394316112

Kaynak Göster

APA
Turan, D. S. (2025). A New Method for Determining the Number of Clusters Without Clustering. Yüzüncü Yıl Üniversitesi Fen Bilimleri Enstitüsü Dergisi, 30(2), 596-607. https://doi.org/10.53433/yyufbed.1612608
AMA
1.Turan DS. A New Method for Determining the Number of Clusters Without Clustering. YYUFBED. 2025;30(2):596-607. doi:10.53433/yyufbed.1612608
Chicago
Turan, Duygu Selin. 2025. “A New Method for Determining the Number of Clusters Without Clustering”. Yüzüncü Yıl Üniversitesi Fen Bilimleri Enstitüsü Dergisi 30 (2): 596-607. https://doi.org/10.53433/yyufbed.1612608.
EndNote
Turan DS (01 Ağustos 2025) A New Method for Determining the Number of Clusters Without Clustering. Yüzüncü Yıl Üniversitesi Fen Bilimleri Enstitüsü Dergisi 30 2 596–607.
IEEE
[1]D. S. Turan, “A New Method for Determining the Number of Clusters Without Clustering”, YYUFBED, c. 30, sy 2, ss. 596–607, Ağu. 2025, doi: 10.53433/yyufbed.1612608.
ISNAD
Turan, Duygu Selin. “A New Method for Determining the Number of Clusters Without Clustering”. Yüzüncü Yıl Üniversitesi Fen Bilimleri Enstitüsü Dergisi 30/2 (01 Ağustos 2025): 596-607. https://doi.org/10.53433/yyufbed.1612608.
JAMA
1.Turan DS. A New Method for Determining the Number of Clusters Without Clustering. YYUFBED. 2025;30:596–607.
MLA
Turan, Duygu Selin. “A New Method for Determining the Number of Clusters Without Clustering”. Yüzüncü Yıl Üniversitesi Fen Bilimleri Enstitüsü Dergisi, c. 30, sy 2, Ağustos 2025, ss. 596-07, doi:10.53433/yyufbed.1612608.
Vancouver
1.Duygu Selin Turan. A New Method for Determining the Number of Clusters Without Clustering. YYUFBED. 01 Ağustos 2025;30(2):596-607. doi:10.53433/yyufbed.1612608