Research Article

Kümeleme Çözümlemesinde Düzeltilmiş Tek Adım M-Tahmin Edicisinin Kullanılması

Year 2010, Volume: 7 Issue: 1, 41 - 54, 15.07.2010

Abstract

Cluster analysis is one of the widely used multivariate statistical methods. The modified one-step M-estimator was obtained through a modification made by Wilcox (Wilcox, 2003a) to the M-estimator proposed by Huber (Huber, 1964). Since 2003, the modified one-step M-estimator has been used in areas such as analysis of variance (Wilcox and Keselman, 2003) and multiple comparisons (Wilcox, 2003b). This study concerns the k-means method, which is classified as a non-hierarchical clustering method, and introduces a clustering algorithm developed by using the modified one-step M-estimator (MoM) in place of the mean.

References

  • Johnson, D.E., 1998. Applied multivariate methods for data analysts. Duxbury Press.
  • Fasulo, D., 1999. An analysis of recent work on clustering algorithms. Technical Report, 01-03-02. Department of Computer Science and Engineering, University of Washington. (in English).
  • Guha, S., Rastogi, R., Shim, K., 1998. CURE: an efficient clustering algorithm for large databases. Proceedings of the 1998 ACM SIGMOD International Conference on Management of Data, L. M. Haas, A. Tiwary (eds.), Seattle, Washington. 73-84.
  • Han, J., Kamber, M., 2001. Data mining: concepts and techniques. Morgan Kaufmann Publishers Inc., San Francisco.
  • Hautamäki, V., Cherednichenko, S., Kärkkäinen, I., Kinnunen, T., Fränti, P., 2005. Improving K-Means by outlier removal. Lecture Notes in Computer Science, Springer/Heidelberg. 978-987.
  • Huber, P.J., 1964. Robust estimation of location parameters. Annals of Mathematical Statistics, 35, 73-101.
  • MacQueen, J., 1967. Some methods for classification and analysis of multivariate observations. University of California Press, Berkeley.
  • Johnson, R.A., Wichern, D.W., 1992. Applied multivariate statistical analysis. Prentice-Hall, New Jersey.
  • Tatlıdil, H., 2002. Uygulamalı çok değişkenli istatistiksel analiz. Ziraat Matbaacılık, Ankara.
  • Wilcox, R.R., 2003a. Applying contemporary statistical techniques. Academic Press.
  • Wilcox, R.R., 2003b. Multiple comparisons based on a modified one-step M-estimator. Journal of Applied Statistics, 37, 1231–1241.
  • Wilcox, R.R., Keselman, H.J., 2003. Repeated measure one-way ANOVA based on a modified one-step M-estimator. British Journal of Mathematical and Statistical Psychology, 56, 15–25.
  • Zaiane, O., Pei, Y., 2008. http://www. .ualberta.ca/~yaling/Cluster/Applet/Code/Cluster.html, July 2008.

Cluster Analysis with Modified One-Step M-Estimator

Abstract

Cluster analysis is one of the most widely used multivariate statistical methods. The modified one-step M-estimator was developed by Wilcox (Wilcox, 2003a) as a modification of Huber's M-estimator (Huber, 1964), and it has since been used in analysis of variance (Wilcox and Keselman, 2003) and multiple comparisons (Wilcox, 2003b). This study presents a clustering algorithm based on the k-means method, which is classified as a non-hierarchical clustering method, in which the mean is replaced by the modified one-step M-estimator.
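
The idea of replacing the mean with a modified one-step M-estimator can be illustrated with a short Python sketch. It is not the algorithm from the article, whose details are not given in this abstract. It assumes the commonly stated form of the estimator: values flagged by the MAD-median rule with threshold 2.24 are dropped and the rest are averaged. It also assumes that the estimator is applied to each coordinate separately and that initial centers are supplied by the caller. The names `mom` and `kmeans_mom` are hypothetical.

```python
from statistics import mean, median


def mom(values, k=2.24):
    """Modified one-step M-estimator of location (illustrative sketch).

    Assumption: a value is an outlier when |x - median| / (MAD / 0.6745) > k;
    the estimate is the mean of the values that are not flagged.
    """
    values = list(values)
    m = median(values)
    mad = median([abs(v - m) for v in values])
    if mad == 0:
        # Assumption: fall back to the median when the MAD is zero.
        return m
    madn = mad / 0.6745  # rescaled MAD
    kept = [v for v in values if abs(v - m) / madn <= k]
    return mean(kept)


def kmeans_mom(points, centers, max_iter=100):
    """k-means-style loop whose center update uses mom() per coordinate."""
    centers = [tuple(c) for c in centers]
    clusters = [[] for _ in centers]
    for _ in range(max_iter):
        # Assignment step: each point goes to the nearest current center.
        clusters = [[] for _ in centers]
        for p in points:
            nearest = min(
                range(len(centers)),
                key=lambda i: sum((a - b) ** 2 for a, b in zip(p, centers[i])),
            )
            clusters[nearest].append(p)
        # Update step: coordinate-wise MoM instead of the mean.
        # An empty cluster keeps its previous center.
        new_centers = [
            tuple(mom(coord) for coord in zip(*cluster)) if cluster else center
            for cluster, center in zip(clusters, centers)
        ]
        if new_centers == centers:
            break
        centers = new_centers
    return centers, clusters
```

For the values 1, 2, 3, 4, 100 the ordinary mean is 22, whereas `mom([1, 2, 3, 4, 100])` drops the value 100 and returns 2.5. In the same way, a single distant point assigned to a cluster does not pull that cluster's center toward it in `kmeans_mom`.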

There are 13 citations in total.

Details

Primary Language: Turkish
Subjects: Economics, Statistics
Journal Section: Research Articles
Authors

Abdullah Fırat Özdemir

Engin Yıldıztepe

Publication Date July 15, 2010
Published in Issue Year 2010 Volume: 7 Issue: 1

Cite

APA Özdemir, A. F., & Yıldıztepe, E. (2010). Kümeleme Çözümlemesinde Düzeltilmiş Tek Adım M-Tahmin Edicisinin Kullanılması. İstatistik Araştırma Dergisi, 7(1), 41-54.