Kohonen Öğrenme Kuralı Yardımıyla Centroid Değerleri Güncelleme

Hidayet Takcı

doi:10.55205/joctensa.3220241801338

Research Article

Kohonen Öğrenme Kuralı Yardımıyla Centroid Değerleri Güncelleme

Year 2024, Volume: 3 Issue: 2, 57 - 67, 02.12.2025

Hidayet Takcı

https://doi.org/10.55205/joctensa.3220241801338

Abstract

Metin madenciliği alanında öne çıkan görevlerden birisi metin sınıflandırmadır. Konuyla ilgili çok sayıda tümevarımsal öğrenim algoritması bulunmakta olup onlardan birisi de centroid tabanlı sınıflayıcıdır. Performansı naive bayes sınıflayıcı kadar iyi olan centroid sınıflayıcıların en büyük problemi model misfittir ve bu problemin çözümü için centroid değerlerinin güncellenmesine ihtiyaç vardır. Centroid güncelleme için bugüne kadar birçok yöntem geliştirilmiş olup bu çalışmada yeni bir yöntem önerilmektedir. Önerimiz; kohonen öğrenme kuralı yardımıyla centroid değerlerinin dinamik olarak güncellenmesi üzerinedir. Bu öneriye uygun olarak veriler üzerinde deneyler yapılmış ve kohonen öğrenme kuralının centroid sınıflayıcı performansını artırdığı gözlemlenmiştir.

Keywords

centroid sınıflayıcı , model misfit , centroid güncelleme , kohonen öğrenme

References

Chovanec, P., et al. (2023). A survey of text classification with transformers: How wide? How large? How long? How accurate? How expensive? arXiv preprint arXiv:2311.14758.
Cortes, C., & Vapnik, V. (1995). Support-vector networks. Machine Learning, 20(3), 273–297.
Fabrizio, S. (2002). Machine learning in automated text categorization. ACM Computing Surveys, 34(1), 1–47.
Guan, H., Zhou, J., & Guo, M. (2009). A class-feature-centroid classifier for text categorization. In Proceedings of the WWW Conference (pp. 1–10). Madrid, Spain.
Han, E.-H., & Karypis, G. (2000). Centroid-based document classification: Analysis and experimental results. In Principles of Data Mining and Knowledge Discovery (pp. 424–431).
Joachims, T. (1998). Text categorization with support vector machines: Learning with many relevant features. In C. N. Dellec & C. Rouveirol (Eds.), Proceedings of the 10th European Conference on Machine Learning (ECML-98) (pp. 137–142). Springer-Verlag.
Kohonen, T. (1984). Self-organization and associative memory. Springer-Verlag.
Lertnattee, V., & Theeramunkong, T. (2002). Combining homogeneous classifiers for centroid-based text classification. In Proceedings of ISCC (pp. 1034–1039).
Lewis, D., & Ringuette, M. (1994). Comparison of two learning algorithms for text categorization. In Proceedings of the 3rd Annual Symposium on Document Analysis and Information Retrieval.
Li, C. H., & Park, S. C. (2009). Combination of modified BPNN algorithms and an efficient feature selection method for text categorization. Information Processing and Management, 45(3), 329–340.
Li, Q., Peng, H., Li, J., Xia, C., Yang, R., Sun, L., & Yu, P. (2020). A survey on text classification: From shallow to deep learning. arXiv preprint arXiv:2008.00364.
Masand, B., Linoff, G., & Waltz, D. (1992). Classifying news stories using memory-based reasoning. In Proceedings of the 15th ACM International Conference on Research and Development in Information Retrieval (SIGIR-92) (pp. 59–65). ACM.
Öztemel, E. (2003). Yapay sinir ağları. Papatya Yayıncılık.
Salton, G. (1989). Automatic text processing: The transformation, analysis, and retrieval of information by computer. Addison-Wesley.
Takçı, H., & Soğukpınar, İ. (2004). Centroid-based language identification using letter feature set. In Proceedings of CicLing (pp. 635–645). Springer-Verlag.
Tan, S. (2007). Large margin drag-pushing strategy for centroid text categorization. Expert Systems with Applications, 33(1), 215–220.
Tan, S. (2008). An improved centroid classifier for text categorization. Expert Systems with Applications, 35(1–2), 279–285.
Wang, K., Ding, Y., & Han, S. C. (2023). Graph neural networks for text classification: A survey. arXiv preprint arXiv:2304.11534.

Updating Centroid Values Using Kohonen Learning Rule

Year 2024, Volume: 3 Issue: 2, 57 - 67, 02.12.2025

Hidayet Takcı

https://doi.org/10.55205/joctensa.3220241801338

Abstract

Text classification is one of the most prominent tasks in text mining. Numerous inductive learning algorithms exist, one of which is the centroid-based classifier. Centroid classifiers, which perform as well as naive Bayes classifiers, suffer from model misfit, a major problem requiring centroid values to be updated. Many methods have been developed for centroid updating, and this study proposes a new method. Our proposal involves dynamically updating centroid values using the Kohonen learning rule. Experiments have been conducted on data based on this proposal, and it has been observed that the Kohonen learning rule improves centroid classifier performance.

Keywords

centroid classifier , model misfit , centroid updating , kohonen learning

References

Chovanec, P., et al. (2023). A survey of text classification with transformers: How wide? How large? How long? How accurate? How expensive? arXiv preprint arXiv:2311.14758.
Cortes, C., & Vapnik, V. (1995). Support-vector networks. Machine Learning, 20(3), 273–297.
Fabrizio, S. (2002). Machine learning in automated text categorization. ACM Computing Surveys, 34(1), 1–47.
Guan, H., Zhou, J., & Guo, M. (2009). A class-feature-centroid classifier for text categorization. In Proceedings of the WWW Conference (pp. 1–10). Madrid, Spain.
Han, E.-H., & Karypis, G. (2000). Centroid-based document classification: Analysis and experimental results. In Principles of Data Mining and Knowledge Discovery (pp. 424–431).
Joachims, T. (1998). Text categorization with support vector machines: Learning with many relevant features. In C. N. Dellec & C. Rouveirol (Eds.), Proceedings of the 10th European Conference on Machine Learning (ECML-98) (pp. 137–142). Springer-Verlag.
Kohonen, T. (1984). Self-organization and associative memory. Springer-Verlag.
Lertnattee, V., & Theeramunkong, T. (2002). Combining homogeneous classifiers for centroid-based text classification. In Proceedings of ISCC (pp. 1034–1039).
Lewis, D., & Ringuette, M. (1994). Comparison of two learning algorithms for text categorization. In Proceedings of the 3rd Annual Symposium on Document Analysis and Information Retrieval.
Li, C. H., & Park, S. C. (2009). Combination of modified BPNN algorithms and an efficient feature selection method for text categorization. Information Processing and Management, 45(3), 329–340.
Li, Q., Peng, H., Li, J., Xia, C., Yang, R., Sun, L., & Yu, P. (2020). A survey on text classification: From shallow to deep learning. arXiv preprint arXiv:2008.00364.
Masand, B., Linoff, G., & Waltz, D. (1992). Classifying news stories using memory-based reasoning. In Proceedings of the 15th ACM International Conference on Research and Development in Information Retrieval (SIGIR-92) (pp. 59–65). ACM.
Öztemel, E. (2003). Yapay sinir ağları. Papatya Yayıncılık.
Salton, G. (1989). Automatic text processing: The transformation, analysis, and retrieval of information by computer. Addison-Wesley.
Takçı, H., & Soğukpınar, İ. (2004). Centroid-based language identification using letter feature set. In Proceedings of CicLing (pp. 635–645). Springer-Verlag.
Tan, S. (2007). Large margin drag-pushing strategy for centroid text categorization. Expert Systems with Applications, 33(1), 215–220.
Tan, S. (2008). An improved centroid classifier for text categorization. Expert Systems with Applications, 35(1–2), 279–285.
Wang, K., Ding, Y., & Han, S. C. (2023). Graph neural networks for text classification: A survey. arXiv preprint arXiv:2304.11534.

Year 2024, Volume: 3 Issue: 2, 57 - 67, 02.12.2025

Hidayet Takcı

https://doi.org/10.55205/joctensa.3220241801338

Abstract

References

Chovanec, P., et al. (2023). A survey of text classification with transformers: How wide? How large? How long? How accurate? How expensive? arXiv preprint arXiv:2311.14758.
Cortes, C., & Vapnik, V. (1995). Support-vector networks. Machine Learning, 20(3), 273–297.
Fabrizio, S. (2002). Machine learning in automated text categorization. ACM Computing Surveys, 34(1), 1–47.
Guan, H., Zhou, J., & Guo, M. (2009). A class-feature-centroid classifier for text categorization. In Proceedings of the WWW Conference (pp. 1–10). Madrid, Spain.
Han, E.-H., & Karypis, G. (2000). Centroid-based document classification: Analysis and experimental results. In Principles of Data Mining and Knowledge Discovery (pp. 424–431).
Joachims, T. (1998). Text categorization with support vector machines: Learning with many relevant features. In C. N. Dellec & C. Rouveirol (Eds.), Proceedings of the 10th European Conference on Machine Learning (ECML-98) (pp. 137–142). Springer-Verlag.
Kohonen, T. (1984). Self-organization and associative memory. Springer-Verlag.
Lertnattee, V., & Theeramunkong, T. (2002). Combining homogeneous classifiers for centroid-based text classification. In Proceedings of ISCC (pp. 1034–1039).
Lewis, D., & Ringuette, M. (1994). Comparison of two learning algorithms for text categorization. In Proceedings of the 3rd Annual Symposium on Document Analysis and Information Retrieval.
Li, C. H., & Park, S. C. (2009). Combination of modified BPNN algorithms and an efficient feature selection method for text categorization. Information Processing and Management, 45(3), 329–340.
Li, Q., Peng, H., Li, J., Xia, C., Yang, R., Sun, L., & Yu, P. (2020). A survey on text classification: From shallow to deep learning. arXiv preprint arXiv:2008.00364.
Masand, B., Linoff, G., & Waltz, D. (1992). Classifying news stories using memory-based reasoning. In Proceedings of the 15th ACM International Conference on Research and Development in Information Retrieval (SIGIR-92) (pp. 59–65). ACM.
Öztemel, E. (2003). Yapay sinir ağları. Papatya Yayıncılık.
Salton, G. (1989). Automatic text processing: The transformation, analysis, and retrieval of information by computer. Addison-Wesley.
Takçı, H., & Soğukpınar, İ. (2004). Centroid-based language identification using letter feature set. In Proceedings of CicLing (pp. 635–645). Springer-Verlag.
Tan, S. (2007). Large margin drag-pushing strategy for centroid text categorization. Expert Systems with Applications, 33(1), 215–220.
Tan, S. (2008). An improved centroid classifier for text categorization. Expert Systems with Applications, 35(1–2), 279–285.
Wang, K., Ding, Y., & Han, S. C. (2023). Graph neural networks for text classification: A survey. arXiv preprint arXiv:2304.11534.

There are 18 citations in total.

Details

Primary Language	Turkish
Subjects	Supervised Learning
Journal Section	Research Article
Authors	Hidayet Takcı 0000-0002-4448-4284
Publication Date	December 2, 2025
Submission Date	October 12, 2025
Acceptance Date	November 4, 2025
Published in Issue	Year 2024 Volume: 3 Issue: 2

Cite

APA	Takcı, H. (2025). Kohonen Öğrenme Kuralı Yardımıyla Centroid Değerleri Güncelleme. Cihannüma Teknoloji Fen Ve Mühendislik Bilimleri Akademi Dergisi, 3(2), 57-67. https://doi.org/10.55205/joctensa.3220241801338

Download Cover Image

Article Files

Full Text