Research Article

Classification of News Texts by Categories Using Machine Learning Methods

Volume: 10 Number: 2 December 31, 2022
EN

Classification of News Texts by Categories Using Machine Learning Methods

Abstract

In parallel with the advances in technology, digital journalism is preferred more than printed journalism day by day. Due to the fast and up-to-date sense of journalism provided by digital journalism and its ubiquitous accessibility features, it is read more by users. In addition to these advantages provided by digital journalism, it also has some difficulties compared to printed journalism. The stage of preparation and delivery of the news to the user requires more technological knowledge and equipment compared to printed journalism. The processes of title selection, text creation, photo selection and determination of the appropriate news category in the preparation phase of the news are designed to be both faster and user-friendly compared to printed publishing. The news created to be presented to the target audience may belong to one or more of different categories such as economy, politics, sports, technology, and health. The inclusion of the news in the appropriate category provides convenience in terms of reaching the right audience and archiving the news correctly. In this study, news texts were classified according to their categories based on the machine learning methods. In the study, news of five newspapers in three different categories were used. Bayesian classifier and decision tree methods were used to classify the news in the dataset including a total of 10.500 news. In the results of the study, it was observed that the Bayesian classifier classified the news more successfully according to their categories.

Keywords

References

  1. Acı, Ç.İ., Çırak, A. 2019. “Türkçe Haber Metinlerinin Konvolüsyonel Sinir Ağları ve Word2Vec Kullanılarak Sınıflandırılması”, Bilişim Teknolojileri Dergisi, 12(3), 219–228.
  2. Adak, M.F., Yurtay, N. 2013. "Gini Algoritmasını Kullanarak Karar Ağacı Oluşturmayı Sağlayan Bir Yazılımın Geliştirilmesi," Internatıonal Journal of Informatics Technologies, 6(3), 1-6.
  3. Amasyalı, M.F., Yıldırım, T. 2004. “Otomatik haber metinleri sınıflandırma”, 13. Sinyal İşleme ve Uygulama Kurultayı, 224–226, Kuşadası, Türkiye.
  4. Amasyalı, M.F., Beken, A. 2009. “Türkçe Kelimelerin Anlamsal Benzerliklerinin Ölçülmesi ve Metin Sınıflandırmada Kullanılması”, IEEE 17. Sinyal İşleme ve İletişim Uygulamaları Kurultayı, Antalya, Türkiye.
  5. Amasyalı, M.F., Diri, B., Türkoğlu, F. 2006. “Farklı özellik vektörleri ile Türkçe dokümanların yazarlarının belirlenmesi”, 15th Turkish Symposium on Artificial Intelligence and Neural Network, Muğla, Türkiye.
  6. Aşlıyan, R., Günel, K. 2010. “Metin İçerikli Türkçe Dokümanların Sınıflandırılması”, Akademik Bilişim Konferansı, 659–665, Muğla, Türkiye.
  7. Aydoğan, D. 2013. Türkiye’de dijital gazetecilik: Habertürk ve Hürriyet gazeteleri örneği. Turkish Online Journal of Design Art and Communication, 3(3), 26-40.
  8. Bardoel, J. (1996). Beyond journalism: A profession between ınformation society and civil society. European Journal of Communication, 11(3), 283-302.

Details

Primary Language

English

Subjects

Operation

Journal Section

Research Article

Publication Date

December 31, 2022

Submission Date

July 27, 2022

Acceptance Date

October 20, 2022

Published in Issue

Year 2022 Volume: 10 Number: 2

APA
Kayakuş, M., & Yiğit Açıkgöz, F. (2022). Classification of News Texts by Categories Using Machine Learning Methods. Alphanumeric Journal, 10(2), 155-166. https://doi.org/10.17093/alphanumeric.1149753

Cited By

Estimation Of Turkey's Carbon Dioxide Emission with Machine Learning

International Journal of Computational and Experimental Science and Engineering

https://doi.org/10.22399/ijcesen.302

Alphanumeric Journal is hosted on DergiPark, a web based online submission and peer review system powered by TUBİTAK ULAKBIM.

Alphanumeric Journal is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License