Nowadays, with the advancement of technology, the use of news sources has also undergone a great evolution. News sources have constantly evolved from past to present, ranging from magazines to radios, from newspapers to televisions. The fact that it has become so easy to access news has caused society to pay more attention to fake news. Fake news has the ability to influence society through news sources such as social media, which can reach wider audiences with the development of technology. The difficulties of users in accessing accurate and reliable sources in this information flow that shapes their daily lives increases the potential for the spread of fake news, and it becomes increasingly difficult to distinguish between real and fake news. In this study, classification models for fake news detection were designed using machine learning algorithms. The dataset, which includes fake and real news examples, contains 42,000 examples. Each class, including fake and real samples, contains 22,000 sample data. In order to increase data quality, accuracy and usability, preprocessing methods were applied to the data set. The removal of numbers, stop words, and html tags was done in the pre-processing step to remove unnecessary information from the text. Models were created for fake news detection with singular and ensemble classification algorithms. Performance evaluation of the models was performed using 5-fold cross-validation. In the performance comparisons of the models, values such as accuracy, sensitivity, specificity, tp rate and fp rate were calculated. The highest performance results were observed in the random forest classification algorithm with an accuracy rate of 76%.
The authors declare that no acknowledgments are applicable for this study.
Birincil Dil | İngilizce |
---|---|
Konular | İstatistiksel Analiz |
Bölüm | Makaleler |
Yazarlar | |
Yayımlanma Tarihi | 30 Eylül 2024 |
Gönderilme Tarihi | 24 Nisan 2024 |
Kabul Tarihi | 7 Eylül 2024 |
Yayımlandığı Sayı | Yıl 2024 Cilt: 20 Sayı: 3 |