Multi-Class News Classification with BERT, DistilBERT, RoBERTa, and ELECTRA Natural Language Processing Models
Abstract
Keywords
BERT, DistilBERT, RoBERTa, ELECTRA, Natural language processing, News classification, Text classification
Supporting Institution
Ethical Statement
Thanks
References
- Anand, S., & Prakasam, P. (2024). Deep learning-based text news classification using bi-directional LSTM model. In 2024 3rd International Conference on Artificial Intelligence for Internet of Things (AIIoT 2024). https://doi.org/10.1109/AIIoT58432.2024.10574679
- Aydın, Ö., & Kantarcı, H. (2024). Türkçe anahtar sözcük çıkarımında LSTM ve BERT tabanlı modellerin karşılaştırılması. Bilgisayar Bilimleri ve Mühendisliği Dergisi, 17(1), 9–18. https://doi.org/10.54525/bbmd.1454220
- Clark, K., Luong, M.-T., Le, Q. V., & Manning, C. D. (2020). ELECTRA: Pre-training text encoders as discriminators rather than generators. arXiv Preprint arXiv:2003.10555. http://arxiv.org/abs/2003.10555
- Devlin, J., Chang, M.-W., Lee, K., & Toutanova, K. (2019). BERT: Pre-training of deep bidirectional transformers for language understanding. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 1, 4171–4186. https://doi.org/10.48550/arXiv.1810.04805
- Dos Santos, D. P., Da Costa, J. P. J., Da Silva, D. A., Mendonca, F., Veiga, C., & De Sousa, R. T. (2023). Multi-class text classification based in oversampling for highly imbalanced dataset. Proceedings of the 22nd IEEE International Conference on Machine Learning and Applications (ICMLA 2023) (pp. 752–755). https://doi.org/10.1109/ICMLA58977.2023.00109
- Dvořáčková, L. (2025). Analyzing word embeddings and their impact on semantic similarity: Through extreme simulated conditions to real dataset characteristics. Neural Computing and Applications, 37(19), 13765–13793. https://doi.org/10.1007/S00521-025-11231-4
- Hassan, S. U., Ahamed, J., & Ahmad, K. (2022). Analytics of machine learning-based algorithms for text classification. Sustainable Operations and Computers, 3, 238–248. https://doi.org/10.1016/j.susoc.2022.03.001
- İzdaş, T., İskifoğlu, H., & Diri, B. (2025). Occupation prediction from twitter data. Dokuz Eylül Üniversitesi Mühendislik Fakültesi Fen ve Mühendislik Dergisi, 27(80), 267–271. https://doi.org/10.21205/deufmd.2025278013
- Joachims, T. (1998). Text categorization with Support Vector Machines: Learning with many relevant features. In C. Nédellec & C. Rouveirol (Eds.), Machine learning: ECML‑98 (Lecture Notes in Computer Science, Vol. 1398, pp. 137–142). Springer. https://doi.org/10.1007/BFb0026683
- Joseph, J., Vineetha, S., & Sobhana, N. V. (2022). A survey on deep learning based sentiment analysis. Materials Today: Proceedings, 58(1), 456–460. https://doi.org/10.1016/j.matpr.2022.02.483