Araştırma Makalesi

Performance Trade-Off for Bert Based Multi-Domain Multilingual Chatbot Architectures

Cilt: 1 Sayı: 2 30 Aralık 2021
PDF İndir
EN

Performance Trade-Off for Bert Based Multi-Domain Multilingual Chatbot Architectures

Öz

Text classification is a natural language processing (NLP) problem that aims to classify previously unseen texts. In this study, Bidirectional Encoder Representations for Transformers (BERT) architecture is preferred for text classification. The classification is aimed explicitly at a chatbot that can give automated responses to website visitors' queries. BERT is trained to reduce the need for RAM and storage by replacing multiple separate models for different chatbots on a server with a single model. Moreover, since a pre-trained multilingual BERT model is preferred, the system reduces the need for system resources. It handles multiple chatbots with multiple languages simultaneously. The model mainly determines a class for a given input text. The classes correspond to specific answers from a database, and the bot selects an answer and replies back. For multiple chatbots, a special masking operation is performed to select a response from within the corresponding bank answers of a chatbot. We tested the proposed model for 13 simultaneous classification problems on a data set of three different languages, Turkish, English, and German, with 333 classes. We reported the accuracies for individually trained models and the proposed model together with the savings in the system resources.

Anahtar Kelimeler

Kaynakça

  1. [1] P. Muangkammuen, N. Intiruk, and K. R. Saikaew, “Automated Thai-FAQ Chatbot using RNN-LSTM,” 2018 22nd International Computer Science and Engineering Conference (ICSEC), 2018.
  2. [2] S. Ozan and D. E. Tasar, “Auto-tagging of short conversational sentences using natural language processing methods,” 2021 29th Signal Processing and Communications Applications Conference (SIU), 2021.
  3. [3] D.E. Taşar, Ş. Ozan., U. Özdil, M.F.Akca, O. Ölmez, S. Gülüm, S. Kutal, and C. Belhan, “Auto-tagging of Short Conversational Sentences using Transformer Methods”. arXiv preprint arXiv:2106.01735, 2021.
  4. [4] D.E. Taşar, Ş. Ozan., M.F.Akca, O. Ölmez, S. Gülüm, S. Kutal, and C. Belhan,“Çok Alanlı Chatbot Mimarilerinde Avantajlı Performans ve Bellek Takası”, presented at ICADA, Online, 26-28 Nov. 2021.
  5. [5] E. S. Tellez, S. Miranda-Jiménez, M. Graff, D. Moctezuma, R. R. Suárez, and O. S. Siordia, “A simple approach to multilingual polarity classification in Twitter,” Pattern Recognition Letters, vol. 94, pp. 68–74, 2017.
  6. [6] X. Ni, J.-T. Sun, J. Hu, and Z. Chen, “Cross lingual text classification by mining multilingual topics from Wikipedia,” Proceedings of the fourth ACM international conference on Web search and data mining - WSDM '11, 2011.
  7. [7] H. Schwenk and X. Li, “A corpus for multilingual document classification in eight languages”, arXiv preprint arXiv:1805.09821, 2018.
  8. [8] X. Zhou, X. Wan, and J. Xiao, “Attention-based LSTM network for cross-lingual sentiment classification,” Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016.

Ayrıntılar

Birincil Dil

İngilizce

Konular

Yapay Zeka

Bölüm

Araştırma Makalesi

Yazarlar

Seçilay Kutal
United States

Fatih Akca
Türkiye

Ceren Belhan Bu kişi benim
Türkiye

Yayımlanma Tarihi

30 Aralık 2021

Gönderilme Tarihi

30 Kasım 2021

Kabul Tarihi

24 Aralık 2021

Yayımlandığı Sayı

Yıl 2021 Cilt: 1 Sayı: 2

Kaynak Göster

APA
Taşar, D. E., Ozan, Ş., Kutal, S., Ölmez, O., Gülüm, S., Akca, F., & Belhan, C. (2021). Performance Trade-Off for Bert Based Multi-Domain Multilingual Chatbot Architectures. Journal of Artificial Intelligence and Data Science, 1(2), 144-149. https://izlik.org/JA77UY82ER
AMA
1.Taşar DE, Ozan Ş, Kutal S, vd. Performance Trade-Off for Bert Based Multi-Domain Multilingual Chatbot Architectures. Journal of Artificial Intelligence and Data Science. 2021;1(2):144-149. https://izlik.org/JA77UY82ER
Chicago
Taşar, Davut Emre, Şükrü Ozan, Seçilay Kutal, vd. 2021. “Performance Trade-Off for Bert Based Multi-Domain Multilingual Chatbot Architectures”. Journal of Artificial Intelligence and Data Science 1 (2): 144-49. https://izlik.org/JA77UY82ER.
EndNote
Taşar DE, Ozan Ş, Kutal S, Ölmez O, Gülüm S, Akca F, Belhan C (01 Aralık 2021) Performance Trade-Off for Bert Based Multi-Domain Multilingual Chatbot Architectures. Journal of Artificial Intelligence and Data Science 1 2 144–149.
IEEE
[1]D. E. Taşar vd., “Performance Trade-Off for Bert Based Multi-Domain Multilingual Chatbot Architectures”, Journal of Artificial Intelligence and Data Science, c. 1, sy 2, ss. 144–149, Ara. 2021, [çevrimiçi]. Erişim adresi: https://izlik.org/JA77UY82ER
ISNAD
Taşar, Davut Emre - Ozan, Şükrü - Kutal, Seçilay - Ölmez, Oğuzhan - Gülüm, Semih - Akca, Fatih - Belhan, Ceren. “Performance Trade-Off for Bert Based Multi-Domain Multilingual Chatbot Architectures”. Journal of Artificial Intelligence and Data Science 1/2 (01 Aralık 2021): 144-149. https://izlik.org/JA77UY82ER.
JAMA
1.Taşar DE, Ozan Ş, Kutal S, Ölmez O, Gülüm S, Akca F, Belhan C. Performance Trade-Off for Bert Based Multi-Domain Multilingual Chatbot Architectures. Journal of Artificial Intelligence and Data Science. 2021;1:144–149.
MLA
Taşar, Davut Emre, vd. “Performance Trade-Off for Bert Based Multi-Domain Multilingual Chatbot Architectures”. Journal of Artificial Intelligence and Data Science, c. 1, sy 2, Aralık 2021, ss. 144-9, https://izlik.org/JA77UY82ER.
Vancouver
1.Davut Emre Taşar, Şükrü Ozan, Seçilay Kutal, Oğuzhan Ölmez, Semih Gülüm, Fatih Akca, Ceren Belhan. Performance Trade-Off for Bert Based Multi-Domain Multilingual Chatbot Architectures. Journal of Artificial Intelligence and Data Science [Internet]. 01 Aralık 2021;1(2):144-9. Erişim adresi: https://izlik.org/JA77UY82ER