Research Article

The Battle of Chatbot Giants: An Experimental Comparison of ChatGPT and Bard

Volume: 16 Number: 2 June 30, 2024

Abstract

Nowadays, it is hard to find a part of human life in which Artificial Intelligence (AI) is not involved. With the recent advances in AI, the change for chatbots has been an ‘evolution’ rather than a ‘revolution’. AI-powered chatbots have become an integral part of customer service because they are as functional as humans (if not more so) and, unlike humans, can provide 24/7 service. Several AI-powered chatbots are publicly available and widely used, so the question “Which one is better?” instinctively comes to mind and deserves to have light shed on it. Motivated by this question, this study presents an experimental comparison of two widely used AI-powered chatbots, namely ChatGPT and Bard. For a quantitative comparison, (i) a gold-standard QA dataset comprising 2,390 questions from 109 topics was used and (ii) a novel answer-scoring algorithm based on cosine similarity was proposed. The chatbots were evaluated on the dataset using the proposed algorithm in terms of (i) the length and (ii) the accuracy of their generated answers. According to the experimental results, (i) Bard generated longer answers than ChatGPT and (ii) Bard's answers were more similar to the ground truth than ChatGPT's.
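
To make the idea of cosine-similarity-based answer scoring concrete, the following is a minimal sketch and not the algorithm proposed in the study, whose details are not given in this abstract. It assumes plain term-frequency (bag-of-words) vectors, a simple regex tokenizer, and answer length measured in word tokens; the function names `tokenize`, `cosine_similarity`, and `answer_length` are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text: str) -> list[str]:
    """Lowercase the text and split it into alphanumeric word tokens.

    Assumption: a simple regex tokenizer; the study may use a different one.
    """
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine_similarity(answer: str, reference: str) -> float:
    """Cosine similarity between the term-frequency vectors of two texts.

    Returns a value in [0, 1]; 0.0 if either text has no tokens.
    """
    a = Counter(tokenize(answer))
    b = Counter(tokenize(reference))
    # Dot product over the tokens the two texts share.
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def answer_length(text: str) -> int:
    """Answer length in word tokens (assumed unit of length)."""
    return len(tokenize(text))
```

Under these assumptions, a chatbot's answer to each question would be scored with `cosine_similarity(generated_answer, ground_truth)`, and the scores averaged per chatbot. Note that a bag-of-words score ignores word order and synonyms, and that longer answers dilute the score when the ground truth is short, so any real scoring scheme would need to account for answer length.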

Keywords

chatbot, question answering, artificial intelligence, ChatGPT, Bard, large language model

APA
Kabakuş, A. T., & Dogru, İ. (2024). The Battle of Chatbot Giants: An Experimental Comparison of ChatGPT and Bard. International Journal of Engineering Research and Development, 16(2), 679-691. https://doi.org/10.29137/umagd.1390083