ChatGPT vs. Google Gemini: Assessment of Performance Regarding the Accuracy and Repeatability of Responses to Questions in Implant-Supported Prostheses
Abstract
Purpose: This study aimed to determine the accuracy and repeatability of the responses of different large language models (LLMs) to questions regarding implant-supported prostheses, and to assess the impact of pre-prompt use and time of day.
Materials & Methods: A total of 12 open-ended questions related to implant-supported prostheses were generated, and their content validity was verified by a specialist. The questions were then posed to two LLMs, ChatGPT-4.0 and Google Gemini, in the morning, afternoon, and evening, with and without a pre-prompt. Two expert prosthodontists evaluated the responses with a holistic rubric. Agreement between the graders, and among the repeated responses of ChatGPT and Gemini, was calculated with the Brennan and Prediger coefficient, Cohen's kappa, Fleiss' kappa, and Krippendorff's alpha. Kruskal-Wallis, Mann-Whitney U, independent t-test, and ANOVA analyses were used to compare the responses across implementations.
Results: The accuracy of ChatGPT and Google Gemini was 34.7% and 17.4%, respectively. Using a pre-prompt significantly increased accuracy in Gemini (p = 0.026). No significant difference was found by time of day (morning, afternoon, evening) or between implementations in different weeks. Inter-rater reliability and repeatability both showed high levels of consistency.
Conclusion: The use of a pre-prompt positively affected accuracy and repeatability in both ChatGPT and Google Gemini. However, LLMs can still produce hallucinations. LLMs may therefore assist clinicians, but clinicians should be aware of these limitations.
Keywords: Chatbot, ChatGPT, Prostheses and Implant.
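Two of the agreement coefficients named in the Materials & Methods can be illustrated with a minimal sketch. It is not the authors' analysis code: the rubric scale (0, 1, 2) and the scores of the two graders below are hypothetical values chosen only to show the arithmetic, and the function names are illustrative. Cohen's kappa estimates chance agreement from each grader's own score distribution, whereas the Brennan and Prediger coefficient assumes a uniform chance level of 1/q for q rubric categories.

```python
from collections import Counter


def observed_agreement(rater_a, rater_b):
    """Proportion of items on which the two raters gave the same score."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("raters must score the same non-empty set of items")
    return sum(a == b for a, b in zip(rater_a, rater_b)) / len(rater_a)


def cohen_kappa(rater_a, rater_b):
    """Cohen's kappa: chance agreement comes from each rater's marginals."""
    n = len(rater_a)
    p_o = observed_agreement(rater_a, rater_b)
    count_a, count_b = Counter(rater_a), Counter(rater_b)
    p_e = sum(count_a[c] * count_b[c] for c in count_a) / (n * n)
    if p_e == 1.0:
        raise ValueError("kappa is undefined when both raters use one category")
    return (p_o - p_e) / (1 - p_e)


def brennan_prediger(rater_a, rater_b, n_categories):
    """Brennan-Prediger coefficient: chance agreement fixed at 1/q."""
    if n_categories < 2:
        raise ValueError("at least two categories are required")
    p_o = observed_agreement(rater_a, rater_b)
    p_e = 1 / n_categories
    return (p_o - p_e) / (1 - p_e)


# Hypothetical rubric scores (0, 1, 2) from two graders for 12 responses.
grader_1 = [2, 2, 1, 0, 2, 1, 0, 0, 1, 2, 1, 0]
grader_2 = [2, 2, 1, 0, 1, 1, 0, 0, 1, 2, 2, 0]

kappa = cohen_kappa(grader_1, grader_2)
bp = brennan_prediger(grader_1, grader_2, n_categories=3)
print(f"Cohen's kappa: {kappa:.2f}")
print(f"Brennan-Prediger: {bp:.2f}")
```

In this hypothetical data each grader uses the three scores equally often, so both coefficients give the same value; they diverge when the score distributions are skewed, which is one reason a study might report both.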
Supporting Institution
None
Ethical Statement
None
Thanks
None
Details
Primary Language
English
Subjects
Prosthodontics
Journal Section
Research Article
Early Pub Date
August 30, 2025
Publication Date
August 31, 2025
Submission Date
April 18, 2025
Acceptance Date
June 17, 2025
Published in Issue
Year 2025 Volume: 52 Number: 2
APA
Yılmaz, D., & Çolpak, E. D. (2025). Chatgpt Vs. Google Gemini: Assessment of Performance Regarding the Accuracy and Repeatability of Responses to Questions in Implant-Supported Prostheses. European Annals of Dental Sciences, 52(2), 71-78. https://doi.org/10.52037/eads.2025.0011
Chicago
Yılmaz, Deniz, and Emine Dilara Çolpak. 2025. “Chatgpt Vs. Google Gemini: Assessment of Performance Regarding the Accuracy and Repeatability of Responses to Questions in Implant-Supported Prostheses”. European Annals of Dental Sciences 52 (2): 71-78. https://doi.org/10.52037/eads.2025.0011.