Research Article

Performance of large language models on prosthodontics questions of the dentistry specialization examination: a comparative analysis (2014–2024)

Volume: 7 Number: 6 October 26, 2025
EN TR

Performance of large language models on prosthodontics questions of the dentistry specialization examination: a comparative analysis (2014–2024)

Abstract

Aims: This study aimed to comparatively evaluate the performance of five contemporary large language models (LLMs) on prosthodontics questions of the dentistry specialization examination (DUS) between 2014 and 2024. Methods: A total of 167 prosthodontics questions from the DUS were analyzed. The questions were administered to five different LLMs: ChatGPT-5 (OpenAI Inc., USA), Claude 4 (Anthropic, USA), Gemini 1.5 Pro (Google LLC, USA), DeepSeek-V2 (DeepSeek AI, China), and Perplexity Pro (Perplexity AI, USA). The models’ responses were compared with the official answer keys provided by the Student Selection and Placement Center (OSYM), coded as correct or incorrect, and accuracy percentages were calculated. Statistical analyses included the Friedman test, correlation analysis, and frequency distributions. Subsection analyses were also performed to evaluate model performance across different content areas. Results: DeepSeek-V2 achieved the highest overall accuracy rate (70.06%). Perplexity Pro (53.89%) and Gemini 1.5 Pro (51.50%) demonstrated moderate performance, ChatGPT-5 (49.10%) performed close to human levels, while Claude 4 had the lowest accuracy (32.34%). Subsection analyses revealed high accuracy in standardized knowledge areas such as implantology and temporomandibular joint (TMJ) disorders (66.7-100%), whereas notable decreases were observed in occlusion and morphology questions (9.1-53.9%). Correlation analyses indicated significant relationships between certain models. Conclusion: The findings demonstrate heterogeneous performance of LLMs on DUS prosthodontics questions. While these models may serve as supplementary tools for exam preparation and dental education, their variable accuracy and potential for generating misinformation suggest they should not be used independently. Under expert supervision, LLMs may enhance dental education.

Keywords

References

  1. Aura-Tormos JI, Llacer-Martinez M, Torres-Osca I. Educational applications of ChatGPT in university-based dental education. A systematic review. Eur J Dent Educ. 2025. doi:10.1111/eje.70011
  2. Huang Y, Gomaa A, Semrau S, et al. Benchmarking ChatGPT-4 on a radiation oncology in-training exam and Red Journal Gray Zone cases: potentials and challenges for ai-assisted medical education and decision making in radiation oncology. Front Oncol. 2023;13:1265024. doi:10. 3389/fonc.2023.1265024
  3. Ali K, Barhom N, Marino FT, Duggal M. The thrills and chills of ChatGPT: implications for assessments in undergraduate dental education. 2023. doi:10.20944/preprints202302.0513.v1
  4. Uribe SE, Maldupa I, Kavadella A, et al. Artificial intelligence chatbots and large language models in dental education: worldwide survey of educators. Eur J Dent Educ. 2024;28(4):865-876. doi:10.1111/eje.13009
  5. Sallam M, Al-Salahat K. Below average ChatGPT performance in medical microbiology exam compared to university students. Front Educ. 2023;8:1333415. doi:10.3389/feduc.2023.1333415
  6. Chau RCW, Thu KM, Yu OY, et al. Evaluation of Chatbot responses to text-based multiple-choice questions in prosthodontic and restorative dentistry. Dent J. 2025;13(7):279. doi:10.3390/dj13070279
  7. Özyemişci N, Bal BT, Güngör MB, Öztürk EK, Canvar A, Nemli SK. Evaluation of information provided by artificial intelligence chatbots on extraoral maxillofacial prostheses. J Prosthet Dent. 2025;25:3913. doi:10. 1016/j.prosdent.2025.08.028
  8. Sismanoglu S, Capan BS. Performance of artificial intelligence on Turkish dental specialization exam: can ChatGPT-4.0 and gemini advanced achieve comparable results to humans? BMC Med Educ. 2025; 25(1):214. doi:10.1186/s12909-024-06389-9

Details

Primary Language

English

Subjects

Information Systems (Other), Prosthodontics

Journal Section

Research Article

Publication Date

October 26, 2025

Submission Date

September 23, 2025

Acceptance Date

October 15, 2025

Published in Issue

Year 2025 Volume: 7 Number: 6

APA
Yay Kuşçu, H. Y., & Görüş, Z. (2025). Performance of large language models on prosthodontics questions of the dentistry specialization examination: a comparative analysis (2014–2024). Anatolian Current Medical Journal, 7(6), 893-899. https://doi.org/10.38053/acmj.1789931
AMA
1.Yay Kuşçu HY, Görüş Z. Performance of large language models on prosthodontics questions of the dentistry specialization examination: a comparative analysis (2014–2024). Anatolian Curr Med J / ACMJ / acmj. 2025;7(6):893-899. doi:10.38053/acmj.1789931
Chicago
Yay Kuşçu, Hayriye Yasemin, and Zuhal Görüş. 2025. “Performance of Large Language Models on Prosthodontics Questions of the Dentistry Specialization Examination: A Comparative Analysis (2014–2024)”. Anatolian Current Medical Journal 7 (6): 893-99. https://doi.org/10.38053/acmj.1789931.
EndNote
Yay Kuşçu HY, Görüş Z (October 1, 2025) Performance of large language models on prosthodontics questions of the dentistry specialization examination: a comparative analysis (2014–2024). Anatolian Current Medical Journal 7 6 893–899.
IEEE
[1]H. Y. Yay Kuşçu and Z. Görüş, “Performance of large language models on prosthodontics questions of the dentistry specialization examination: a comparative analysis (2014–2024)”, Anatolian Curr Med J / ACMJ / acmj, vol. 7, no. 6, pp. 893–899, Oct. 2025, doi: 10.38053/acmj.1789931.
ISNAD
Yay Kuşçu, Hayriye Yasemin - Görüş, Zuhal. “Performance of Large Language Models on Prosthodontics Questions of the Dentistry Specialization Examination: A Comparative Analysis (2014–2024)”. Anatolian Current Medical Journal 7/6 (October 1, 2025): 893-899. https://doi.org/10.38053/acmj.1789931.
JAMA
1.Yay Kuşçu HY, Görüş Z. Performance of large language models on prosthodontics questions of the dentistry specialization examination: a comparative analysis (2014–2024). Anatolian Curr Med J / ACMJ / acmj. 2025;7:893–899.
MLA
Yay Kuşçu, Hayriye Yasemin, and Zuhal Görüş. “Performance of Large Language Models on Prosthodontics Questions of the Dentistry Specialization Examination: A Comparative Analysis (2014–2024)”. Anatolian Current Medical Journal, vol. 7, no. 6, Oct. 2025, pp. 893-9, doi:10.38053/acmj.1789931.
Vancouver
1.Hayriye Yasemin Yay Kuşçu, Zuhal Görüş. Performance of large language models on prosthodontics questions of the dentistry specialization examination: a comparative analysis (2014–2024). Anatolian Curr Med J / ACMJ / acmj. 2025 Oct. 1;7(6):893-9. doi:10.38053/acmj.1789931

Cited By

 

TR DİZİN ULAKBİM and International Indexes (1b)
 

Interuniversity Board (UAK) Equivalency:  Article published in Ulakbim TR Index journal [10 POINTS], and Article published in other (excuding 1a, b, c) international indexed journal (1d) [5 POINTS]

Note: Our journal is not WOS indexed and therefore is not classified as Q.

You can download Council of Higher Education (CoHG) [Yüksek Öğretim Kurumu (YÖK)] Criteria) decisions about predatory/questionable journals and the author's clarification text and journal charge policy from your browser. https://dergipark.org.tr/tr/journal/3449/file/4924/show

 

Journal Indexes and Platforms: 

TR Dizin ULAKBİM, Google Scholar, Crossref, Worldcat (OCLC), DRJI, EuroPub, OpenAIRE, Turkiye Citation Index, Turk Medline, ROAD, ICI World of Journal's, Index Copernicus, ASOS Index, General Impact Factor, Scilit.


 

The indexes of the journal's are;


 

download?token=eyJhdXRoX3JvbGVzIjpbXSwiZW5kcG9pbnQiOiJqb3VybmFsIiwib3JpZ2luYWxuYW1lIjoiVHJfSW5kZXhfbG9nby5wbmciLCJwYXRoIjoiMDFiOS82MmZhLzA3MzMvNjlkZjNlNTdhMmI4ZjkuODYxMzMxMjQucG5nIiwiZXhwIjoxNzc2MjQxNzY3LCJub25jZSI6ImQyMTQ4MjdiNTg1ZjVmMGQwYzAzZTMxNzMwM2QwMThmIn0.RmnGvwR536HdIoKpGO-ApytZ5aRPRT_BFXE2EpGSIqc

asos-index.png
 
f9ab67f.png
 
WorldCat_Logo_H_Color.png
 

 

18596download?token=eyJhdXRoX3JvbGVzIjpbXSwiZW5kcG9pbnQiOiJqb3VybmFsIiwib3JpZ2luYWxuYW1lIjoiT3BlbkFpcmUuanBnIiwicGF0aCI6IjUyMWYvZjljYy8wMDk3LzY5ZGYzZDNiYmVkZGU0LjQzNDM2OTU3LmpwZyIsImV4cCI6MTc3NjI0MTQ4NCwibm9uY2UiOiIwYjgxZDE2NzRiNzhjMWQyOGVmMDM1OTA1MzI5NjdjZiJ9.xeFppR1ubA4i-dHG-u07ht9bQNogFheXQjLyEaP9GgAimages?q=tbn:ANd9GcQgDnBwx0yUPRKuetgIurtELxYERFv20CPAUcPe4jYrrJiwXzac8rGXlzd57gl8iikb1Tk&usqp=CAU

 

84039476_619085835534619_7808805634291269632_n.jpg

 

 

 

The platforms of the journal's are;
 

COPE.jpg
 
images?q=tbn:ANd9GcTbq2FM8NTdXECzlOUCeKQ1dvrISFL-LhxhC7zy1ZQeJk-GGKSx2XkWQvrsHxcfhtfHWxM&usqp=CAUicmje_1_orig.png
 
 
ncbi.png
 
ORCID_logo.pngimages?q=tbn:ANd9GcQlwX77nfpy3Bu9mpMBZa0miWT2sRt2zjAPJKg2V69ODTrjZM1nT1BbhWzTVPsTNKJMZzQ&usqp=CAU
 

 

images?q=tbn:ANd9GcTaWSousoprPWGwE-qxwxGH2y0ByZ_zdLMN-Oq93MsZpBVFOTfxi9uXV7tdr39qvyE-U0I&usqp=CAU
 


 


 

 


 


The indexes/platforms of the journal are;
 

TR Dizin Ulakbim, Crossref (DOI), Google Scholar, EuroPub, Directory of Research Journal İndexing (DRJI), Worldcat (OCLC), OpenAIRE, ASOS Index, ROAD, Turkiye Citation Index, ICI World of Journal's, Index Copernicus, Turk Medline, General Impact Factor, Scilit 
 


Journal articles are evaluated as "Double-Blind Peer Review"

 

All articles published in this journal are licensed under a Creative Commons Attribution 4.0 International License (CC BY NC ND)