Research Article

Evaluation of the performance of artificial intelligence platforms in answering and generating new questions in prosthetic dentistry specialization

Volume: 8 Number: 3 May 22, 2026

Evaluation of the performance of artificial intelligence platforms in answering and generating new questions in prosthetic dentistry specialization

Abstract

Aims: This study aimed to evaluate large language models (LLMs) not only in answering Dentistry Specialty Examination (DUS) questions but also in generating new DUS-format questions, with expert validation of educational and clinical quality. Methods: A total of 130 official DUS questions published between 2012 and 2021 were used to assess answering performance of four LLMs (ChatGPT, Gemini, DeepSeek, and Grok). Additionally, each model generated 20 new multiple-choice questions (n=80), which were independently evaluated by expert prosthodontists for content accuracy, clinical relevance, discriminative capacity, and conformity with DUS standards. Expert-approved questions were subsequently re-answered by all models to enable cross-model performance analysis. Model performances were compared using descriptive statistics, one-sample proportion tests against chance level (p₀=0.20), and inter-model comparisons using Cochran’s Q and McNemar tests. Results: ChatGPT achieved the highest overall accuracy on historical DUS questions (81.3%), followed by Gemini and DeepSeek (72.8% and 70.3%) and Grok (68.8%). In expert-validated AI-generated questions, overall accuracy rates ranged between 71.3% and 78.8% across models, with no statistically significant inter-model difference (Q=3.82, p=0.28). All models performed significantly above chance level (p<0.001). Importantly, question-generation quality and answering performance were not consistently aligned across models. Conclusion: Although LLMs demonstrate statistically significant performance in DUS-style questions, both answering accuracy and educational validity of AI-generated questions require expert supervision. LLMs should be considered supportive tools rather than autonomous agents in high-stakes dental education and assessment contexts.

Keywords

Supporting Institution

There is no Supporting Institution

Project Number

There is no project number

Ethical Statement

There is no Ethical Statement

Thanks

There is no Thanks

References

  1. Aggarwal A, Tam CC, Wu D, Li X, Qiao S. Artificial intelligence-based chatbots for promoting health behavioral changes: systematic review. J Med Internet Res. 2023;25:e40789. doi:10.2196/40789
  2. Fatani B. ChatGPT for future medical and dental research. Cureus. 2023;15(4):e37285. doi:10.7759/cureus.37285
  3. Alhaidry HM, Fatani B, Alrayes JO, Almana AM, Alfhaed NK. ChatGPT in dentistry: a comprehensive review. Cureus. 2023;15(4):e38317. doi:10. 7759/cureus.38317
  4. Ding H, Wu J, Zhao W, Matinlinna JP, Burrow MF, Tsoi JKH. Artificial intelligence in dentistry-a review. Front Dent Med. 2023;4:1085251. doi: 10.3389/fdmed.2023.1085251
  5. Aura-Tormos JI, Llacer-Martinez M, Torres-Osca I. Educational applications of ChatGPT in university-based dental education. A systematic review. Eur J Dent Educ. 2025. doi:10.1111/eje.70011
  6. Baluch W. ChatGPT in a controlled exam: exam-based evidence on student-AI collaboration, teacher-support assessment tools, and emerging cognitive profiles. SSRN Electron J. 2025. doi:10.2139/ssrn.5934534
  7. Krumsvik RJ. Artificial intelligence in nurse education: a new sparring partner? GPT-4 capabilities in formative and summative assessment in the National Examination in Anatomy, Physiology, and Biochemistry. Nord J Digit Lit. 2024(3):172-186. doi:10.18261/njdl.19.3.5
  8. Gan W, Ouyang J, Li H, et al. Integrating ChatGPT in orthopedic education for medical undergraduates: randomized controlled trial. J Med Internet Res. 2024;26:e57037. doi:10.2196/57037

Details

Primary Language

English

Subjects

Prosthodontics

Journal Section

Research Article

Publication Date

May 22, 2026

Submission Date

December 24, 2025

Acceptance Date

March 23, 2026

Published in Issue

Year 2026 Volume: 8 Number: 3

APA
Kuşçu, A. İ., Çınarer, G., & Kuşçu, S. (2026). Evaluation of the performance of artificial intelligence platforms in answering and generating new questions in prosthetic dentistry specialization. Anatolian Current Medical Journal, 8(3), 409-416. https://izlik.org/JA36ZU52UK
AMA
1.Kuşçu Aİ, Çınarer G, Kuşçu S. Evaluation of the performance of artificial intelligence platforms in answering and generating new questions in prosthetic dentistry specialization. Anatolian Curr Med J / ACMJ / acmj. 2026;8(3):409-416. https://izlik.org/JA36ZU52UK
Chicago
Kuşçu, Aliye İpek, Gökalp Çınarer, and Süha Kuşçu. 2026. “Evaluation of the Performance of Artificial Intelligence Platforms in Answering and Generating New Questions in Prosthetic Dentistry Specialization”. Anatolian Current Medical Journal 8 (3): 409-16. https://izlik.org/JA36ZU52UK.
EndNote
Kuşçu Aİ, Çınarer G, Kuşçu S (May 1, 2026) Evaluation of the performance of artificial intelligence platforms in answering and generating new questions in prosthetic dentistry specialization. Anatolian Current Medical Journal 8 3 409–416.
IEEE
[1]A. İ. Kuşçu, G. Çınarer, and S. Kuşçu, “Evaluation of the performance of artificial intelligence platforms in answering and generating new questions in prosthetic dentistry specialization”, Anatolian Curr Med J / ACMJ / acmj, vol. 8, no. 3, pp. 409–416, May 2026, [Online]. Available: https://izlik.org/JA36ZU52UK
ISNAD
Kuşçu, Aliye İpek - Çınarer, Gökalp - Kuşçu, Süha. “Evaluation of the Performance of Artificial Intelligence Platforms in Answering and Generating New Questions in Prosthetic Dentistry Specialization”. Anatolian Current Medical Journal 8/3 (May 1, 2026): 409-416. https://izlik.org/JA36ZU52UK.
JAMA
1.Kuşçu Aİ, Çınarer G, Kuşçu S. Evaluation of the performance of artificial intelligence platforms in answering and generating new questions in prosthetic dentistry specialization. Anatolian Curr Med J / ACMJ / acmj. 2026;8:409–416.
MLA
Kuşçu, Aliye İpek, et al. “Evaluation of the Performance of Artificial Intelligence Platforms in Answering and Generating New Questions in Prosthetic Dentistry Specialization”. Anatolian Current Medical Journal, vol. 8, no. 3, May 2026, pp. 409-16, https://izlik.org/JA36ZU52UK.
Vancouver
1.Aliye İpek Kuşçu, Gökalp Çınarer, Süha Kuşçu. Evaluation of the performance of artificial intelligence platforms in answering and generating new questions in prosthetic dentistry specialization. Anatolian Curr Med J / ACMJ / acmj [Internet]. 2026 May 1;8(3):409-16. Available from: https://izlik.org/JA36ZU52UK

 

TR DİZİN ULAKBİM and International Indexes (1b)
 

Interuniversity Board (UAK) Equivalency:  Article published in Ulakbim TR Index journal [10 POINTS], and Article published in other (excuding 1a, b, c) international indexed journal (1d) [5 POINTS]

Note: Our journal is not WOS indexed and therefore is not classified as Q.

You can download Council of Higher Education (CoHG) [Yüksek Öğretim Kurumu (YÖK)] Criteria) decisions about predatory/questionable journals and the author's clarification text and journal charge policy from your browser. https://dergipark.org.tr/tr/journal/3449/file/4924/show

 

Journal Indexes and Platforms: 

TR Dizin ULAKBİM, Google Scholar, Crossref, Worldcat (OCLC), DRJI, EuroPub, OpenAIRE, Turkiye Citation Index, Turk Medline, ROAD, ICI World of Journal's, Index Copernicus, ASOS Index, General Impact Factor, Scilit.


 

The indexes of the journal's are;


 

download?token=eyJhdXRoX3JvbGVzIjpbXSwiZW5kcG9pbnQiOiJqb3VybmFsIiwib3JpZ2luYWxuYW1lIjoiVHJfSW5kZXhfbG9nby5wbmciLCJwYXRoIjoiMDFiOS82MmZhLzA3MzMvNjlkZjNlNTdhMmI4ZjkuODYxMzMxMjQucG5nIiwiZXhwIjoxNzc2MjQxNzY3LCJub25jZSI6ImQyMTQ4MjdiNTg1ZjVmMGQwYzAzZTMxNzMwM2QwMThmIn0.RmnGvwR536HdIoKpGO-ApytZ5aRPRT_BFXE2EpGSIqc

asos-index.png
 
f9ab67f.png
 
WorldCat_Logo_H_Color.png
 

 

18596download?token=eyJhdXRoX3JvbGVzIjpbXSwiZW5kcG9pbnQiOiJqb3VybmFsIiwib3JpZ2luYWxuYW1lIjoiT3BlbkFpcmUuanBnIiwicGF0aCI6IjUyMWYvZjljYy8wMDk3LzY5ZGYzZDNiYmVkZGU0LjQzNDM2OTU3LmpwZyIsImV4cCI6MTc3NjI0MTQ4NCwibm9uY2UiOiIwYjgxZDE2NzRiNzhjMWQyOGVmMDM1OTA1MzI5NjdjZiJ9.xeFppR1ubA4i-dHG-u07ht9bQNogFheXQjLyEaP9GgAimages?q=tbn:ANd9GcQgDnBwx0yUPRKuetgIurtELxYERFv20CPAUcPe4jYrrJiwXzac8rGXlzd57gl8iikb1Tk&usqp=CAU

 

84039476_619085835534619_7808805634291269632_n.jpg

 

 

 

The platforms of the journal's are;
 

COPE.jpg
 
images?q=tbn:ANd9GcTbq2FM8NTdXECzlOUCeKQ1dvrISFL-LhxhC7zy1ZQeJk-GGKSx2XkWQvrsHxcfhtfHWxM&usqp=CAUicmje_1_orig.png
 
 
ncbi.png
 
ORCID_logo.pngimages?q=tbn:ANd9GcQlwX77nfpy3Bu9mpMBZa0miWT2sRt2zjAPJKg2V69ODTrjZM1nT1BbhWzTVPsTNKJMZzQ&usqp=CAU
 

 

images?q=tbn:ANd9GcTaWSousoprPWGwE-qxwxGH2y0ByZ_zdLMN-Oq93MsZpBVFOTfxi9uXV7tdr39qvyE-U0I&usqp=CAU
 


 


 

 


 


The indexes/platforms of the journal are;
 

TR Dizin Ulakbim, Crossref (DOI), Google Scholar, EuroPub, Directory of Research Journal İndexing (DRJI), Worldcat (OCLC), OpenAIRE, ASOS Index, ROAD, Turkiye Citation Index, ICI World of Journal's, Index Copernicus, Turk Medline, General Impact Factor, Scilit 
 


Journal articles are evaluated as "Double-Blind Peer Review"

 

All articles published in this journal are licensed under a Creative Commons Attribution 4.0 International License (CC BY NC ND)