Araştırma Makalesi

Psychometric Evaluation of Automatically Generated Template-Based Psychiatry Questions for Medical Students: A Validity and Reliability Study

Cilt: 24 Sayı: 74 22 Aralık 2025
PDF İndir
EN TR

Psychometric Evaluation of Automatically Generated Template-Based Psychiatry Questions for Medical Students: A Validity and Reliability Study

Öz

Background: Multiple-choice questions (MCQs) are widely used in medical education due to their objectivity, efficiency, and ability to cover a broad knowledge base. Case-based MCQs provide additional benefits by evaluating students’ clinical reasoning and decision-making skills. In psychiatry education, unique challenges arise from overlapping symptoms, reliance on subjective reports, and the absence of objective diagnostic tools. The aim of this study was to administer MCQs generated through template-based automatic item generation (AIG) in psychiatry to medical students and to evaluate their psychometric properties (difficulty and discrimination indices). Methods: Following ethical approval from XXX University Ethics Committee, the study included 138 volunteer students (61.6%) from a total of 224 who completed psychiatry clerkship during the 2023–2024 and 2024–2025 academic years. From a pool of 1189 template-based automatically generated questions, 22 were randomly selected to form the exam. The test was administered face-to-face under supervision, and students were not informed of the origin of the items. Difficulty indices were calculated as the proportion of correct answers, while discrimination indices were computed by comparing the performance of the top 27% and bottom 27% groups. Results: The mean exam score was 15.21 ± 3.55 out of 22. The average difficulty index was 0.69, classifying the exam as “easy.” Of the items, 63.6% were very easy, 9.1% easy, and 27.3% moderate. The most difficult item concerned somatization (0.33), whereas the easiest was related to bipolar disorder (0.92). Discrimination indices ranged from 0.19 to 0.70, with an average of 0.37. Ten items (45.6%) demonstrated excellent discrimination, eleven (50%) acceptable, and one (4.5%) poor. The highest discrimination was observed in the schizophreniform disorder item (0.70), while the lowest was in the postpartum psychosis item (0.19). Conclusions: This study represents the first direct implementation of template-based AIG in Turkish psychiatry education. The findings demonstrated that automatically generated MCQs achieved acceptable psychometric standards in terms of both difficulty and discrimination. Template-based AIG may reduce faculty workload while ensuring consistent and high-quality question development. However, further refinement is needed to generate items assessing higher-order cognitive processes. Multicenter comparative studies could provide stronger evidence for the integration of AIG into medical education assessments.

Anahtar Kelimeler

Destekleyen Kurum

Yok.

Etik Beyan

Etik kurul onayı Gazi Üniversitesi Etik komisyonunda alınmıştır (Tarih: 13.02.2024/ Sayı: 05).

Teşekkür

Soruların oluşturulmasında kullanılan kodun yazılmasındaki desteklerinden ötürü Mehmet Ali Akyol’a teşekkür ederiz.

Kaynakça

  1. 1. Rohlfsen CJ, Sayles H, Moore GF, Mikuls TR, O'Dell JR, McBrien S, et al. Innovation in early medical education, no bells or whistles required. BMC Med Educ. 2020;20:39.
  2. 2. Gordon M, Farnan J, Grafton-Clarke C, Ahmed N, Pelly T, Roberts M, et al. Non-technical skills assessments in undergraduate medical education: A focused BEME systematic review: BEME Guide No. 54. Med Teach. 2019;41:732–45.
  3. 3. Daniel M, Rencic J, Durning SJ, Torre D, King A, Gordon M, et al. Clinical reasoning assessment methods: a scoping review and practical guidance. Acad Med. 2019;94:902–12.
  4. 4. Pugh D, De Champlain A, Touchie C. Plus ça change, plus c’est pareil: making a continued case for the use of MCQs in medical education. Med Teach. 2019;41:569–77.
  5. 5. Zaidi NLB, Grob KL, Monrad SM, Schroeder R, Santen SA, Hughes DT, et al. Pushing critical thinking skills with multiple-choice questions: does Bloom’s taxonomy work? Acad Med. 2018;93:856–9.
  6. 6. Corrao S, Argano C. Rethinking clinical decision-making to improve clinical reasoning. Front Med (Lausanne). 2022;9:900543.
  7. 7. Rejón AC. Logic structure of clinical judgment and its relation to medical and psychiatric semiology. Psychopathology. 2012;45:344–51.
  8. 8. Gierl MJ, Lai H, Tanygin V. Advanced Methods in Automatic Item Generation. 1st ed. New York: Routledge; 2021. p.42–66.

Ayrıntılar

Birincil Dil

İngilizce

Konular

Tıp Eğitimi

Bölüm

Araştırma Makalesi

Yayımlanma Tarihi

22 Aralık 2025

Gönderilme Tarihi

9 Eylül 2025

Kabul Tarihi

26 Ekim 2025

Yayımlandığı Sayı

Yıl 2025 Cilt: 24 Sayı: 74

Kaynak Göster

APA
Emekli, E., Soylu, R., Emekli, E., Kıyak, Y. S., Hosgören Alıcı, Y., Coşkun, Ö., & Budakoğlu, I. İ. (2025). Psychometric Evaluation of Automatically Generated Template-Based Psychiatry Questions for Medical Students: A Validity and Reliability Study. Tıp Eğitimi Dünyası, 24(74), 209-216. https://doi.org/10.25282/ted.1779377
AMA
1.Emekli E, Soylu R, Emekli E, vd. Psychometric Evaluation of Automatically Generated Template-Based Psychiatry Questions for Medical Students: A Validity and Reliability Study. TED. 2025;24(74):209-216. doi:10.25282/ted.1779377
Chicago
Emekli, Esra, Rabia Soylu, Emre Emekli, vd. 2025. “Psychometric Evaluation of Automatically Generated Template-Based Psychiatry Questions for Medical Students: A Validity and Reliability Study”. Tıp Eğitimi Dünyası 24 (74): 209-16. https://doi.org/10.25282/ted.1779377.
EndNote
Emekli E, Soylu R, Emekli E, Kıyak YS, Hosgören Alıcı Y, Coşkun Ö, Budakoğlu Iİ (01 Aralık 2025) Psychometric Evaluation of Automatically Generated Template-Based Psychiatry Questions for Medical Students: A Validity and Reliability Study. Tıp Eğitimi Dünyası 24 74 209–216.
IEEE
[1]E. Emekli vd., “Psychometric Evaluation of Automatically Generated Template-Based Psychiatry Questions for Medical Students: A Validity and Reliability Study”, TED, c. 24, sy 74, ss. 209–216, Ara. 2025, doi: 10.25282/ted.1779377.
ISNAD
Emekli, Esra - Soylu, Rabia - Emekli, Emre - Kıyak, Yavuz Selim - Hosgören Alıcı, Yasemin - Coşkun, Özlem - Budakoğlu, Işıl İrem. “Psychometric Evaluation of Automatically Generated Template-Based Psychiatry Questions for Medical Students: A Validity and Reliability Study”. Tıp Eğitimi Dünyası 24/74 (01 Aralık 2025): 209-216. https://doi.org/10.25282/ted.1779377.
JAMA
1.Emekli E, Soylu R, Emekli E, Kıyak YS, Hosgören Alıcı Y, Coşkun Ö, Budakoğlu Iİ. Psychometric Evaluation of Automatically Generated Template-Based Psychiatry Questions for Medical Students: A Validity and Reliability Study. TED. 2025;24:209–216.
MLA
Emekli, Esra, vd. “Psychometric Evaluation of Automatically Generated Template-Based Psychiatry Questions for Medical Students: A Validity and Reliability Study”. Tıp Eğitimi Dünyası, c. 24, sy 74, Aralık 2025, ss. 209-16, doi:10.25282/ted.1779377.
Vancouver
1.Esra Emekli, Rabia Soylu, Emre Emekli, Yavuz Selim Kıyak, Yasemin Hosgören Alıcı, Özlem Coşkun, Işıl İrem Budakoğlu. Psychometric Evaluation of Automatically Generated Template-Based Psychiatry Questions for Medical Students: A Validity and Reliability Study. TED. 01 Aralık 2025;24(74):209-16. doi:10.25282/ted.1779377