Artificial Intelligence in Pediatric Urology: Accuracy and Consistency of ChatGPT's Responses on Hypospadias
Abstract
Aim: This study aimed to evaluate the accuracy and reproducibility of ChatGPT (GPT-4-turbo) responses to frequently asked questions regarding hypospadias, a common congenital urological condition. As artificial intelligence (AI) becomes increasingly integrated into patient education, its reliability in delivering sensitive and clinically relevant information warrants empirical investigation.
Materials and Methods: Frequently asked questions about hypospadias were compiled from pediatric urology association websites, public health portals, and social media platforms. Questions were classified into five categories: general information, diagnosis, treatment, follow-up, and guideline-based recommendations. After duplicate, vague, and subjective questions were excluded, 97 unique items were entered into ChatGPT. Two pediatric urologists independently rated each answer on a four-point scale (1 = completely correct, 2 = correct but insufficient, 3 = partially misleading, 4 = completely incorrect). To assess reproducibility, each question was entered again on a separate device.
Results: Of the 97 responses, 87.6% were graded as completely correct, 7.2% as correct but insufficient, 4.1% as partially misleading, and 1.0% as completely incorrect. The highest rate of accurate answers was observed in the diagnosis and follow-up categories (90.0%), while treatment-related questions showed slightly lower accuracy (86.7%). Guideline-based questions were answered correctly in 87.5% of cases. Overall reproducibility across all categories was 91.7%, with the highest consistency in diagnostic responses.
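The grade percentages above can be checked against the 97 rated responses with a few lines of arithmetic. The sketch below is illustrative only: the abstract reports percentages, not raw counts, so the integer counts it prints are inferred values, and the function name `implied_counts` is a hypothetical helper introduced here.

```python
# Figures taken from the abstract: 97 rated responses and four grade percentages.
TOTAL_RESPONSES = 97
REPORTED_PERCENT = {
    "completely correct": 87.6,
    "correct but insufficient": 7.2,
    "partially misleading": 4.1,
    "completely incorrect": 1.0,
}


def implied_counts(total, percents):
    """Return the nearest integer count implied by each reported percentage."""
    return {grade: round(total * pct / 100) for grade, pct in percents.items()}


# These counts are inferred from the percentages; the source does not state them.
counts = implied_counts(TOTAL_RESPONSES, REPORTED_PERCENT)

for grade, n in counts.items():
    # Recompute each percentage from its inferred count to compare with the report.
    print(f"{grade}: {n}/{TOTAL_RESPONSES} = {100 * n / TOTAL_RESPONSES:.1f}%")

print("sum of implied counts:", sum(counts.values()))
```

Under this assumption the inferred counts add up to the 97 rated responses, and each recomputed percentage matches the reported value to one decimal place.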
Conclusions: ChatGPT demonstrated high accuracy and reproducibility in answering patient-centered questions related to hypospadias, particularly in diagnosis and general information domains. However, variability in treatment-related content and limitations in referencing highlight the importance of cautious interpretation. While AI may serve as a supplementary educational tool in pediatric urology, clinical oversight remains essential to ensure safe and reliable information dissemination.
Keywords
Ethical Statement
Since this study involved the analysis of responses generated by an artificial intelligence model (ChatGPT) to publicly available and anonymized questions, and did not include any human participants, patient data, or identifiable personal information, ethical approval was not required in accordance with institutional and international research ethics guidelines.
Details
Primary Language
English
Subjects
Pediatric Urology
Journal Section
Research Article
Publication Date
September 30, 2025
Submission Date
July 13, 2025
Acceptance Date
September 2, 2025
Published in Issue
Year: 2025, Volume: 15, Number: 5
APA
Kandemir, E., & Sarıkaya, M. (2025). Artificial Intelligence in Pediatric Urology: Accuracy and Consistency of ChatGPT’s Responses on Hypospadias. Journal of Contemporary Medicine, 15(5), 216-220. https://doi.org/10.16899/jcm.1741131
AMA
1. Kandemir E, Sarıkaya M. Artificial Intelligence in Pediatric Urology: Accuracy and Consistency of ChatGPT’s Responses on Hypospadias. J Contemp Med. 2025;15(5):216-220. doi:10.16899/jcm.1741131
Chicago
Kandemir, Emre, and Mehmet Sarıkaya. 2025. “Artificial Intelligence in Pediatric Urology: Accuracy and Consistency of ChatGPT’s Responses on Hypospadias”. Journal of Contemporary Medicine 15 (5): 216-20. https://doi.org/10.16899/jcm.1741131.
EndNote
Kandemir E, Sarıkaya M (September 1, 2025) Artificial Intelligence in Pediatric Urology: Accuracy and Consistency of ChatGPT’s Responses on Hypospadias. Journal of Contemporary Medicine 15 5 216–220.
IEEE
[1] E. Kandemir and M. Sarıkaya, “Artificial Intelligence in Pediatric Urology: Accuracy and Consistency of ChatGPT’s Responses on Hypospadias”, J Contemp Med, vol. 15, no. 5, pp. 216–220, Sept. 2025, doi: 10.16899/jcm.1741131.
ISNAD
Kandemir, Emre - Sarıkaya, Mehmet. “Artificial Intelligence in Pediatric Urology: Accuracy and Consistency of ChatGPT’s Responses on Hypospadias”. Journal of Contemporary Medicine 15/5 (September 1, 2025): 216-220. https://doi.org/10.16899/jcm.1741131.
JAMA
1. Kandemir E, Sarıkaya M. Artificial Intelligence in Pediatric Urology: Accuracy and Consistency of ChatGPT’s Responses on Hypospadias. J Contemp Med. 2025;15:216–220.
MLA
Kandemir, Emre, and Mehmet Sarıkaya. “Artificial Intelligence in Pediatric Urology: Accuracy and Consistency of ChatGPT’s Responses on Hypospadias”. Journal of Contemporary Medicine, vol. 15, no. 5, Sept. 2025, pp. 216-20, doi:10.16899/jcm.1741131.
Vancouver
1. Emre Kandemir, Mehmet Sarıkaya. Artificial Intelligence in Pediatric Urology: Accuracy and Consistency of ChatGPT’s Responses on Hypospadias. J Contemp Med. 2025 Sep. 1;15(5):216-20. doi:10.16899/jcm.1741131