Research Article

The Performance and Reliability of Generative AI Models in Software Development: A C# Based Analysis

Volume: 13 Number: 2 December 29, 2025
Halil Kaynarpınar , Abdulkadir Şeker *
EN

The Performance and Reliability of Generative AI Models in Software Development: A C# Based Analysis

Abstract

The effectiveness of generative artificial intelligence models in software development is determined not only by their ability to generate correct solutions but also by their adherence to quality metrics and their resilience to exceptional scenarios. In this context, a comparative evaluation was conducted on four models using 10 fundamental algorithm problems and 10 object-oriented programming problems in the C# programming language. The generated solutions were assessed in terms of time complexity, memory usage, lines of code, number of variables and methods, and execution time. In addition, meaningful edge-case scenarios were employed to measure error tolerance and exception handling performance. The findings indicate that all models produced functionally valid solutions, yet exhibited limitations in advanced software engineering practices such as modularity, comprehensive error management, performance measurement, and unit testing. The analysis revealed that ChatGPT and Gemini stood out in terms of structure and consistency, Claude demonstrated greater reliability in handling errors, while Copilot offered advantages in code simplicity. Overall, the results highlight the importance of evaluating generative AI models not only under ideal conditions but also in atypical scenarios to ensure software quality and reliability.

Keywords

GenAI, LLM, Programing, AI-assisted coding

References

  1. [1] A. Bozkurt, (2023) “ChatGPT, Üretken Yapay Zeka ve Algoritmik Paradigma Değişikliği”, Alanyazın, c. 4, sayı 1, ss. 63–72, doi: 10.59320/alanyazin.1283282.
  2. [2] S. Bulut, (2024) “Üretken Yapay Zeka Teknolojisi : GPT -4o”, Uluslararası İleri Doğa Bilimleri ve Mühendislik Araştırmaları Dergisi, sayı 8, ss. 380-387, 4.
  3. [3] A. Kahveci̇ Yeti̇ş ve R. Daş, (2022) “Yazılım Ürün Ölçütlerinin Uygulamalı İncelenmesi”, Fırat Üniversitesi Mühendislik Bilim. Derg., c. 34, sayı 2, ss. 635–645, Eyl. doi: 10.35234/FUMBD.1114056.
  4. [4] T. Demirhan, (2024) “Yazılım Geliştirme Öğreniminde Beceri Derinliği ve Dil Yeterliliğinin Yapay Zekâ ile Entegrasyonu”, c. 7, sayı 4, ss. 382–399.
  5. [5] M. Hanefi Calp ve N. Arici, (2011) “Nesne yönelimli tasarım metrikleri ve kalite özellikleriyle ilişkisi”, Politek. Derg. J. Polytech. Cilt Digit. Object Identifier, c. 14141, sayı 10, ss. 9–14.
  6. [6] U. Erdemir, U. Tekin, F. Bulut, “Nesneye Dayalı Yazılım Metrikleri ve Yazılım Kalitesi Object Oriented Software Metrics and Software Quality”, web.itu.edu.tr/buzluca/ykgs08_2.pdf.
  7. [7] A. Abdou ve N. Darwish, (2024) “Severity classification of software code smells using machine learning techniques: A comparative study”, J. Softw. Evol. Process, c. 36, sayı 1, doi: 10.1002/smr.2454.
  8. [8] A. Kıral ve T. E. Ayyıldız, (2018) “Yazılım Kalite Metriklerinin Kıyaslanması: Örnek Bir Olay İncelemesi”, 12. Ulusal Yazılım Mühendisliği Sempozyumu (UYMS' 2018) , İstanbul, Türkiye.
  9. [9] M. Monteiro ve e. a. (2023), “End-to-End Software Construction using ChatGPT : An Experience Report End-to-End Software Construction using ChatGPT : An Experience Report”, sayı October, doi: 10.13140/RG.2.2.18968.98566.
  10. [10] W. Y. Chen, (2024) “Intelligent Tutor: Leveraging ChatGPT and Microsoft Copilot Studio to Deliver a Generative AI Student Support and Feedback System within Teams”, arXiv:2405.13024.
APA
Kaynarpınar, H., & Şeker, A. (2025). The Performance and Reliability of Generative AI Models in Software Development: A C# Based Analysis. MANAS Journal of Engineering, 13(2), 125-142. https://doi.org/10.51354/mjen.1784716
AMA
1.Kaynarpınar H, Şeker A. The Performance and Reliability of Generative AI Models in Software Development: A C# Based Analysis. MJEN. 2025;13(2):125-142. doi:10.51354/mjen.1784716
Chicago
Kaynarpınar, Halil, and Abdulkadir Şeker. 2025. “The Performance and Reliability of Generative AI Models in Software Development: A C# Based Analysis”. MANAS Journal of Engineering 13 (2): 125-42. https://doi.org/10.51354/mjen.1784716.
EndNote
Kaynarpınar H, Şeker A (December 1, 2025) The Performance and Reliability of Generative AI Models in Software Development: A C# Based Analysis. MANAS Journal of Engineering 13 2 125–142.
IEEE
[1]H. Kaynarpınar and A. Şeker, “The Performance and Reliability of Generative AI Models in Software Development: A C# Based Analysis”, MJEN, vol. 13, no. 2, pp. 125–142, Dec. 2025, doi: 10.51354/mjen.1784716.
ISNAD
Kaynarpınar, Halil - Şeker, Abdulkadir. “The Performance and Reliability of Generative AI Models in Software Development: A C# Based Analysis”. MANAS Journal of Engineering 13/2 (December 1, 2025): 125-142. https://doi.org/10.51354/mjen.1784716.
JAMA
1.Kaynarpınar H, Şeker A. The Performance and Reliability of Generative AI Models in Software Development: A C# Based Analysis. MJEN. 2025;13:125–142.
MLA
Kaynarpınar, Halil, and Abdulkadir Şeker. “The Performance and Reliability of Generative AI Models in Software Development: A C# Based Analysis”. MANAS Journal of Engineering, vol. 13, no. 2, Dec. 2025, pp. 125-42, doi:10.51354/mjen.1784716.
Vancouver
1.Halil Kaynarpınar, Abdulkadir Şeker. The Performance and Reliability of Generative AI Models in Software Development: A C# Based Analysis. MJEN. 2025 Dec. 1;13(2):125-42. doi:10.51354/mjen.1784716