Gastrointestinal (GI) diseases are a major issue in the human digestive system. Therefore, many studies have explored the automatic classification of GI diseases to reduce the burden on clinicians and improve patient outcomes for both diagnosis and treatment purposes. Convolutional neural networks (CNNs) and Vision Transformers (ViTs) in deep learning approaches have become a popular research area for the automatic detection of diseases from medical images. This study evaluated the classification performance of thirteen different CNN models and two different ViT architectures on endoscopic images. The impact of transfer learning parameters on classification performance was also observed. The tests revealed that the classification accuracies of the ViT models were 91.25% and 90.50%, respectively. In contrast, the DenseNet201 architecture, with optimized transfer learning parameters, achieved an accuracy of 93.13%, recall of 93.17%, precision of 93.13%, and an F1 score of 93.11%, making it the most successful model among all the others. Considering the results, it is evident that a well-optimized CNN model achieved better classification performance than the ViT models.
Medical Image Classification Convolutional Neural Networks Vision Transformers Fine Tuning Transfer Learning Gastrointestinal Diseases
Birincil Dil | İngilizce |
---|---|
Konular | Bilgisayar Yazılımı, Yazılım Mühendisliği (Diğer) |
Bölüm | Bilgisayar Mühendisliği / Computer Engineering |
Yazarlar | |
Erken Görünüm Tarihi | 27 Ağustos 2024 |
Yayımlanma Tarihi | 1 Eylül 2024 |
Gönderilme Tarihi | 15 Haziran 2024 |
Kabul Tarihi | 21 Temmuz 2024 |
Yayımlandığı Sayı | Yıl 2024 |