Research Article

Diagnosis of the Skin Cancer by Vision Transformers

Volume: 13 Number: 1 January 30, 2025
TR EN

Diagnosis of the Skin Cancer by Vision Transformers

Abstract

Skin cancer, one of the most frequent cancers, requires early identification for efficient treatment and better survival. Early diagnosis relies on precise and quick skin lesion categorization into benign and malignant categories. This work uses Vision Transformers (ViTs) to classify skin cancer photos by modeling long-range relationships and capturing complicated visual patterns. ViTs, originally created for natural language processing, have showed great potential in picture classification tasks because to their self-attention processes, outperforming CNNs. A public collection of 270 skin lesion images—240 malignant and 30 benign—was used in this study. Preprocessing included scaling and normalizing the dataset to 384x384x3 and splitting it into 80% training and 20% testing sets. Transfer learning optimised a pre-trained ViT model for this job. To improve accuracy and avoid overfitting, hyperparameters were carefully selected for network training. Parallel computing accelerated training to 30 minutes and 20 seconds. Vision Transformers classify medical images well, according to the study. The ViT model outperformed numerous other methods with 98.15% accuracy on the test set. A confusion matrix analysis showed great sensitivity in detecting malignant lesions and low misclassification of benign patients. These findings show that ViTs can capture detailed medical picture aspects, making them useful for dermatological diagnoses. This study shows that Vision Transformers can improve diagnosis accuracy and lays the groundwork for their use in other medical imaging fields. In the battle against skin cancer and other illnesses that need early diagnosis, ViTs' scalability, efficiency, and precision are important. Future research might integrate ViTs with other deep learning architectures to improve robustness and flexibility. This study adds to the data supporting sophisticated AI in medical diagnostics and lays the groundwork for automated, reliable, and efficient healthcare solutions.

Keywords

Ethical Statement

Ethical approval: The authors declare that they comply with ethical standards. Conflict of Interest: The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this manuscript. Data availability: Since no datasets were collected or analyzed during this study, data sharing does not apply to this publication. There are no data associated with this manuscript. Any inquiries regarding data availability should be directed to the authors.

References

  1. [1] B. Banushi, S. R. Joseph, B. Lum, J. J. Lee, and F. Simpson. “Endocytosis in cancer and cancer therapy,” Nature Reviews Cancer, 23(7), 450-473, 2023.
  2. [2] M. Naqvi, S. Q. Gilani, T. Syed, O. Marques, and H. C. Kim. “Skin cancer detection using deep learning—a review,” Diagnostics, 13(11), 1911, 2023.
  3. [3] K. Han, Y. Wang, H. Chen, X. Chen, J. Guo, Z. Liu, and D. Tao. “A survey on vision transformer,” IEEE transactions on pattern analysis and machine intelligence, 45(1), 87-110, 2022.
  4. [4] A. B. Subba, and A. K. Sunaniya. “Computationally optimized brain tumor classification using attention based GoogLeNet-style CNN,” Expert Systems with Applications, 125443, 2024.
  5. [5] K. Elbedoui, H. Mzoughi, and M. B. Slima. “Deep Learning Approaches for Dermoscopic Image-Based Skin Cancer Diagnosis. In 2024 IEEE 7th International Conference on Advanced Technologies, Signal and Image Processing (ATSIP), 2024, Vol. 1, pp. 1-7, IEEE.
  6. [6] S. Mejri, and A. E. Oueslati. Dermoscopic Images Classification Using Pretrained VGG-16 and ResNet-50 Models”, in 2024 IEEE 7th International Conference on Advanced Technologies, Signal and Image Processing (ATSIP), 2024, Vol. 1, pp. 342-347, IEEE.
  7. [7] M. Hameed, A. Zameer, and M. A. Z. Raja. “A Comprehensive Systematic Review: Advancements in Skin Cancer Classification and Segmentation Using the ISIC Dataset,” CMES-Computer Modeling in Engineering & Sciences, 140(3), 2024.
  8. [8] M. R. Kumar, S. Priyanga, J. S. Anusha, V. Chatiyode, J. Santiago, and P. Revath. “Synergistic Skin Cancer Classification: Vision Transformer alongside MobileNetV2,” in 2023 4th International Conference on Intelligent Technologies (CONIT), 2024, pp. 1-7, IEEE.

Details

Primary Language

English

Subjects

Deep Learning, Machine Vision

Journal Section

Research Article

Publication Date

January 30, 2025

Submission Date

October 23, 2024

Acceptance Date

December 12, 2024

Published in Issue

Year 2025 Volume: 13 Number: 1

APA
Demiroğlu, U. (2025). Diagnosis of the Skin Cancer by Vision Transformers. Duzce University Journal of Science and Technology, 13(1), 588-598. https://doi.org/10.29130/dubited.1572317
AMA
1.Demiroğlu U. Diagnosis of the Skin Cancer by Vision Transformers. DUBİTED. 2025;13(1):588-598. doi:10.29130/dubited.1572317
Chicago
Demiroğlu, Uğur. 2025. “Diagnosis of the Skin Cancer by Vision Transformers”. Duzce University Journal of Science and Technology 13 (1): 588-98. https://doi.org/10.29130/dubited.1572317.
EndNote
Demiroğlu U (January 1, 2025) Diagnosis of the Skin Cancer by Vision Transformers. Duzce University Journal of Science and Technology 13 1 588–598.
IEEE
[1]U. Demiroğlu, “Diagnosis of the Skin Cancer by Vision Transformers”, DUBİTED, vol. 13, no. 1, pp. 588–598, Jan. 2025, doi: 10.29130/dubited.1572317.
ISNAD
Demiroğlu, Uğur. “Diagnosis of the Skin Cancer by Vision Transformers”. Duzce University Journal of Science and Technology 13/1 (January 1, 2025): 588-598. https://doi.org/10.29130/dubited.1572317.
JAMA
1.Demiroğlu U. Diagnosis of the Skin Cancer by Vision Transformers. DUBİTED. 2025;13:588–598.
MLA
Demiroğlu, Uğur. “Diagnosis of the Skin Cancer by Vision Transformers”. Duzce University Journal of Science and Technology, vol. 13, no. 1, Jan. 2025, pp. 588-9, doi:10.29130/dubited.1572317.
Vancouver
1.Uğur Demiroğlu. Diagnosis of the Skin Cancer by Vision Transformers. DUBİTED. 2025 Jan. 1;13(1):588-9. doi:10.29130/dubited.1572317

Cited By