Research Article

CLASSIFICATION OF ART PAINTINGS USING VISION TRANSFORMERS

Volume: 3 Number: 1 June 27, 2025
EN

CLASSIFICATION OF ART PAINTINGS USING VISION TRANSFORMERS

Abstract

In this study, we investigated the performance of art painting classification based on the categories of ``Genre", ``Artist", and ``Style" using deep learning methods. We employed convolutional neural network (CNN)-based models such as ResNet, MobileNet, EfficientNet, and ConvNeXt for their effectiveness in feature extraction. Additionally, we used vision transformer models, including ViT, Swin, BEiT, and DeiT, which used attention mechanisms. We conducted our experiments on the publicly available WikiArt dataset, and the BEiT model achieved the highest classification accuracy in the Artist and Genre categories, with results of 84.90% and 79.52%, respectively. In Style category, the Swin model produced the best result with an accuracy of 72.59%. In general, our findings indicate that transformer-based methods outperformed CNN-based methods. Furthermore, we compared our results with similar studies in the literature and showed that transformer-based models generally perform better in classifying art paintings.

Keywords

References

  1. M. A. O¨ zdal, Resim ic¸erig˘i sınıflandırmasında yapay zekanın rolu¨, D-Sanat 1 (9) (2025) 56–71.
  2. B. Saleh, A. Elgammal, Large-scale classification of fine-art paintings: Learning the right metric on the right feature, arXiv preprint arXiv:1505.00855 (2015).
  3. A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, Ł. Kaiser, I. Polosukhin, Attention is all you need, Advances in neural information processing systems 30 (2017).
  4. A. Dosovitskiy, L. Beyer, A. Kolesnikov, D. Weissenborn, X. Zhai, T. Unterthiner, M. Dehghani, M. Minderer, G. Heigold, S. Gelly, et al., An image is worth 16x16 words: Transformers for image recognition at scale, arXiv preprint arXiv:2010.11929 (2020).
  5. Y. LeCun, L. Bottou, Y. Bengio, P. Haffner, Gradient-based learning applied to document recognition, Proceedings of the IEEE 86 (11) (1998) 2278–2324.
  6. H. Bao, L. Dong, S. Piao, F.Wei, Beit: Bert pre-training of image transformers, arXiv preprint arXiv:2106.08254 (2021).
  7. Z. Liu, Y. Lin, Y. Cao, H. Hu, Y.Wei, Z. Zhang, S. Lin, B. Guo, Swin transformer: Hierarchical vision transformer using shifted windows, in: Proceedings of the IEEE/CVF international conference on computer vision, 2021, pp. 10012–10022.
  8. H. Touvron, M. Cord, M. Douze, F. Massa, A. Sablayrolles, H. J´egou, Training data-efficient image transformers & distillation through attention, in: International conference on machine learning, PMLR, 2021, pp. 10347–10357.

Details

Primary Language

English

Subjects

Computer Vision

Journal Section

Research Article

Early Pub Date

June 9, 2025

Publication Date

June 27, 2025

Submission Date

December 3, 2024

Acceptance Date

December 26, 2024

Published in Issue

Year 2025 Volume: 3 Number: 1

APA
İnal, N., & Çiftçi, S. (2025). CLASSIFICATION OF ART PAINTINGS USING VISION TRANSFORMERS. Current Trends in Computing, 3(1), 1-16. https://doi.org/10.71074/CTC.1595868
AMA
1.İnal N, Çiftçi S. CLASSIFICATION OF ART PAINTINGS USING VISION TRANSFORMERS. CTC. 2025;3(1):1-16. doi:10.71074/CTC.1595868
Chicago
İnal, Nergiz, and Serdar Çiftçi. 2025. “CLASSIFICATION OF ART PAINTINGS USING VISION TRANSFORMERS”. Current Trends in Computing 3 (1): 1-16. https://doi.org/10.71074/CTC.1595868.
EndNote
İnal N, Çiftçi S (June 1, 2025) CLASSIFICATION OF ART PAINTINGS USING VISION TRANSFORMERS. Current Trends in Computing 3 1 1–16.
IEEE
[1]N. İnal and S. Çiftçi, “CLASSIFICATION OF ART PAINTINGS USING VISION TRANSFORMERS”, CTC, vol. 3, no. 1, pp. 1–16, June 2025, doi: 10.71074/CTC.1595868.
ISNAD
İnal, Nergiz - Çiftçi, Serdar. “CLASSIFICATION OF ART PAINTINGS USING VISION TRANSFORMERS”. Current Trends in Computing 3/1 (June 1, 2025): 1-16. https://doi.org/10.71074/CTC.1595868.
JAMA
1.İnal N, Çiftçi S. CLASSIFICATION OF ART PAINTINGS USING VISION TRANSFORMERS. CTC. 2025;3:1–16.
MLA
İnal, Nergiz, and Serdar Çiftçi. “CLASSIFICATION OF ART PAINTINGS USING VISION TRANSFORMERS”. Current Trends in Computing, vol. 3, no. 1, June 2025, pp. 1-16, doi:10.71074/CTC.1595868.
Vancouver
1.Nergiz İnal, Serdar Çiftçi. CLASSIFICATION OF ART PAINTINGS USING VISION TRANSFORMERS. CTC. 2025 Jun. 1;3(1):1-16. doi:10.71074/CTC.1595868