Araştırma Makalesi
BibTex RIS Kaynak Göster

Fourier-Based Image Classification Using CNN

Yıl 2024, Cilt: 5 Sayı: 1, 92 - 101, 21.06.2024
https://doi.org/10.53525/jster.1501920

Öz

Recently, Convolutional Neural Networks (CNNs) have achieved remarkable success in
computer vision, image processing and image processing tasks. Traditional CNN models
work directly with spatial domain images. On the other hand, images obtained with Fast
Fourier Transform (FFT) represent the Frequency domain and provide an advantage in
computational cost by reducing potential calculation complexity. This study uses FFT
converted images as input to the CNN algorithm to increase image classification and
recognition accuracy and investigates the effects of this. The study begins with a
comprehensive review of the foundations and features of FFT. It assumes that by
converting the input images from the Spatial domain to the Frequency domain, the input
image can be learned more efficiently and better results can be achieved in terms of
performance by studying the most important features in the Frequency domain. To evaluate
the effectiveness of this assumption, CIFAR-10, MNIST-Digits and MNIST-Fashion
datasets were used. As a result, it has been shown that FFT-based preprocessing can
improve classification accuracy, especially in cases where the datasets contain
high-frequency noise, and it has shown different results in different datasets. Therefore, it
is thought that the effect of FFT preprocessing varies depending on the datasets.

Kaynakça

  • [1] S. Russell and P. Norvig, "Artificial Intelligence: A Modern Approach," 3rd ed., Prentice Hall, Upper Saddle River, NJ, USA, 2020.
  • [2] I. Goodfellow, Y. Bengio, and A. Courville, "Deep Learning," MIT Press, Cambridge, MA, USA, 2016.
  • [3] A. Krizhevsky, I. Sutskever, and G. E. Hinton, "ImageNet Classification with Deep Convolutional Neural Networks," in Advances in Neural Information Processing Systems 25, 2012, pp. 1097-1105.
  • [4] K. He, X. Zhang, S. Ren, and J. Sun, "Deep Residual Learning for Image Recognition," in Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2015, pp. 770-778.
  • [5] C. Szegedy et al., "Going Deeper with Convolutions," in Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2015, pp. 1-9.
  • [6] K. Simonyan and A. Zisserman, "Very Deep Convolutional Networks for Large-Scale Image Recognition," in Proc. Int. Conf. Learning Representations, 2014.
  • [7] V. Mnih et al., "Playing Atari with Deep Reinforcement Learning," in Advances in Neural Information Processing Systems 27, 2013, pp. 1-9.
  • [8] J. Devlin, M.-W. Chang, K. Lee, and K. Toutanova, "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding," in Proc. NAACL-HLT, 2018, pp. 4171-4186.
  • [9] I. Goodfellow et al., "Generative Adversarial Nets," in Advances in Neural Information Processing Systems 27, 2014, pp. 2672-2680.
  • [10] A. Vaswani et al., "Attention is All You Need," in Advances in Neural Information Processing Systems 30, 2017, pp. 5998-6008.
  • [11] R. Girshick, J. Donahue, T. Darrell, and J. Malik, "R-CNN: Regions with Convolutional Neural Network Features," in Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2013, pp. 580-587.
  • [12] Akwasi Darkwah Akwaboah, "Implementation of Convolutional Neural Networks for CIFAR-10 Image Classification," 2019.
  • [13] Ajala Sunday Adeyinka, "Convolutional Neural Network Implementation for Classification using CIFAR-10," ResearchGate, 2023.
  • [14] Hengyue Pan, "Learning Convolutional Neural Networks in Frequency Domain," ResearchGate, 2023.
  • [15] S. Tötterström, "Frequency Domain Image Classification with Convolutional Neural Networks," Bachelor’s Thesis, Tampere University, 2023.
  • [16] Stuchi, J. A., Canto, N. G., de Faissol Attux, R. R., & Boccato, L. (2024). A frequency-domain approach with learnable filters for image classification. Applied Soft Computing, 155, 111443.

Fourier-Based Image Classification Using CNN

Yıl 2024, Cilt: 5 Sayı: 1, 92 - 101, 21.06.2024
https://doi.org/10.53525/jster.1501920

Öz

Recently, Convolutional Neural Networks (CNNs) have achieved remarkable success in
computer vision, image processing and image processing tasks. Traditional CNN models
work directly with spatial domain images. On the other hand, images obtained with Fast
Fourier Transform (FFT) represent the Frequency domain and provide an advantage in
computational cost by reducing potential calculation complexity. This study uses FFT
converted images as input to the CNN algorithm to increase image classification and
recognition accuracy and investigates the effects of this. The study begins with a
comprehensive review of the foundations and features of FFT. It assumes that by
converting the input images from the Spatial domain to the Frequency domain, the input
image can be learned more efficiently and better results can be achieved in terms of
performance by studying the most important features in the Frequency domain. To evaluate
the effectiveness of this assumption, CIFAR-10, MNIST-Digits and MNIST-Fashion
datasets were used. As a result, it has been shown that FFT-based preprocessing can
improve classification accuracy, especially in cases where the datasets contain
high-frequency noise, and it has shown different results in different datasets. Therefore, it
is thought that the effect of FFT preprocessing varies depending on the datasets.

Kaynakça

  • [1] S. Russell and P. Norvig, "Artificial Intelligence: A Modern Approach," 3rd ed., Prentice Hall, Upper Saddle River, NJ, USA, 2020.
  • [2] I. Goodfellow, Y. Bengio, and A. Courville, "Deep Learning," MIT Press, Cambridge, MA, USA, 2016.
  • [3] A. Krizhevsky, I. Sutskever, and G. E. Hinton, "ImageNet Classification with Deep Convolutional Neural Networks," in Advances in Neural Information Processing Systems 25, 2012, pp. 1097-1105.
  • [4] K. He, X. Zhang, S. Ren, and J. Sun, "Deep Residual Learning for Image Recognition," in Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2015, pp. 770-778.
  • [5] C. Szegedy et al., "Going Deeper with Convolutions," in Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2015, pp. 1-9.
  • [6] K. Simonyan and A. Zisserman, "Very Deep Convolutional Networks for Large-Scale Image Recognition," in Proc. Int. Conf. Learning Representations, 2014.
  • [7] V. Mnih et al., "Playing Atari with Deep Reinforcement Learning," in Advances in Neural Information Processing Systems 27, 2013, pp. 1-9.
  • [8] J. Devlin, M.-W. Chang, K. Lee, and K. Toutanova, "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding," in Proc. NAACL-HLT, 2018, pp. 4171-4186.
  • [9] I. Goodfellow et al., "Generative Adversarial Nets," in Advances in Neural Information Processing Systems 27, 2014, pp. 2672-2680.
  • [10] A. Vaswani et al., "Attention is All You Need," in Advances in Neural Information Processing Systems 30, 2017, pp. 5998-6008.
  • [11] R. Girshick, J. Donahue, T. Darrell, and J. Malik, "R-CNN: Regions with Convolutional Neural Network Features," in Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2013, pp. 580-587.
  • [12] Akwasi Darkwah Akwaboah, "Implementation of Convolutional Neural Networks for CIFAR-10 Image Classification," 2019.
  • [13] Ajala Sunday Adeyinka, "Convolutional Neural Network Implementation for Classification using CIFAR-10," ResearchGate, 2023.
  • [14] Hengyue Pan, "Learning Convolutional Neural Networks in Frequency Domain," ResearchGate, 2023.
  • [15] S. Tötterström, "Frequency Domain Image Classification with Convolutional Neural Networks," Bachelor’s Thesis, Tampere University, 2023.
  • [16] Stuchi, J. A., Canto, N. G., de Faissol Attux, R. R., & Boccato, L. (2024). A frequency-domain approach with learnable filters for image classification. Applied Soft Computing, 155, 111443.
Toplam 16 adet kaynakça vardır.

Ayrıntılar

Birincil Dil İngilizce
Konular Bilgisayar Yazılımı, Yazılım Mühendisliği (Diğer)
Bölüm Araştırma Makaleleri
Yazarlar

Göktuğ Erdem Dağı 0000-0001-5723-4578

Erhan Gökçay 0000-0002-4220-199X

Hakan Tora 0000-0002-0427-483X

Erken Görünüm Tarihi 21 Haziran 2024
Yayımlanma Tarihi 21 Haziran 2024
Gönderilme Tarihi 15 Haziran 2024
Kabul Tarihi 20 Haziran 2024
Yayımlandığı Sayı Yıl 2024 Cilt: 5 Sayı: 1

Kaynak Göster

APA Dağı, G. E., Gökçay, E., & Tora, H. (2024). Fourier-Based Image Classification Using CNN. Journal of Science, Technology and Engineering Research, 5(1), 92-101. https://doi.org/10.53525/jster.1501920
AMA Dağı GE, Gökçay E, Tora H. Fourier-Based Image Classification Using CNN. JSTER. Haziran 2024;5(1):92-101. doi:10.53525/jster.1501920
Chicago Dağı, Göktuğ Erdem, Erhan Gökçay, ve Hakan Tora. “Fourier-Based Image Classification Using CNN”. Journal of Science, Technology and Engineering Research 5, sy. 1 (Haziran 2024): 92-101. https://doi.org/10.53525/jster.1501920.
EndNote Dağı GE, Gökçay E, Tora H (01 Haziran 2024) Fourier-Based Image Classification Using CNN. Journal of Science, Technology and Engineering Research 5 1 92–101.
IEEE G. E. Dağı, E. Gökçay, ve H. Tora, “Fourier-Based Image Classification Using CNN”, JSTER, c. 5, sy. 1, ss. 92–101, 2024, doi: 10.53525/jster.1501920.
ISNAD Dağı, Göktuğ Erdem vd. “Fourier-Based Image Classification Using CNN”. Journal of Science, Technology and Engineering Research 5/1 (Haziran 2024), 92-101. https://doi.org/10.53525/jster.1501920.
JAMA Dağı GE, Gökçay E, Tora H. Fourier-Based Image Classification Using CNN. JSTER. 2024;5:92–101.
MLA Dağı, Göktuğ Erdem vd. “Fourier-Based Image Classification Using CNN”. Journal of Science, Technology and Engineering Research, c. 5, sy. 1, 2024, ss. 92-101, doi:10.53525/jster.1501920.
Vancouver Dağı GE, Gökçay E, Tora H. Fourier-Based Image Classification Using CNN. JSTER. 2024;5(1):92-101.
Dergide yayınlanan çalışmalar
Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 (CC BY-NC-ND 4.0) Uluslararası Lisansı ile lisanslanmıştır.
by-nc-nd.png

Free counters!