Araştırma Makalesi

Efficient Hardware Optimization for CNN

Cilt: 6 Sayı: 1 20 Temmuz 2022
PDF İndir
EN

Efficient Hardware Optimization for CNN

Öz

Convolutional Neural Networks (CNN) architectures have been increasingly well-known for image processing applications such as object detection, and remote sensing. Some applications like these systems need to adopt CNN methods for real-time implementation. Embedded devices like Field Programmable Gate Arrays (FPGA) technologies are a favorable alternative to implementing CNN-based algorithms. However, FPGA has some drawbacks such as limited resources and bottlenecks, it is difficult and so crucial to map the whole CNN that has a high number of layers, on FPGA without any optimization. Therefore, hardware optimization techniques are compulsory. In this study, an FPGA-based CNN architecture using high-level synthesis (HLS) is demonstrated, and a synthesis report is created for Xilinx Zynq-7000 xc7z020-clg484-1 target FPGAs. By implementing the CNN architecture on an FPGA platform, the implemented architecture has been fastened. To improve the throughput, the proposed design is optimized for convolutional layers. The most important contribution of this study is to perform optimization on the convolution layer by unrolling kernels and input feature maps and examine the effects on throughput, latency, and hardware resources. In this study, throughput is 15.6 GOP/s for the first convolution layer. With the proposed method in the study, approximately x2.6 acceleration in terms of latency and throughput was achieved compared to the baseline design.

Anahtar Kelimeler

Destekleyen Kurum

Tubitak

Proje Numarası

121E393

Teşekkür

This research was supported by a grant from (121E393) TUBITAK (Türkiye Bilimsel ve Teknolojik Araştirma Kurumu). We thank the TUBITAK for their support of our research.

Kaynakça

  1. [1] Y. Lecun, L. Bottou, Y. Bengio, and P. Haffner, "Gradient-based learning applied to document recognition," Proceedings of the IEEE, vol. 86, no. 11, pp. 2278–2324, 1998.
  2. [2] K. Simonyan and A. Zisserman, “Very Deep Convolutional Networks for Large-Scale Image Recognition,” arXiv.org, 2014, doi: 10.48550/arXiv.1409.1556.
  3. [3] A. Krizhevsky, I. Sutskever, and G. E. Hinton, “ImageNet classification with deep convolutional neural networks,” Communications of the ACM, vol. 60, no. 6, pp. 84–90, May 2017, doi: 10.1145/3065386.
  4. [4] M. Mikaeili and H. S. Bilge, “Estimating Rotation Angle and Transformation Matrix Between Consecutive Ultrasound Images Using Deep Learning,” 2020 Medical Technologies Congress (TIPTEKNO), Nov. 2020, doi: 10.1109/tiptekno50054.2020.9299237.
  5. [5] C. Huang, S. Ni and G. Chen, "A layer-based structured design of CNN on FPGA," 2017 IEEE 12th International Conference on ASIC (ASICON), 2017, pp. 1037-1040, doi: 10.1109/ASICON.2017.8252656.
  6. [6] W. A. Haque, S. Arefin, A. S. M. Shihavuddin, and M. A. Hasan, “DeepThin: A novel lightweight CNN architecture for traffic sign recognition without GPU requirements,” Expert Systems with Applications, vol. 168, p. 114481, Apr. 2021, doi: 10.1016/j.eswa.2020.114481.
  7. [7] Y. Hu, Y. Liu, and Z. Liu, “A Survey on Convolutional Neural Network Accelerators: GPU, FPGA and ASIC,” 2022 14th International Conference on Computer Research and Development (ICCRD), Jan. 2022, doi: 10.1109/iccrd54409.2022.9730377.
  8. [8] N. Zhang, X. Wei, H. Chen, and W. Liu, “FPGA Implementation for CNN-Based Optical Remote Sensing Object Detection,” Electronics, vol. 10, no. 3, p. 282, Jan. 2021, doi: 10.3390/electronics10030282.

Ayrıntılar

Birincil Dil

İngilizce

Konular

Mühendislik

Bölüm

Araştırma Makalesi

Yayımlanma Tarihi

20 Temmuz 2022

Gönderilme Tarihi

1 Haziran 2022

Kabul Tarihi

20 Haziran 2022

Yayımlandığı Sayı

Yıl 2022 Cilt: 6 Sayı: 1

Kaynak Göster

APA
Güzel Aydın, S., & Bilge, H. Ş. (2022). Efficient Hardware Optimization for CNN. International Journal of Multidisciplinary Studies and Innovative Technologies, 6(1), 38-44. https://izlik.org/JA85DY35WP
AMA
1.Güzel Aydın S, Bilge HŞ. Efficient Hardware Optimization for CNN. IJMSIT. 2022;6(1):38-44. https://izlik.org/JA85DY35WP
Chicago
Güzel Aydın, Seda, ve Hasan Şakir Bilge. 2022. “Efficient Hardware Optimization for CNN”. International Journal of Multidisciplinary Studies and Innovative Technologies 6 (1): 38-44. https://izlik.org/JA85DY35WP.
EndNote
Güzel Aydın S, Bilge HŞ (01 Temmuz 2022) Efficient Hardware Optimization for CNN. International Journal of Multidisciplinary Studies and Innovative Technologies 6 1 38–44.
IEEE
[1]S. Güzel Aydın ve H. Ş. Bilge, “Efficient Hardware Optimization for CNN”, IJMSIT, c. 6, sy 1, ss. 38–44, Tem. 2022, [çevrimiçi]. Erişim adresi: https://izlik.org/JA85DY35WP
ISNAD
Güzel Aydın, Seda - Bilge, Hasan Şakir. “Efficient Hardware Optimization for CNN”. International Journal of Multidisciplinary Studies and Innovative Technologies 6/1 (01 Temmuz 2022): 38-44. https://izlik.org/JA85DY35WP.
JAMA
1.Güzel Aydın S, Bilge HŞ. Efficient Hardware Optimization for CNN. IJMSIT. 2022;6:38–44.
MLA
Güzel Aydın, Seda, ve Hasan Şakir Bilge. “Efficient Hardware Optimization for CNN”. International Journal of Multidisciplinary Studies and Innovative Technologies, c. 6, sy 1, Temmuz 2022, ss. 38-44, https://izlik.org/JA85DY35WP.
Vancouver
1.Seda Güzel Aydın, Hasan Şakir Bilge. Efficient Hardware Optimization for CNN. IJMSIT [Internet]. 01 Temmuz 2022;6(1):38-44. Erişim adresi: https://izlik.org/JA85DY35WP