Research Article

Efficient Hardware Optimization for CNN

Volume: 6 Number: 1 July 20, 2022
EN

Efficient Hardware Optimization for CNN

Abstract

Convolutional Neural Networks (CNN) architectures have been increasingly well-known for image processing applications such as object detection, and remote sensing. Some applications like these systems need to adopt CNN methods for real-time implementation. Embedded devices like Field Programmable Gate Arrays (FPGA) technologies are a favorable alternative to implementing CNN-based algorithms. However, FPGA has some drawbacks such as limited resources and bottlenecks, it is difficult and so crucial to map the whole CNN that has a high number of layers, on FPGA without any optimization. Therefore, hardware optimization techniques are compulsory. In this study, an FPGA-based CNN architecture using high-level synthesis (HLS) is demonstrated, and a synthesis report is created for Xilinx Zynq-7000 xc7z020-clg484-1 target FPGAs. By implementing the CNN architecture on an FPGA platform, the implemented architecture has been fastened. To improve the throughput, the proposed design is optimized for convolutional layers. The most important contribution of this study is to perform optimization on the convolution layer by unrolling kernels and input feature maps and examine the effects on throughput, latency, and hardware resources. In this study, throughput is 15.6 GOP/s for the first convolution layer. With the proposed method in the study, approximately x2.6 acceleration in terms of latency and throughput was achieved compared to the baseline design.

Keywords

Supporting Institution

Tubitak

Project Number

121E393

Thanks

This research was supported by a grant from (121E393) TUBITAK (Türkiye Bilimsel ve Teknolojik Araştirma Kurumu). We thank the TUBITAK for their support of our research.

References

  1. [1] Y. Lecun, L. Bottou, Y. Bengio, and P. Haffner, "Gradient-based learning applied to document recognition," Proceedings of the IEEE, vol. 86, no. 11, pp. 2278–2324, 1998.
  2. [2] K. Simonyan and A. Zisserman, “Very Deep Convolutional Networks for Large-Scale Image Recognition,” arXiv.org, 2014, doi: 10.48550/arXiv.1409.1556.
  3. [3] A. Krizhevsky, I. Sutskever, and G. E. Hinton, “ImageNet classification with deep convolutional neural networks,” Communications of the ACM, vol. 60, no. 6, pp. 84–90, May 2017, doi: 10.1145/3065386.
  4. [4] M. Mikaeili and H. S. Bilge, “Estimating Rotation Angle and Transformation Matrix Between Consecutive Ultrasound Images Using Deep Learning,” 2020 Medical Technologies Congress (TIPTEKNO), Nov. 2020, doi: 10.1109/tiptekno50054.2020.9299237.
  5. [5] C. Huang, S. Ni and G. Chen, "A layer-based structured design of CNN on FPGA," 2017 IEEE 12th International Conference on ASIC (ASICON), 2017, pp. 1037-1040, doi: 10.1109/ASICON.2017.8252656.
  6. [6] W. A. Haque, S. Arefin, A. S. M. Shihavuddin, and M. A. Hasan, “DeepThin: A novel lightweight CNN architecture for traffic sign recognition without GPU requirements,” Expert Systems with Applications, vol. 168, p. 114481, Apr. 2021, doi: 10.1016/j.eswa.2020.114481.
  7. [7] Y. Hu, Y. Liu, and Z. Liu, “A Survey on Convolutional Neural Network Accelerators: GPU, FPGA and ASIC,” 2022 14th International Conference on Computer Research and Development (ICCRD), Jan. 2022, doi: 10.1109/iccrd54409.2022.9730377.
  8. [8] N. Zhang, X. Wei, H. Chen, and W. Liu, “FPGA Implementation for CNN-Based Optical Remote Sensing Object Detection,” Electronics, vol. 10, no. 3, p. 282, Jan. 2021, doi: 10.3390/electronics10030282.

Details

Primary Language

English

Subjects

Engineering

Journal Section

Research Article

Publication Date

July 20, 2022

Submission Date

June 1, 2022

Acceptance Date

June 20, 2022

Published in Issue

Year 2022 Volume: 6 Number: 1

APA
Güzel Aydın, S., & Bilge, H. Ş. (2022). Efficient Hardware Optimization for CNN. International Journal of Multidisciplinary Studies and Innovative Technologies, 6(1), 38-44. https://izlik.org/JA85DY35WP
AMA
1.Güzel Aydın S, Bilge HŞ. Efficient Hardware Optimization for CNN. IJMSIT. 2022;6(1):38-44. https://izlik.org/JA85DY35WP
Chicago
Güzel Aydın, Seda, and Hasan Şakir Bilge. 2022. “Efficient Hardware Optimization for CNN”. International Journal of Multidisciplinary Studies and Innovative Technologies 6 (1): 38-44. https://izlik.org/JA85DY35WP.
EndNote
Güzel Aydın S, Bilge HŞ (July 1, 2022) Efficient Hardware Optimization for CNN. International Journal of Multidisciplinary Studies and Innovative Technologies 6 1 38–44.
IEEE
[1]S. Güzel Aydın and H. Ş. Bilge, “Efficient Hardware Optimization for CNN”, IJMSIT, vol. 6, no. 1, pp. 38–44, July 2022, [Online]. Available: https://izlik.org/JA85DY35WP
ISNAD
Güzel Aydın, Seda - Bilge, Hasan Şakir. “Efficient Hardware Optimization for CNN”. International Journal of Multidisciplinary Studies and Innovative Technologies 6/1 (July 1, 2022): 38-44. https://izlik.org/JA85DY35WP.
JAMA
1.Güzel Aydın S, Bilge HŞ. Efficient Hardware Optimization for CNN. IJMSIT. 2022;6:38–44.
MLA
Güzel Aydın, Seda, and Hasan Şakir Bilge. “Efficient Hardware Optimization for CNN”. International Journal of Multidisciplinary Studies and Innovative Technologies, vol. 6, no. 1, July 2022, pp. 38-44, https://izlik.org/JA85DY35WP.
Vancouver
1.Seda Güzel Aydın, Hasan Şakir Bilge. Efficient Hardware Optimization for CNN. IJMSIT [Internet]. 2022 Jul. 1;6(1):38-44. Available from: https://izlik.org/JA85DY35WP