Araştırma Makalesi

Quantitative Performance Analysis of BLAS Libraries on GPU Architectures

Cilt: 26 Sayı: 76 23 Ocak 2024
PDF İndir
EN TR

Quantitative Performance Analysis of BLAS Libraries on GPU Architectures

Öz

Basic Linear Algebra Subprograms (BLAS) are a set of linear algebra routines commonly used by machine learning applications and scientific computing. BLAS libraries with optimized implementations of BLAS routines offer high performance by exploiting parallel execution units in target computing systems. With massively large number of cores, graphics processing units (GPUs) exhibit high performance for computationally-heavy workloads. Recent BLAS libraries utilize parallel cores of GPU architectures efficiently by employing inherent data parallelism. In this study, we analyze GPU-targeted functions from two BLAS libraries, cuBLAS and MAGMA, and evaluate their performance on a single-GPU NVIDIA architecture by considering architectural features and limitations. We collect architectural performance metrics and explore resource utilization characteristics. Our work aims to help researchers and programmers to understand the performance behavior and GPU resource utilization of the BLAS routines implemented by the libraries.

Anahtar Kelimeler

Kaynakça

  1. Y. Wang, A. Davidson, Y. Pan, Y. Wu, A. Riffel, J. D. Owens, 2016. Gunrock: A high-performance graph processing library on the gpu, ACM SIGPLAN Notices, 51 (8), 1–12. DOI: 10.1145/3016078.2851145
  2. S. Le Grand, A. W. Götz, R. C. Walker, 2013. Spfp: Speed without compromise—a mixed precision model for gpu accelerated molecular dynamics simulations, Computer Physics Communications 184 (2), 374–380. DOI: 10.1016/j.cpc.2012.09.022
  3. A. Zeni, G. Guidi, M. Ellis, N. Ding, M. D. Santambrogio, S. A. Hofmeyr, A. Buluc ̧, L. Oliker, K. A. Yelick, 2020. LOGAN: high-performance gpu-based x-drop long-read alignment, IEEE International Parallel and Distributed Processing Symposium (IPDPS).
  4. F. F. d. Santos, P. F. Pimenta, C. Lunardi, L. Draghetti, L. Carro, D. Kaeli, P. Rech, 2019. Analyzing and increasing the reliability of convolutional neural networks on gpus, IEEE Transactions on Reliability 68 (2), 663–677. DOI: 10.1109/TR.2018.2878387
  5. S. Alcaide, L. Kosmidis, H. Tabani, C. Hernandez, J. Abella, F. J. Cazorla, 2018. Safety-related challenges and opportunities for gpus in the automotive domain, IEEE Micro 38 (6), 46–55. DOI: 10.1109/MM.2018.2873870
  6. M. Benito, M. M. Trompouki, L. Kosmidis, J. D. Garcia, S. Carretero, K. Wenger, 2021. Comparison of gpu computing methodologies for safety-critical systems: An avionics case study, Design, Automation Test in Europe Conference Exhibition (DATE).
  7. S. Kestur, J. D. Davis, O. Williams, 2010. Blas comparison on fpga, cpu and gpu, IEEE Computer Society Annual Symposium on VLSI.
  8. A. A. Awan, H. Subramoni, D. K. Panda, 2017. An in-depth performance characterization of cpu- and gpu-based dnn training on modern architectures, Proceedings of the Machine Learning on HPC Environments (MLHPC).

Ayrıntılar

Birincil Dil

İngilizce

Konular

Mühendislik

Bölüm

Araştırma Makalesi

Erken Görünüm Tarihi

22 Ocak 2024

Yayımlanma Tarihi

23 Ocak 2024

Gönderilme Tarihi

22 Şubat 2023

Kabul Tarihi

25 Mayıs 2023

Yayımlandığı Sayı

Yıl 2024 Cilt: 26 Sayı: 76

Kaynak Göster

APA
Öz, I. (2024). Quantitative Performance Analysis of BLAS Libraries on GPU Architectures. Dokuz Eylül Üniversitesi Mühendislik Fakültesi Fen ve Mühendislik Dergisi, 26(76), 40-48. https://doi.org/10.21205/deufmd.2024267606
AMA
1.Öz I. Quantitative Performance Analysis of BLAS Libraries on GPU Architectures. DEUFMD. 2024;26(76):40-48. doi:10.21205/deufmd.2024267606
Chicago
Öz, Işıl. 2024. “Quantitative Performance Analysis of BLAS Libraries on GPU Architectures”. Dokuz Eylül Üniversitesi Mühendislik Fakültesi Fen ve Mühendislik Dergisi 26 (76): 40-48. https://doi.org/10.21205/deufmd.2024267606.
EndNote
Öz I (01 Ocak 2024) Quantitative Performance Analysis of BLAS Libraries on GPU Architectures. Dokuz Eylül Üniversitesi Mühendislik Fakültesi Fen ve Mühendislik Dergisi 26 76 40–48.
IEEE
[1]I. Öz, “Quantitative Performance Analysis of BLAS Libraries on GPU Architectures”, DEUFMD, c. 26, sy 76, ss. 40–48, Oca. 2024, doi: 10.21205/deufmd.2024267606.
ISNAD
Öz, Işıl. “Quantitative Performance Analysis of BLAS Libraries on GPU Architectures”. Dokuz Eylül Üniversitesi Mühendislik Fakültesi Fen ve Mühendislik Dergisi 26/76 (01 Ocak 2024): 40-48. https://doi.org/10.21205/deufmd.2024267606.
JAMA
1.Öz I. Quantitative Performance Analysis of BLAS Libraries on GPU Architectures. DEUFMD. 2024;26:40–48.
MLA
Öz, Işıl. “Quantitative Performance Analysis of BLAS Libraries on GPU Architectures”. Dokuz Eylül Üniversitesi Mühendislik Fakültesi Fen ve Mühendislik Dergisi, c. 26, sy 76, Ocak 2024, ss. 40-48, doi:10.21205/deufmd.2024267606.
Vancouver
1.Işıl Öz. Quantitative Performance Analysis of BLAS Libraries on GPU Architectures. DEUFMD. 01 Ocak 2024;26(76):40-8. doi:10.21205/deufmd.2024267606

Cited By

Bu dergi, Creative Commons Atıf-GayriTicari 4.0 Uluslararası Lisansı (CC BY-NC 4.0) altında lisanslanmıştır.

download?token=eyJhdXRoX3JvbGVzIjpbXSwiZW5kcG9pbnQiOiJmaWxlIiwicGF0aCI6IjliNTAvMDBjMi8xZmIxLzY5MjZmZDIyOGE1NzgyLjA3MzU5MTk2LnBuZyIsImV4cCI6MTc2NDE2OTE1Nywibm9uY2UiOiJhZDRmNjNlNzdhOWYwOWQ4YTNjNGVmNGIxOTFlZWViNyJ9.4Dxgc9mc-p4Tyti8NTU5pxEfGUWeuJud1fPWxu2mUy8