Review

An overview of the activation functions used in deep learning algorithms

Volume: 10 Number: 3 December 31, 2021
EN

An overview of the activation functions used in deep learning algorithms

Abstract

In deep learning models, the inputs to the network are processed using activation functions to generate the output corresponding to these inputs. Deep learning models are of particular importance in analyzing big data with numerous parameters and forecasting and are useful for image processing, natural language processing, object recognition, and financial forecasting. Also, in deep learning algorithms, activation functions have been developed by taking into account features such as performing the learning process in a healthy way, preventing excessive learning, increasing the accuracy performance, and reducing the computational cost. In this study, we present an overview of common and current activation functions used in deep learning algorithms. In the study, fixed and trainable activation functions are introduced. As fixed activation functions, sigmoid, hyperbolic tangent, ReLU, softplus and swish, and as trainable activation functions, LReLU, ELU, SELU and RSigELU are introduced.

Keywords

Activation function, Neural network, Deep learning

References

  1. K. Adem, S. Kılıçarslan, O. Cömert, Classification and diagnosis of cervical cancer with stacked autoencoder and softmax classification, Expert Systems with Applications, 115, (2018) 557– 564.
  2. S. Kılıçarslan, K. Adem, M. Çelik, Diagnosis and classification of cancer using hybrid model based on ReliefF and convolutional neural network, medical hypotheses, 137, (2020) 199577.
  3. S. Kılıçarslan, M. Çelik, Ş. Sahin, Hybrid models based on genetic algorithm and deep learning algorithms for nutritional Anemia disease classification, Biomedical Signal Processing and Control, 63, (2021) 102231.
  4. S. Kılıçarslan, M. Çelik, RSigELU: A nonlinear activation function for deep neural networks, Expert Systems with Applications, 174, (2021) 114805.
  5. A. Apicella, F. Donnarumma, F. Isgrò, R. Prevete, A survey on modern trainable activation functions, Neural Networks, 138 (2021) 14–32.
  6. S. Scardapane, S. Van Vaerenbergh, S. Totaro, A. Uncini, Kafnets: Kernel-based non-parametric activation functions for neural networks, Neural Networks, 110, (2019) 19–32.
  7. V. Nair, G. E. Hinton, Rectified linear units improve restricted Boltzmann machines, In Proceedings of the 27th international conference on machine learning (ICML-10), 2010, pp. 807–814.
  8. A. L. Maas, A. Y. Hannun, A. Y. Nug, Rectifier nonlinearities improve neural network acoustic models, S. Dasgupta, D. McAllester (Eds.), International Conference on Machine Learning Workshop on Deep Learning for Audio, Speech, and Language Processing, Atlanta, USA, 2013, pp. 1–6.
  9. D. A. Clevert, T. Unterthiner, S. Hochreiter, Fast and accurate deep network learning by exponential linear units (elus), arXiv preprint arXiv:1511.07289, (2015).
  10. L. Trottier, P. Gigu, B. Chaib-draa, Parametric exponential linear unit for deep convolutional neural networks, in: X. Chen, B. Luo, F. Luo, V. Palade, M. A. Wani (Eds.), 16th IEEE International Conference on Machine Learning and Applications, Cancun, Mexico, 2017, pp. 207–214.
APA
Kılıçarslan, S., Adem, K., & Çelik, M. (2021). An overview of the activation functions used in deep learning algorithms. Journal of New Results in Science, 10(3), 75-88. https://doi.org/10.54187/jnrs.1011739
AMA
1.Kılıçarslan S, Adem K, Çelik M. An overview of the activation functions used in deep learning algorithms. JNRS. 2021;10(3):75-88. doi:10.54187/jnrs.1011739
Chicago
Kılıçarslan, Serhat, Kemal Adem, and Mete Çelik. 2021. “An Overview of the Activation Functions Used in Deep Learning Algorithms”. Journal of New Results in Science 10 (3): 75-88. https://doi.org/10.54187/jnrs.1011739.
EndNote
Kılıçarslan S, Adem K, Çelik M (December 1, 2021) An overview of the activation functions used in deep learning algorithms. Journal of New Results in Science 10 3 75–88.
IEEE
[1]S. Kılıçarslan, K. Adem, and M. Çelik, “An overview of the activation functions used in deep learning algorithms”, JNRS, vol. 10, no. 3, pp. 75–88, Dec. 2021, doi: 10.54187/jnrs.1011739.
ISNAD
Kılıçarslan, Serhat - Adem, Kemal - Çelik, Mete. “An Overview of the Activation Functions Used in Deep Learning Algorithms”. Journal of New Results in Science 10/3 (December 1, 2021): 75-88. https://doi.org/10.54187/jnrs.1011739.
JAMA
1.Kılıçarslan S, Adem K, Çelik M. An overview of the activation functions used in deep learning algorithms. JNRS. 2021;10:75–88.
MLA
Kılıçarslan, Serhat, et al. “An Overview of the Activation Functions Used in Deep Learning Algorithms”. Journal of New Results in Science, vol. 10, no. 3, Dec. 2021, pp. 75-88, doi:10.54187/jnrs.1011739.
Vancouver
1.Serhat Kılıçarslan, Kemal Adem, Mete Çelik. An overview of the activation functions used in deep learning algorithms. JNRS. 2021 Dec. 1;10(3):75-88. doi:10.54187/jnrs.1011739

Cited By

Use of Izhikevich neurons in Hopfield models

Bulletin of Taras Shevchenko National University of Kyiv. Series: Physics and Mathematics

https://doi.org/10.17721/1812-5409.2025/1.16