EN
TR
A THEORETICAL INVESTIGATION ON TRAINING OF PIPE-LIKE NEURAL NETWORK BENCHMARK ARCHITECTURES AND PERFORMANCE COMPARISONS OF POPULAR TRAINING ALGORITHMS
Öz
Architectures of neural networks affect the training performance of artificial neural networks. For more consistent performance evaluation of training algorithms, hard-to-train benchmarking architectures should be used. This study introduces a benchmark neural network architecture, which is called pipe-like architecture, and presents training performance analyses for popular Neural Network Backpropagation Algorithms (NNBA) and well-known Metaheuristic Search Algorithms (MSA). The pipe-like neural architectures essentially resemble an elongated fraction of a deep neural network and form a narrowed long bottleneck for the learning process. Therefore, they can significantly complicate the training process by causing the gradient vanishing problems and large training delays in backward propagation of parameter updates throughout the elongated pipe-like network. The training difficulties of pipe-like architectures are theoretically demonstrated in this study by considering the upper bound of weight updates according to an aggregated one-neuron learning channels conjecture. These analyses also contribute to Baldi et al.'s learning channel theorem of neural networks in a practical aspect. The training experiments for popular NNBA and MSA algorithms were conducted on the pipe-like benchmark architecture by using a biological dataset. Moreover, a Normalized Overall Performance Scoring (NOPS) was performed for the criterion-based assessment of overall performance of training algorithms.
Anahtar Kelimeler
Kaynakça
- Aliev, R.A., Fazlollahi, B., Guirimov, B.G., Aliev, R.R., 2008. Recurrent Fuzzy Neural Networks and Their Performance Analysis. in: Recurr. Neural Networks, InTech. https://doi.org/10.5772/5540.
- Arifovic, J., Gençay, R., 2001. Using genetic algorithms to select architecture of a feedforward artificial neural network. Phys. A Stat. Mech. Its Appl., 289:574–594. https://doi.org/10.1016/S0378-4371(00)00479-9.
- Awolusi, T.F., Oke, O.L., Akinkurolere, O.O., Sojobi, A.O., Aluko, O.G., 2019. Performance comparison of neural network training algorithms in the modeling properties of steel fiber reinforced concrete. Heliyon 5:e01115. https://doi.org/10.1016/j.heliyon.2018.e01115.
- Bahrami, M., Akbari, M., Bagherzadeh, S.A., Karimipour, A., Afrand, M., Goodarzi, M., 2019. Develop 24 dissimilar ANNs by suitable architectures & training algorithms via sensitivity analysis to better statistical presentation: Measure MSEs between targets & ANN for Fe–CuO/Eg–Water nanofluid. Phys. A Stat. Mech. Its Appl. 519:159–168. https://doi.org/10.1016/j.physa.2018.12.031.
- Bala, J.W., Analytics, D., Bloedorn, E., Bratko, I., 1992. The MONK’s Problems A Performance Comparison of Different Learning Algorithms. http://robots.stanford.edu/papers/thrun.MONK.html Accessed 05 August 2021.
- Battiti, R., 1992. First- and Second-Order Methods for Learning: Between Steepest Descent and Newton’s Method. Neural Comput., 4:141–166. https://doi.org/10.1162/neco.1992.4.2.141.
- Beale, E.M.L., 1972. A derivation of conjugate gradients. in F.A. Lootsma, Ed., Numerical methods for nonlinear optimization, Academic Press, London, 39-43.
- Birattari, M., Kacprzyk, J., 2009. Tuning metaheuristics: a machine learning perspective, Springer, Berlin.
Ayrıntılar
Birincil Dil
İngilizce
Konular
Bilgisayar Yazılımı
Bölüm
Araştırma Makalesi
Yazarlar
Yayımlanma Tarihi
30 Aralık 2022
Gönderilme Tarihi
17 Nisan 2022
Kabul Tarihi
15 Temmuz 2022
Yayımlandığı Sayı
Yıl 2022 Cilt: 10 Sayı: 4
APA
İmik Şimşek, Ö., & Alagöz, B. B. (2022). A THEORETICAL INVESTIGATION ON TRAINING OF PIPE-LIKE NEURAL NETWORK BENCHMARK ARCHITECTURES AND PERFORMANCE COMPARISONS OF POPULAR TRAINING ALGORITHMS. Mühendislik Bilimleri ve Tasarım Dergisi, 10(4), 1251-1271. https://doi.org/10.21923/jesd.1104772
AMA
1.İmik Şimşek Ö, Alagöz BB. A THEORETICAL INVESTIGATION ON TRAINING OF PIPE-LIKE NEURAL NETWORK BENCHMARK ARCHITECTURES AND PERFORMANCE COMPARISONS OF POPULAR TRAINING ALGORITHMS. MBTD. 2022;10(4):1251-1271. doi:10.21923/jesd.1104772
Chicago
İmik Şimşek, Özlem, ve Barış Baykant Alagöz. 2022. “A THEORETICAL INVESTIGATION ON TRAINING OF PIPE-LIKE NEURAL NETWORK BENCHMARK ARCHITECTURES AND PERFORMANCE COMPARISONS OF POPULAR TRAINING ALGORITHMS”. Mühendislik Bilimleri ve Tasarım Dergisi 10 (4): 1251-71. https://doi.org/10.21923/jesd.1104772.
EndNote
İmik Şimşek Ö, Alagöz BB (01 Aralık 2022) A THEORETICAL INVESTIGATION ON TRAINING OF PIPE-LIKE NEURAL NETWORK BENCHMARK ARCHITECTURES AND PERFORMANCE COMPARISONS OF POPULAR TRAINING ALGORITHMS. Mühendislik Bilimleri ve Tasarım Dergisi 10 4 1251–1271.
IEEE
[1]Ö. İmik Şimşek ve B. B. Alagöz, “A THEORETICAL INVESTIGATION ON TRAINING OF PIPE-LIKE NEURAL NETWORK BENCHMARK ARCHITECTURES AND PERFORMANCE COMPARISONS OF POPULAR TRAINING ALGORITHMS”, MBTD, c. 10, sy 4, ss. 1251–1271, Ara. 2022, doi: 10.21923/jesd.1104772.
ISNAD
İmik Şimşek, Özlem - Alagöz, Barış Baykant. “A THEORETICAL INVESTIGATION ON TRAINING OF PIPE-LIKE NEURAL NETWORK BENCHMARK ARCHITECTURES AND PERFORMANCE COMPARISONS OF POPULAR TRAINING ALGORITHMS”. Mühendislik Bilimleri ve Tasarım Dergisi 10/4 (01 Aralık 2022): 1251-1271. https://doi.org/10.21923/jesd.1104772.
JAMA
1.İmik Şimşek Ö, Alagöz BB. A THEORETICAL INVESTIGATION ON TRAINING OF PIPE-LIKE NEURAL NETWORK BENCHMARK ARCHITECTURES AND PERFORMANCE COMPARISONS OF POPULAR TRAINING ALGORITHMS. MBTD. 2022;10:1251–1271.
MLA
İmik Şimşek, Özlem, ve Barış Baykant Alagöz. “A THEORETICAL INVESTIGATION ON TRAINING OF PIPE-LIKE NEURAL NETWORK BENCHMARK ARCHITECTURES AND PERFORMANCE COMPARISONS OF POPULAR TRAINING ALGORITHMS”. Mühendislik Bilimleri ve Tasarım Dergisi, c. 10, sy 4, Aralık 2022, ss. 1251-7, doi:10.21923/jesd.1104772.
Vancouver
1.Özlem İmik Şimşek, Barış Baykant Alagöz. A THEORETICAL INVESTIGATION ON TRAINING OF PIPE-LIKE NEURAL NETWORK BENCHMARK ARCHITECTURES AND PERFORMANCE COMPARISONS OF POPULAR TRAINING ALGORITHMS. MBTD. 01 Aralık 2022;10(4):1251-7. doi:10.21923/jesd.1104772
Cited By
Optimal deep neural network architecture design with improved generalization for data-driven cooling load estimation problem
Neural Computing and Applications
https://doi.org/10.1007/s00521-025-11212-7Preliminary study for a machine learning model for GENeuSIS
Fusion Engineering and Design
https://doi.org/10.1016/j.fusengdes.2025.115594