Araştırma Makalesi

Evaluatıon of DDPG and PPO Algorıthms for Bıpedal Robot Control

Cilt: 5 Sayı: 2 18 Temmuz 2022
PDF İndir
TR EN

Evaluatıon of DDPG and PPO Algorıthms for Bıpedal Robot Control

Abstract

Legged robots are very popular topics in the robotic field owing to walking on hard terrain. In the current study, the walking of a bipedal robot that is legged robot was aimed. For this purpose, the system was examined and an artificial neural network was designed. After, the neural network was trained by using the Deep Deterministic Policy Gradient (DDPG) and the Proximal Policy Optimization (PPO) algorithms. After the training process, the PPO algorithm was formed better training performance than the DDPG algorithm. Also, the optimal noise standard deviation of the PPO algorithm was investigated. The results were shown that the best results were obtained by using 0.50. The system was tested by utilizing the artificial neural networks that trained the PPO algorithm which has got 0.50 noise standard deviation. According to the test result, the total reward was calculated as 274.334 and the walking task was achieved by purposed structure. As a result, the current study has formed the basis for controlling a bipedal robot and the PPO noise standard deviation selection.

Keywords

Kaynakça

  1. Bingol M. C. a. Development of Neural Network Based on Deep Reinforcement Learning to Compensate for Damaged Actuator of a Planar Robot. Global Conference on Engineering Research (GLOBCER’21) June 2021; 310–317.
  2. Bingol, M. C. b. Investigation of the Standard Deviation of Ornstein - Uhlenbeck Noise in the DDPG Algorithm. Gazi University Journal of Science Part C: Design and Technology 2021; 9(2): 200–210.
  3. Dong, Y., Zhang, S., Liu, X., Zhang, Y., & Shen, T. Variance aware reward smoothing for deep reinforcement learning. Neurocomputing 2021; 458, 327–335.
  4. Ghouri, U. H., Zafar, M. U., Bari, S., Khan, H., Khan, M. U. Attitude Control of Quad-copter using Deterministic Policy Gradient Algorithms (DPGA). 2019 2nd International Conference on Communication, Computing and Digital Systems C-CODE 2019; 149–153.
  5. He, K., Zhang, X., Ren, S., & Sun, J. Delving deep into rectifiers: Surpassing human-level performance on imagenet classification. In Proceedings of the IEEE international conference on computer vision 2015; 1026-1034.
  6. https://github.com/openai/gym/blob/b6b4fc38388c42c76516c93fd107dab124af64dd/gym/envs/box2d/bipedal_walker.py (Accessed October 14, 2021) Bipedal-Walker
  7. Lillicrap, T. P., Hunt, J. J., Pritzel, A., Heess, N., Erez, T., Tassa, Y., Wierstra, D. Continuous control with deep reinforcement learning. 4th International Conference on Learning Representations, ICLR 2016.
  8. Lopes, G. C., Ferreira, M., Da Silva Simoes, A., Colombini, E. L. Intelligent control of a quadrotor with proximal policy optimization reinforcement learning. 15th Latin American Robotics Symposium, 6th Brazilian Robotics Symposium and 9th Workshop on Robotics in Education, 2018; 509–514.

Ayrıntılar

Birincil Dil

İngilizce

Konular

Bilgisayar Yazılımı

Bölüm

Araştırma Makalesi

Yayımlanma Tarihi

18 Temmuz 2022

Gönderilme Tarihi

3 Aralık 2021

Kabul Tarihi

2 Mart 2022

Yayımlandığı Sayı

Yıl 2022 Cilt: 5 Sayı: 2

Kaynak Göster

APA
Bıngol, M. C. (2022). Evaluatıon of DDPG and PPO Algorıthms for Bıpedal Robot Control. Osmaniye Korkut Ata Üniversitesi Fen Bilimleri Enstitüsü Dergisi, 5(2), 783-791. https://doi.org/10.47495/okufbed.1031976
AMA
1.Bıngol MC. Evaluatıon of DDPG and PPO Algorıthms for Bıpedal Robot Control. Osmaniye Korkut Ata Üniversitesi Fen Bilimleri Enstitüsü Dergisi. 2022;5(2):783-791. doi:10.47495/okufbed.1031976
Chicago
Bıngol, Mustafa Can. 2022. “Evaluatıon of DDPG and PPO Algorıthms for Bıpedal Robot Control”. Osmaniye Korkut Ata Üniversitesi Fen Bilimleri Enstitüsü Dergisi 5 (2): 783-91. https://doi.org/10.47495/okufbed.1031976.
EndNote
Bıngol MC (01 Temmuz 2022) Evaluatıon of DDPG and PPO Algorıthms for Bıpedal Robot Control. Osmaniye Korkut Ata Üniversitesi Fen Bilimleri Enstitüsü Dergisi 5 2 783–791.
IEEE
[1]M. C. Bıngol, “Evaluatıon of DDPG and PPO Algorıthms for Bıpedal Robot Control”, Osmaniye Korkut Ata Üniversitesi Fen Bilimleri Enstitüsü Dergisi, c. 5, sy 2, ss. 783–791, Tem. 2022, doi: 10.47495/okufbed.1031976.
ISNAD
Bıngol, Mustafa Can. “Evaluatıon of DDPG and PPO Algorıthms for Bıpedal Robot Control”. Osmaniye Korkut Ata Üniversitesi Fen Bilimleri Enstitüsü Dergisi 5/2 (01 Temmuz 2022): 783-791. https://doi.org/10.47495/okufbed.1031976.
JAMA
1.Bıngol MC. Evaluatıon of DDPG and PPO Algorıthms for Bıpedal Robot Control. Osmaniye Korkut Ata Üniversitesi Fen Bilimleri Enstitüsü Dergisi. 2022;5:783–791.
MLA
Bıngol, Mustafa Can. “Evaluatıon of DDPG and PPO Algorıthms for Bıpedal Robot Control”. Osmaniye Korkut Ata Üniversitesi Fen Bilimleri Enstitüsü Dergisi, c. 5, sy 2, Temmuz 2022, ss. 783-91, doi:10.47495/okufbed.1031976.
Vancouver
1.Mustafa Can Bıngol. Evaluatıon of DDPG and PPO Algorıthms for Bıpedal Robot Control. Osmaniye Korkut Ata Üniversitesi Fen Bilimleri Enstitüsü Dergisi. 01 Temmuz 2022;5(2):783-91. doi:10.47495/okufbed.1031976

Cited By

23487




196541947019414  

1943319434 19435194361960219721 19784  2123822610 23877

* Uluslararası Hakemli Dergi (International Peer Reviewed Journal)

* Yazar/yazarlardan hiçbir şekilde MAKALE BASIM ÜCRETİ vb. şeyler istenmemektedir (Free submission and publication).

* Yılda Ocak, Mart, Haziran, Eylül ve Aralık'ta olmak üzere 5 sayı yayınlanmaktadır (Published 5 times a year)

* Dergide, Türkçe ve İngilizce makaleler basılmaktadır.

*Dergi açık erişimli bir dergidir.

Creative Commons License

Bu web sitesi Creative Commons Atıf 4.0 Uluslararası Lisansı ile lisanslanmıştır.