Araştırma Makalesi

Use of PID control during Education in Reinforcement Learning on Two Wheel Balance Robot

Cilt: 9 Sayı: 4 29 Aralık 2021
PDF İndir
EN

Use of PID control during Education in Reinforcement Learning on Two Wheel Balance Robot

Öz

This study’s primary objective was to try to shorten the training time of the Reinforcement Learning (RL) method, which is one of the Machine Learning methods, by using the proportional-integral-derivative (PID) control method during training. In this study, a balancing robot with two wheels that can be controlled independently on the same axis is used. While the robot is in balance, the RL software block follows how the PID block maintains the balance, and the RL blog learned how to behave against disturbing factors without physical falling / rising. In the training of RL, it is necessary to create approximately 500 policy / reward / path equations between the current state and future state matrices. Obviously, the amount of equations will increase considerably when subjects such as old position and acceleration are added. Approximately 1000 trial / error is required for training purposes. This means many falling / rising cycles. With the method we present, the RL block has learned to keep the robot in balance without falling and requiring human intervention in 900 trials. This shortened the training time by about 60%.

Anahtar Kelimeler

Kaynakça

  1. [1] Ali Ghaffari, Azadeh Shariati, Amir H. Shamekhi. “A modified dynamical formulation for two-wheeled self-balancing robots” (Article - DOI 10.1007/s11071-015-2321-9)
  2. [2] Chıa-Hong Chen, Jong-Hann Jean, Dao-Xıang Xu. “Applıcatıon Of Fuzzy Control For Self-Balancıng Two-Wheel Vehıcle” (Article - 978-1-4577-0308-9/11/$26.00 ©2011 IEEE)
  3. [3] Raudys A, Subonien A. “A Review of Self-balancing Robot Reinforcement Learning Algorithms” (Article - ICIST 2020, CCIS 1283, pp. 159–170, 2020. )
  4. [4] Muhammad Atif Imtiaz, Mahum Naveed, Nimra Bibi, Sumair Aziz, Syed Zohaib Hassan Naqvi. “Control System Design, Analysis & Implementation of Two Wheeled Self Balancing Robot” (Article - 978-1-5386-7266-2/18/$31.00 ©2018 IEEE)
  5. [5] Boston Dynamic (website) - https://www.bostondynamics.com/handle
  6. [6] Ascento (website) - https://www.ascento.ethz.ch/
  7. [7] Segway - “Segway Inc.: Reference manual, Segway personal transporter (PT).” Segway Inc., Bedford, NH (2006)
  8. [8] R.E. Parr, “Hierarchical Control and Learning for Markov Decision Processes.” University of California: Berkeley, 1998.

Ayrıntılar

Birincil Dil

İngilizce

Konular

Mühendislik

Bölüm

Araştırma Makalesi

Yayımlanma Tarihi

29 Aralık 2021

Gönderilme Tarihi

21 Haziran 2021

Kabul Tarihi

6 Kasım 2021

Yayımlandığı Sayı

Yıl 2021 Cilt: 9 Sayı: 4

Kaynak Göster

APA
Ataç, E., Yıldız, K., & Ülkü, E. E. (2021). Use of PID control during Education in Reinforcement Learning on Two Wheel Balance Robot. Gazi Üniversitesi Fen Bilimleri Dergisi Part C: Tasarım ve Teknoloji, 9(4), 597-607. https://doi.org/10.29109/gujsc.955562
AMA
1.Ataç E, Yıldız K, Ülkü EE. Use of PID control during Education in Reinforcement Learning on Two Wheel Balance Robot. GUJS Part C. 2021;9(4):597-607. doi:10.29109/gujsc.955562
Chicago
Ataç, Emrah, Kazım Yıldız, ve Eyüp Emre Ülkü. 2021. “Use of PID control during Education in Reinforcement Learning on Two Wheel Balance Robot”. Gazi Üniversitesi Fen Bilimleri Dergisi Part C: Tasarım ve Teknoloji 9 (4): 597-607. https://doi.org/10.29109/gujsc.955562.
EndNote
Ataç E, Yıldız K, Ülkü EE (01 Aralık 2021) Use of PID control during Education in Reinforcement Learning on Two Wheel Balance Robot. Gazi Üniversitesi Fen Bilimleri Dergisi Part C: Tasarım ve Teknoloji 9 4 597–607.
IEEE
[1]E. Ataç, K. Yıldız, ve E. E. Ülkü, “Use of PID control during Education in Reinforcement Learning on Two Wheel Balance Robot”, GUJS Part C, c. 9, sy 4, ss. 597–607, Ara. 2021, doi: 10.29109/gujsc.955562.
ISNAD
Ataç, Emrah - Yıldız, Kazım - Ülkü, Eyüp Emre. “Use of PID control during Education in Reinforcement Learning on Two Wheel Balance Robot”. Gazi Üniversitesi Fen Bilimleri Dergisi Part C: Tasarım ve Teknoloji 9/4 (01 Aralık 2021): 597-607. https://doi.org/10.29109/gujsc.955562.
JAMA
1.Ataç E, Yıldız K, Ülkü EE. Use of PID control during Education in Reinforcement Learning on Two Wheel Balance Robot. GUJS Part C. 2021;9:597–607.
MLA
Ataç, Emrah, vd. “Use of PID control during Education in Reinforcement Learning on Two Wheel Balance Robot”. Gazi Üniversitesi Fen Bilimleri Dergisi Part C: Tasarım ve Teknoloji, c. 9, sy 4, Aralık 2021, ss. 597-0, doi:10.29109/gujsc.955562.
Vancouver
1.Emrah Ataç, Kazım Yıldız, Eyüp Emre Ülkü. Use of PID control during Education in Reinforcement Learning on Two Wheel Balance Robot. GUJS Part C. 01 Aralık 2021;9(4):597-60. doi:10.29109/gujsc.955562

Cited By

                                     16168      16167     16166     21432        logo.png   


    e-ISSN:2147-9526