Research Article

Use of PID control during Education in Reinforcement Learning on Two Wheel Balance Robot

Volume: 9 Number: 4 December 29, 2021
EN

Use of PID control during Education in Reinforcement Learning on Two Wheel Balance Robot

Abstract

This study’s primary objective was to try to shorten the training time of the Reinforcement Learning (RL) method, which is one of the Machine Learning methods, by using the proportional-integral-derivative (PID) control method during training. In this study, a balancing robot with two wheels that can be controlled independently on the same axis is used. While the robot is in balance, the RL software block follows how the PID block maintains the balance, and the RL blog learned how to behave against disturbing factors without physical falling / rising. In the training of RL, it is necessary to create approximately 500 policy / reward / path equations between the current state and future state matrices. Obviously, the amount of equations will increase considerably when subjects such as old position and acceleration are added. Approximately 1000 trial / error is required for training purposes. This means many falling / rising cycles. With the method we present, the RL block has learned to keep the robot in balance without falling and requiring human intervention in 900 trials. This shortened the training time by about 60%.

Keywords

References

  1. [1] Ali Ghaffari, Azadeh Shariati, Amir H. Shamekhi. “A modified dynamical formulation for two-wheeled self-balancing robots” (Article - DOI 10.1007/s11071-015-2321-9)
  2. [2] Chıa-Hong Chen, Jong-Hann Jean, Dao-Xıang Xu. “Applıcatıon Of Fuzzy Control For Self-Balancıng Two-Wheel Vehıcle” (Article - 978-1-4577-0308-9/11/$26.00 ©2011 IEEE)
  3. [3] Raudys A, Subonien A. “A Review of Self-balancing Robot Reinforcement Learning Algorithms” (Article - ICIST 2020, CCIS 1283, pp. 159–170, 2020. )
  4. [4] Muhammad Atif Imtiaz, Mahum Naveed, Nimra Bibi, Sumair Aziz, Syed Zohaib Hassan Naqvi. “Control System Design, Analysis & Implementation of Two Wheeled Self Balancing Robot” (Article - 978-1-5386-7266-2/18/$31.00 ©2018 IEEE)
  5. [5] Boston Dynamic (website) - https://www.bostondynamics.com/handle
  6. [6] Ascento (website) - https://www.ascento.ethz.ch/
  7. [7] Segway - “Segway Inc.: Reference manual, Segway personal transporter (PT).” Segway Inc., Bedford, NH (2006)
  8. [8] R.E. Parr, “Hierarchical Control and Learning for Markov Decision Processes.” University of California: Berkeley, 1998.

Details

Primary Language

English

Subjects

Engineering

Journal Section

Research Article

Publication Date

December 29, 2021

Submission Date

June 21, 2021

Acceptance Date

November 6, 2021

Published in Issue

Year 2021 Volume: 9 Number: 4

APA
Ataç, E., Yıldız, K., & Ülkü, E. E. (2021). Use of PID control during Education in Reinforcement Learning on Two Wheel Balance Robot. Gazi Üniversitesi Fen Bilimleri Dergisi Part C: Tasarım Ve Teknoloji, 9(4), 597-607. https://doi.org/10.29109/gujsc.955562
AMA
1.Ataç E, Yıldız K, Ülkü EE. Use of PID control during Education in Reinforcement Learning on Two Wheel Balance Robot. GUJS Part C. 2021;9(4):597-607. doi:10.29109/gujsc.955562
Chicago
Ataç, Emrah, Kazım Yıldız, and Eyüp Emre Ülkü. 2021. “Use of PID Control During Education in Reinforcement Learning on Two Wheel Balance Robot”. Gazi Üniversitesi Fen Bilimleri Dergisi Part C: Tasarım Ve Teknoloji 9 (4): 597-607. https://doi.org/10.29109/gujsc.955562.
EndNote
Ataç E, Yıldız K, Ülkü EE (December 1, 2021) Use of PID control during Education in Reinforcement Learning on Two Wheel Balance Robot. Gazi Üniversitesi Fen Bilimleri Dergisi Part C: Tasarım ve Teknoloji 9 4 597–607.
IEEE
[1]E. Ataç, K. Yıldız, and E. E. Ülkü, “Use of PID control during Education in Reinforcement Learning on Two Wheel Balance Robot”, GUJS Part C, vol. 9, no. 4, pp. 597–607, Dec. 2021, doi: 10.29109/gujsc.955562.
ISNAD
Ataç, Emrah - Yıldız, Kazım - Ülkü, Eyüp Emre. “Use of PID Control During Education in Reinforcement Learning on Two Wheel Balance Robot”. Gazi Üniversitesi Fen Bilimleri Dergisi Part C: Tasarım ve Teknoloji 9/4 (December 1, 2021): 597-607. https://doi.org/10.29109/gujsc.955562.
JAMA
1.Ataç E, Yıldız K, Ülkü EE. Use of PID control during Education in Reinforcement Learning on Two Wheel Balance Robot. GUJS Part C. 2021;9:597–607.
MLA
Ataç, Emrah, et al. “Use of PID Control During Education in Reinforcement Learning on Two Wheel Balance Robot”. Gazi Üniversitesi Fen Bilimleri Dergisi Part C: Tasarım Ve Teknoloji, vol. 9, no. 4, Dec. 2021, pp. 597-0, doi:10.29109/gujsc.955562.
Vancouver
1.Emrah Ataç, Kazım Yıldız, Eyüp Emre Ülkü. Use of PID control during Education in Reinforcement Learning on Two Wheel Balance Robot. GUJS Part C. 2021 Dec. 1;9(4):597-60. doi:10.29109/gujsc.955562

Cited By

                                TRINDEX     16167        16166    21432    logo.png

      

    e-ISSN:2147-9526