Research Article

An effective method to use centralized Q-learning in multi-robot task allocation

Volume: 27 Number: 5 October 28, 2021
  • Hatice Hilal Ezercan Kayır *
TR EN

An effective method to use centralized Q-learning in multi-robot task allocation

Abstract

The use of Q-learning methods in multi-robot systems is a challenging area. Multi-robot systems have dynamic and partially observable nature because of robot’s independent decision-making and acting mechanisms. Whereas, Q-learning is defined on Markovian environments theoretically. One way to apply Q-learning in multi robot systems is centralized learning. It learns optimal Q-values for state space of overall system and joint action spaces of all agents. In this case, the system can be considered as stationary and optimal solutions can be converged. But, centralized learning requires full knowledge of the environment, perfect inter-robot communication and good computational power. Especially for large systems, the computational cost becomes huge because of exponentially growing learning space size with the number of robots. The proposed approach in this study, subG-CQL, divides the overall system into small-sized sub-groups without adversely affecting the system's task performing abilities. Each sub-group consists of less number of robots performing less tasks and learns in centralized manner for its own team. So, the learning space dimension is reduced to a reasonable level and required communication remains limited to the robots in the same the sub-group. Due the centralized learning is used, it is expected that the successful results are achieved. Experimental studies show that the proposed algorithm provides increase in the task assignment performance of the system and efficient use of system resources.

Keywords

References

  1. [1] Arkin RC., Behavior-Based Robotics. Massachusetts, USA, MIT Press, 1998.
  2. [2] Dai W, Lu H, Xiao J, Zeng Z, Zheng Z. “Multi-robot dynamic task allocation for exploration and destruction”. Journal of Intelligent & Robotics Systems, 98, 455-479, 2020.
  3. [3] Bernstein DS, Givan R, Immerman N, Zilberstein S. “The complexity of decentralized control of Markov decision processes”. Mathematics of Operation
  4. [4] Mataric MJ. “Reinforcement learning in multi-robot domain”, Autonomous Robots, 4(1), 73-83, 1997.
  5. [5] Gerkey BP, Mataric MJ. “A formal analysis and taxonomy of task allocation in multi-robot systems”. The International Journal of Robotics Research, 23(9), 939-954, 2004.
  6. [6] Gerkey BP, Mataric MJ. “Sold!: auction methods for multi robot coordination”. IEEE Transactions on Robotics and Automation, 18(5), 758-768, 2002.
  7. [7] Dias MB, Zlot RM, Kaltra N, Stentz A. “Market-based multirobot coordination: a survey and analysis”. in Proceedings of the IEEE, 94(7), 1257-1270, 2006.
  8. [8] Zlot R, Stentz A, Dias MB, Thayer S. “Multi-robot exploration controlled by a market economy”. IEEE International Conference on Robotics and Automation, Washington DC, USA, 11-15 May 2002.

Details

Primary Language

English

Subjects

Engineering

Journal Section

Research Article

Authors

Hatice Hilal Ezercan Kayır * This is me
Türkiye

Publication Date

October 28, 2021

Submission Date

January 10, 2021

Acceptance Date

-

Published in Issue

Year 2021 Volume: 27 Number: 5

APA
Ezercan Kayır, H. H. (2021). An effective method to use centralized Q-learning in multi-robot task allocation. Pamukkale Üniversitesi Mühendislik Bilimleri Dergisi, 27(5), 579-588. https://izlik.org/JA38JA48RB
AMA
1.Ezercan Kayır HH. An effective method to use centralized Q-learning in multi-robot task allocation. Pamukkale Üniversitesi Mühendislik Bilimleri Dergisi. 2021;27(5):579-588. https://izlik.org/JA38JA48RB
Chicago
Ezercan Kayır, Hatice Hilal. 2021. “An Effective Method to Use Centralized Q-Learning in Multi-Robot Task Allocation”. Pamukkale Üniversitesi Mühendislik Bilimleri Dergisi 27 (5): 579-88. https://izlik.org/JA38JA48RB.
EndNote
Ezercan Kayır HH (October 1, 2021) An effective method to use centralized Q-learning in multi-robot task allocation. Pamukkale Üniversitesi Mühendislik Bilimleri Dergisi 27 5 579–588.
IEEE
[1]H. H. Ezercan Kayır, “An effective method to use centralized Q-learning in multi-robot task allocation”, Pamukkale Üniversitesi Mühendislik Bilimleri Dergisi, vol. 27, no. 5, pp. 579–588, Oct. 2021, [Online]. Available: https://izlik.org/JA38JA48RB
ISNAD
Ezercan Kayır, Hatice Hilal. “An Effective Method to Use Centralized Q-Learning in Multi-Robot Task Allocation”. Pamukkale Üniversitesi Mühendislik Bilimleri Dergisi 27/5 (October 1, 2021): 579-588. https://izlik.org/JA38JA48RB.
JAMA
1.Ezercan Kayır HH. An effective method to use centralized Q-learning in multi-robot task allocation. Pamukkale Üniversitesi Mühendislik Bilimleri Dergisi. 2021;27:579–588.
MLA
Ezercan Kayır, Hatice Hilal. “An Effective Method to Use Centralized Q-Learning in Multi-Robot Task Allocation”. Pamukkale Üniversitesi Mühendislik Bilimleri Dergisi, vol. 27, no. 5, Oct. 2021, pp. 579-88, https://izlik.org/JA38JA48RB.
Vancouver
1.Hatice Hilal Ezercan Kayır. An effective method to use centralized Q-learning in multi-robot task allocation. Pamukkale Üniversitesi Mühendislik Bilimleri Dergisi [Internet]. 2021 Oct. 1;27(5):579-88. Available from: https://izlik.org/JA38JA48RB