Araştırma Makalesi

An effective method to use centralized Q-learning in multi-robot task allocation

Cilt: 27 Sayı: 5 28 Ekim 2021
  • Hatice Hilal Ezercan Kayır *
PDF İndir
TR EN

An effective method to use centralized Q-learning in multi-robot task allocation

Abstract

The use of Q-learning methods in multi-robot systems is a challenging area. Multi-robot systems have dynamic and partially observable nature because of robot’s independent decision-making and acting mechanisms. Whereas, Q-learning is defined on Markovian environments theoretically. One way to apply Q-learning in multi robot systems is centralized learning. It learns optimal Q-values for state space of overall system and joint action spaces of all agents. In this case, the system can be considered as stationary and optimal solutions can be converged. But, centralized learning requires full knowledge of the environment, perfect inter-robot communication and good computational power. Especially for large systems, the computational cost becomes huge because of exponentially growing learning space size with the number of robots. The proposed approach in this study, subG-CQL, divides the overall system into small-sized sub-groups without adversely affecting the system's task performing abilities. Each sub-group consists of less number of robots performing less tasks and learns in centralized manner for its own team. So, the learning space dimension is reduced to a reasonable level and required communication remains limited to the robots in the same the sub-group. Due the centralized learning is used, it is expected that the successful results are achieved. Experimental studies show that the proposed algorithm provides increase in the task assignment performance of the system and efficient use of system resources.

Keywords

Kaynakça

  1. [1] Arkin RC., Behavior-Based Robotics. Massachusetts, USA, MIT Press, 1998.
  2. [2] Dai W, Lu H, Xiao J, Zeng Z, Zheng Z. “Multi-robot dynamic task allocation for exploration and destruction”. Journal of Intelligent & Robotics Systems, 98, 455-479, 2020.
  3. [3] Bernstein DS, Givan R, Immerman N, Zilberstein S. “The complexity of decentralized control of Markov decision processes”. Mathematics of Operation
  4. [4] Mataric MJ. “Reinforcement learning in multi-robot domain”, Autonomous Robots, 4(1), 73-83, 1997.
  5. [5] Gerkey BP, Mataric MJ. “A formal analysis and taxonomy of task allocation in multi-robot systems”. The International Journal of Robotics Research, 23(9), 939-954, 2004.
  6. [6] Gerkey BP, Mataric MJ. “Sold!: auction methods for multi robot coordination”. IEEE Transactions on Robotics and Automation, 18(5), 758-768, 2002.
  7. [7] Dias MB, Zlot RM, Kaltra N, Stentz A. “Market-based multirobot coordination: a survey and analysis”. in Proceedings of the IEEE, 94(7), 1257-1270, 2006.
  8. [8] Zlot R, Stentz A, Dias MB, Thayer S. “Multi-robot exploration controlled by a market economy”. IEEE International Conference on Robotics and Automation, Washington DC, USA, 11-15 May 2002.

Ayrıntılar

Birincil Dil

İngilizce

Konular

Mühendislik

Bölüm

Araştırma Makalesi

Yazarlar

Hatice Hilal Ezercan Kayır * Bu kişi benim
Türkiye

Yayımlanma Tarihi

28 Ekim 2021

Gönderilme Tarihi

10 Ocak 2021

Kabul Tarihi

-

Yayımlandığı Sayı

Yıl 2021 Cilt: 27 Sayı: 5

Kaynak Göster

APA
Ezercan Kayır, H. H. (2021). An effective method to use centralized Q-learning in multi-robot task allocation. Pamukkale Üniversitesi Mühendislik Bilimleri Dergisi, 27(5), 579-588. https://izlik.org/JA38JA48RB
AMA
1.Ezercan Kayır HH. An effective method to use centralized Q-learning in multi-robot task allocation. Pamukkale Üniversitesi Mühendislik Bilimleri Dergisi. 2021;27(5):579-588. https://izlik.org/JA38JA48RB
Chicago
Ezercan Kayır, Hatice Hilal. 2021. “An effective method to use centralized Q-learning in multi-robot task allocation”. Pamukkale Üniversitesi Mühendislik Bilimleri Dergisi 27 (5): 579-88. https://izlik.org/JA38JA48RB.
EndNote
Ezercan Kayır HH (01 Ekim 2021) An effective method to use centralized Q-learning in multi-robot task allocation. Pamukkale Üniversitesi Mühendislik Bilimleri Dergisi 27 5 579–588.
IEEE
[1]H. H. Ezercan Kayır, “An effective method to use centralized Q-learning in multi-robot task allocation”, Pamukkale Üniversitesi Mühendislik Bilimleri Dergisi, c. 27, sy 5, ss. 579–588, Eki. 2021, [çevrimiçi]. Erişim adresi: https://izlik.org/JA38JA48RB
ISNAD
Ezercan Kayır, Hatice Hilal. “An effective method to use centralized Q-learning in multi-robot task allocation”. Pamukkale Üniversitesi Mühendislik Bilimleri Dergisi 27/5 (01 Ekim 2021): 579-588. https://izlik.org/JA38JA48RB.
JAMA
1.Ezercan Kayır HH. An effective method to use centralized Q-learning in multi-robot task allocation. Pamukkale Üniversitesi Mühendislik Bilimleri Dergisi. 2021;27:579–588.
MLA
Ezercan Kayır, Hatice Hilal. “An effective method to use centralized Q-learning in multi-robot task allocation”. Pamukkale Üniversitesi Mühendislik Bilimleri Dergisi, c. 27, sy 5, Ekim 2021, ss. 579-88, https://izlik.org/JA38JA48RB.
Vancouver
1.Hatice Hilal Ezercan Kayır. An effective method to use centralized Q-learning in multi-robot task allocation. Pamukkale Üniversitesi Mühendislik Bilimleri Dergisi [Internet]. 01 Ekim 2021;27(5):579-88. Erişim adresi: https://izlik.org/JA38JA48RB