Araştırma Makalesi

Setting Reward Function of Sensor Based DDQN Model

Sayı: 28 30 Kasım 2021
PDF İndir
TR EN

Setting Reward Function of Sensor Based DDQN Model

Öz

In this study, it is aimed to determine the appropriate reward function of the agent which trained to pass 100 obstacles/objects in Reinforcement Learning (RL) with Double Deep Q Network (DDQN) model. To train the agent, environment is split into sub problems. Several rules and different reward functions defined for the sub problems. A developed mini deep learning library which is called gNet is used for the training.

Anahtar Kelimeler

Kaynakça

  1. R. S. Sutton and A. G. Barto, Introduction to Reinforcement Learning, MIT Press, 1998.
  2. E. Ratner, D. Hadfield-Menell and A. D. Dragan, “Simplifying Reward Design through Divide-and-Conquer,” CoRR, vol. abs/1806.02501, 2018, [Online] http://arxiv.org/abs/1806.02501.
  3. Z. Hu, K. Wan, X. Gao, and Y. Zhai, “A Dynamic Adjusting Reward Function Method for Deep Reinforcement Learning with Adjustable Parameters,” Mathematical Problems in Engineering, vol. 2019, pp. 1-10, DOI: 10.1155/2019/7619483.
  4. C. J. C. H. Watkins and P. Dayan, “Q-Learning,” Machine Learning, vol. 8, 1992, pp. 279-292.
  5. R. E. Bellmann and S. E. Dreyfus, Applied Dynamic Programming, Princeton, NJ, USA: Princeton University Press, 1962.
  6. V. Mnih, K. Kavukcuoglu, D. Silver, A. Graves, I. Antonoglou, D. Wierstra and M. Riedmiller, "Playing Atari with Deep Reinforcement Learning," CoRR, vol. abs/1312.5602, 2013, [Online] https://arxiv.org/abs/1312.5602
  7. L. Lin, “Reinforcement Learning for Robots Using Neural Networks,” Ph.D. dissertation, School of Computer Science, Carnegie Mellon Univ., Pittsburgh, PA, USA, 1993.
  8. H. van Hasselt, A. Guez, D. Silver, "Deep Reinforcement Learning with Double Q-Learning," in Proc. of the AAAI Conference on Artificial Intelligence, vol. 30, No.1, 2016, [Online] https://arxiv.org/abs/1509.06461

Ayrıntılar

Birincil Dil

İngilizce

Konular

Mühendislik

Bölüm

Araştırma Makalesi

Yayımlanma Tarihi

30 Kasım 2021

Gönderilme Tarihi

12 Ekim 2021

Kabul Tarihi

14 Ekim 2021

Yayımlandığı Sayı

Yıl 2021 Sayı: 28

Kaynak Göster

APA
Kabataş, M. G., & İlhan Omurca, S. (2021). Setting Reward Function of Sensor Based DDQN Model. Avrupa Bilim ve Teknoloji Dergisi, 28, 539-544. https://doi.org/10.31590/ejosat.1008702
AMA
1.Kabataş MG, İlhan Omurca S. Setting Reward Function of Sensor Based DDQN Model. EJOSAT. 2021;(28):539-544. doi:10.31590/ejosat.1008702
Chicago
Kabataş, Mehmet Gökçay, ve Sevinç İlhan Omurca. 2021. “Setting Reward Function of Sensor Based DDQN Model”. Avrupa Bilim ve Teknoloji Dergisi, sy 28: 539-44. https://doi.org/10.31590/ejosat.1008702.
EndNote
Kabataş MG, İlhan Omurca S (01 Kasım 2021) Setting Reward Function of Sensor Based DDQN Model. Avrupa Bilim ve Teknoloji Dergisi 28 539–544.
IEEE
[1]M. G. Kabataş ve S. İlhan Omurca, “Setting Reward Function of Sensor Based DDQN Model”, EJOSAT, sy 28, ss. 539–544, Kas. 2021, doi: 10.31590/ejosat.1008702.
ISNAD
Kabataş, Mehmet Gökçay - İlhan Omurca, Sevinç. “Setting Reward Function of Sensor Based DDQN Model”. Avrupa Bilim ve Teknoloji Dergisi. 28 (01 Kasım 2021): 539-544. https://doi.org/10.31590/ejosat.1008702.
JAMA
1.Kabataş MG, İlhan Omurca S. Setting Reward Function of Sensor Based DDQN Model. EJOSAT. 2021;:539–544.
MLA
Kabataş, Mehmet Gökçay, ve Sevinç İlhan Omurca. “Setting Reward Function of Sensor Based DDQN Model”. Avrupa Bilim ve Teknoloji Dergisi, sy 28, Kasım 2021, ss. 539-44, doi:10.31590/ejosat.1008702.
Vancouver
1.Mehmet Gökçay Kabataş, Sevinç İlhan Omurca. Setting Reward Function of Sensor Based DDQN Model. EJOSAT. 01 Kasım 2021;(28):539-44. doi:10.31590/ejosat.1008702