APPROXIMATE DYNAMIC PROGRAMMING FOR OPTIMAL SEARCH WITH AN OBSTACLE
Öz
In this paper, we study a class of optimal
search problems where the search region includes a target and an obstacle, each
of which has some shape. The search region is divided into small grid cells and
the searcher examines one of those cells at each time period with the objective
of finding the target with minimum expected cost. The searcher may either take
an action that is quick but risky, or another one that is slow but safe, and
incurs different cost for these actions. We formulate these problems as Markov
Decision Processes (MDPs), but because of the intractability of the state
space, we approximately solve the MDPs using an Approximate Dynamic Programming
(ADP) technique and compare its performance against heuristic decision rules.
Our numerical experiments reveal that the ADP technique outperforms heuristics
on majority of problem instances. Specifically, the ADP technique performs
better than the best heuristic policy in 17 out of 24 problem sets. The percent
improvement in those 17 problem sets is on average 7.3%.
Anahtar Kelimeler
Kaynakça
- Adelman D. “Price-directed replenishment of subsets: methodology and its application to inventory routing”. Manufacturing and Service Operations Management. 5, 4, 348-371, 2003.
- Adelman D. “A price-directed approach to stochastic inventory routing”. Operations Research. 52, 4, 499-514, 2004.
- Benkoski S J. Monticino, M. G., Weisinger, J. R.. “A survey of the search theory literature”. Naval Research Logistics. 38, 469-494, 1991.
- Bertsekas D, Tsitsiklis J. “Neuro-Dynamic Programming”, Athena Scientific, 1996.
- Botea A, Baier J, Harabor D, Hernandez C. “Moving target search with compressed path databases”. In Proceedings of ICAPS-13, 2013.
- Chang HS, Fu MC, Hu J, Marcus SI. “Simulation-based algorithms for Markov Decision Processes”, Springer, 2007.
- Chung TH, Burdick JW. “A decision-making framework for control strategies in probabilistic search”. Proceedings of IEEE International Conference on Robotics and Automation, 2007.
- Chung TH. “On probabilistic search decisions under searcher motion constraints”. Algorithmic Foundations of Robotics, 2010.
Ayrıntılar
Birincil Dil
İngilizce
Konular
Mühendislik
Bölüm
Araştırma Makalesi
Yazarlar
Yasin Göçgün
Türkiye
Yayımlanma Tarihi
1 Mart 2019
Gönderilme Tarihi
30 Nisan 2018
Kabul Tarihi
8 Ağustos 2018
Yayımlandığı Sayı
Yıl 2019 Cilt: 7 Sayı: 1