Research Article

A Hierarchical Reinforcement Learning Framework for UAV Path Planning in Tactical Environments

Volume: 18 Number: 1 March 29, 2023
EN

A Hierarchical Reinforcement Learning Framework for UAV Path Planning in Tactical Environments

Abstract

Tactical UAV path planning under radar threat using reinforcement learning involves particular challenges ranging from modeling related difficulties to sparse feedback problem. Learning goal-directed behavior with sparse feedback from complex environments is a fundamental challenge for reinforcement learning algorithms. In this paper we extend our previous work in this area to provide a solution to the problem setting stated above, using Hierarchical Reinforcement Learning (HRL) in a novel way that involves a meta controller for higher level goal assignment and a controller that determines the lower-level actions of the agent. Our meta controller is based on a regression model trained using a state transition scheme that defines the evolution of goal designation, whereas our lower-level controller is based on a Deep Q Network (DQN) and is trained via reinforcement learning iterations. This two-layer framework ensures that an optimal plan for a complex path, organized as multiple goals, is achieved gradually, through piecewise assignment of sub-goals, and thus as a result of a staged, efficient and rigorous procedure.

Keywords

References

  1. Abell DC, Caraway III WD. A method for the determination of target aspect angle with respect to a radar, July, 1998.
  2. Bertsekas DP. Reinforcement Learning and Optimal Control. Athena Scientific, Belmont, Massachusetts.
  3. Bouhamed O, Ghazzai H, Besbes H, Massoud Y. Autonomous uav navigation: A ddpg-based deep reinforcement learning approach, 2020.
  4. Brockman G, Cheung V, Pettersson L, Schneider J, Schulman J, Tang J, Zaremba W. Openai gym. arXiv preprint arXiv:1606.01540, 2016.
  5. Challita U, Saad W, Bettstetter C. Deep reinforcement learning for interference-aware path planning of cellular-connected uavs. In 2018 IEEE International Conference on Communications (ICC), 2018, pp. 1–7.
  6. Fujita Y, Nagarajan P, Kataoka T, Ishikawa T. Chainerrl: A deep reinforcement learning library. Journal of Machine Learning Research 2021; 22(77): 1–14.
  7. Garcia F, Rachelson E. Markov Decision Processes, pp. 1–38. John Wiley and Sons, Ltd, 2013.
  8. Gosavi A. Control Optimization with Reinforcement Learning pp. 197–268. Springer US, Boston, MA, 2015.

Details

Primary Language

English

Subjects

Engineering

Journal Section

Research Article

Publication Date

March 29, 2023

Submission Date

December 16, 2022

Acceptance Date

March 21, 2023

Published in Issue

Year 2023 Volume: 18 Number: 1

APA
Alpdemir, M. N. (2023). A Hierarchical Reinforcement Learning Framework for UAV Path Planning in Tactical Environments. Turkish Journal of Science and Technology, 18(1), 243-259. https://doi.org/10.55525/tjst.1219845
AMA
1.Alpdemir MN. A Hierarchical Reinforcement Learning Framework for UAV Path Planning in Tactical Environments. TJST. 2023;18(1):243-259. doi:10.55525/tjst.1219845
Chicago
Alpdemir, Mahmut Nedim. 2023. “A Hierarchical Reinforcement Learning Framework for UAV Path Planning in Tactical Environments”. Turkish Journal of Science and Technology 18 (1): 243-59. https://doi.org/10.55525/tjst.1219845.
EndNote
Alpdemir MN (March 1, 2023) A Hierarchical Reinforcement Learning Framework for UAV Path Planning in Tactical Environments. Turkish Journal of Science and Technology 18 1 243–259.
IEEE
[1]M. N. Alpdemir, “A Hierarchical Reinforcement Learning Framework for UAV Path Planning in Tactical Environments”, TJST, vol. 18, no. 1, pp. 243–259, Mar. 2023, doi: 10.55525/tjst.1219845.
ISNAD
Alpdemir, Mahmut Nedim. “A Hierarchical Reinforcement Learning Framework for UAV Path Planning in Tactical Environments”. Turkish Journal of Science and Technology 18/1 (March 1, 2023): 243-259. https://doi.org/10.55525/tjst.1219845.
JAMA
1.Alpdemir MN. A Hierarchical Reinforcement Learning Framework for UAV Path Planning in Tactical Environments. TJST. 2023;18:243–259.
MLA
Alpdemir, Mahmut Nedim. “A Hierarchical Reinforcement Learning Framework for UAV Path Planning in Tactical Environments”. Turkish Journal of Science and Technology, vol. 18, no. 1, Mar. 2023, pp. 243-59, doi:10.55525/tjst.1219845.
Vancouver
1.Mahmut Nedim Alpdemir. A Hierarchical Reinforcement Learning Framework for UAV Path Planning in Tactical Environments. TJST. 2023 Mar. 1;18(1):243-59. doi:10.55525/tjst.1219845

Cited By