Research Article

Integrating Proximal Policy Optimization with Physically Realistic Simulation for Robust Autonomous Underwater Vehicle Control

Volume: 13 Number: 1 March 31, 2026

Integrating Proximal Policy Optimization with Physically Realistic Simulation for Robust Autonomous Underwater Vehicle Control

Abstract

This study presents the design and implementation of a reinforcement learning (RL)-based framework for the control of an autonomous underwater vehicle (AUV) directly within Unreal Engine (UE). A high-fidelity aquatic environment was created using UE’s native Water System to simulate hydrodynamic forces and buoyancy. Unlike studies assuming continuous control, this research addresses the challenge of stabilizing an AUV subject to severe discrete 'bang-bang' hardware constraints. A parallelized Proximal Policy Optimization (PPO) algorithm was employed to synthesize adaptive control policies. Comparative analysis against tuned Proportional-Integral-Derivative (PID) baselines demonstrates that the RL agent outperforms classical methods in three key metrics: (1) in longitudinal navigation, the agent learned an emergent "pulsing" strategy—mimicking Pulse-Width Modulation (PWM)—to overcome these discrete actuation constraints, reducing steady-state error compared to the Proportional-Derivative (PD) baseline; (2) in vertical depth control, the agent autonomously learned gravity compensation, settling faster than integral-based controllers while avoiding buoyancy-induced stalling; and (3) in heading control, the agent demonstrated superior dynamic handling, completing stabilization maneuvers faster than the baseline. The key architectural innovation lies in the direct integration of UE’s Learning Agents plugin, eliminating the need for external middleware. This native integration enables real-time synchronization between simulation physics and learning processes, establishing a high-fidelity platform for developing intelligent underwater control systems.

Keywords

References

  1. Amer, A., Álvarez-Tuñón, O., Uğurlu, H. İ., Le Fevre Sejersen, J., Brodskiy, Y., & Kayacan, E. (2023, December 5-8). UNav-Sim: A visually realistic underwater robotics simulator and synthetic data-generation framework. 2023 21st International Conference on Advanced Robotics (ICAR), 570–576. Abu Dhabi, UAE. https://doi.org/10.1109/ICAR58858.2023.10406819
  2. Behrje, U., Amory, A., Meyer, B., & Maehle, E. (2018, June 20–21). System identification and sliding mode depth control of the micro AUV SEMBIO. Proceedings of the 50th International Symposium on Robotics (ISR 2018), 344–351. Munich, Germany.
  3. Benjamin, M. R., Schmidt, H., Newman, P. M., & Leonard, J. J. (2010). Nested autonomy for unmanned marine vehicles with MOOS-IvP. Journal of Field Robotics, 27(5), 834–875.
  4. Cai, L., Chang, K., & Girdhar, Y. (2025). Learning to swim: Reinforcement learning for 6-DOF control of thruster-driven autonomous underwater vehicles. 2025 IEEE International Conference on Robotics and Automation (ICRA). https://arxiv.org/abs/2410.00120
  5. Chamusca, I. L., De Jesus Santos, F. V., Ferreira, C. V., Murari, T. B., Apolinario Junior, A. L., & Winkler, I. (2022). Evaluation of design guidelines for the development of intuitive virtual reality authoring tools: A case study with NVIDIA Omniverse. 2022 IEEE International Symposium on Mixed and Augmented Reality Adjunct (ISMAR-Adjunct), 419–424. Singapore. https://doi.org/10.1109/ISMAR-Adjunct57072.2022.00078
  6. Eriksson, J., & Wingård, J. (2022). Improving the accuracy of FFT-based GPGPU ocean surface simulations [MSc Thesis, Chalmers University of Technology and University of Gothenburg].
  7. Farhang, A. R., Mulcahy, B., Holden, D., Matthews, I., & Yue, Y. (2024). Humanlike behavior in a third-person shooter with imitation learning. 2024 IEEE Conference on Games (CoG), 1–4. Milan, Italy. https://doi.org/10.1109/CoG60054.2024.10645651
  8. Fossen, T. I. (2011). Handbook of marine craft hydrodynamics and motion control. Chichester, UK: John Wiley & Sons.

Details

Primary Language

English

Subjects

Autonomous Agents and Multiagent Systems

Journal Section

Research Article

Publication Date

March 31, 2026

Submission Date

October 30, 2025

Acceptance Date

March 24, 2026

Published in Issue

Year 2026 Volume: 13 Number: 1

APA
Mol, M., & Karaarslan, A. (2026). Integrating Proximal Policy Optimization with Physically Realistic Simulation for Robust Autonomous Underwater Vehicle Control. Gazi University Journal of Science Part A: Engineering and Innovation, 13(1), 165-199. https://doi.org/10.54287/gujsa.1813751
AMA
1.Mol M, Karaarslan A. Integrating Proximal Policy Optimization with Physically Realistic Simulation for Robust Autonomous Underwater Vehicle Control. GU J Sci, Part A. 2026;13(1):165-199. doi:10.54287/gujsa.1813751
Chicago
Mol, Mahmut, and Ahmet Karaarslan. 2026. “Integrating Proximal Policy Optimization With Physically Realistic Simulation for Robust Autonomous Underwater Vehicle Control”. Gazi University Journal of Science Part A: Engineering and Innovation 13 (1): 165-99. https://doi.org/10.54287/gujsa.1813751.
EndNote
Mol M, Karaarslan A (March 1, 2026) Integrating Proximal Policy Optimization with Physically Realistic Simulation for Robust Autonomous Underwater Vehicle Control. Gazi University Journal of Science Part A: Engineering and Innovation 13 1 165–199.
IEEE
[1]M. Mol and A. Karaarslan, “Integrating Proximal Policy Optimization with Physically Realistic Simulation for Robust Autonomous Underwater Vehicle Control”, GU J Sci, Part A, vol. 13, no. 1, pp. 165–199, Mar. 2026, doi: 10.54287/gujsa.1813751.
ISNAD
Mol, Mahmut - Karaarslan, Ahmet. “Integrating Proximal Policy Optimization With Physically Realistic Simulation for Robust Autonomous Underwater Vehicle Control”. Gazi University Journal of Science Part A: Engineering and Innovation 13/1 (March 1, 2026): 165-199. https://doi.org/10.54287/gujsa.1813751.
JAMA
1.Mol M, Karaarslan A. Integrating Proximal Policy Optimization with Physically Realistic Simulation for Robust Autonomous Underwater Vehicle Control. GU J Sci, Part A. 2026;13:165–199.
MLA
Mol, Mahmut, and Ahmet Karaarslan. “Integrating Proximal Policy Optimization With Physically Realistic Simulation for Robust Autonomous Underwater Vehicle Control”. Gazi University Journal of Science Part A: Engineering and Innovation, vol. 13, no. 1, Mar. 2026, pp. 165-99, doi:10.54287/gujsa.1813751.
Vancouver
1.Mahmut Mol, Ahmet Karaarslan. Integrating Proximal Policy Optimization with Physically Realistic Simulation for Robust Autonomous Underwater Vehicle Control. GU J Sci, Part A. 2026 Mar. 1;13(1):165-99. doi:10.54287/gujsa.1813751