Effective Exploration via Intrinsic Motivation in Reinforcement Learning
Abstract
Keywords
Ethical Statement
References
- [1] R. S. Sutton, A. G. Barto, Reinforcement learning: An introduction, Adaptive Computation and Machine Learning, MIT Press, 1998.
- [2] M. Chevalier-Boisvert, B. Dai, M. Towers, R. de Lazcano, L. Willems, S. Lahlou, S. Pal, P. S. Castro, J. Terry, Minigrid & Miniworld: Modular & Customizable Reinforcement Learning Environments for Goal-Oriented Tasks, Proceedings of the Conference on Neural Information Processing Systems, New Orleans, Louisiana, 2023, pp. 1–12.
- [3] P. Ladosz, L. Weng, M. Kim, H. Oh, Exploration in deep reinforcement learning: A survey, Information Fusion 85, (2022) 1–22.
- [4] Y. Burda, H. Edwards, A. Storkey, O. Klimov, Exploration by random network distillation, Seventh International Conference on Learning Representations, New Orleans, 2019, pp. 1–17.
- [5] M. Seurin, F. Strub, P. Preux, O. Pietquin, Don’t Do What Doesn’t Matter: Intrinsic motivation with action usefulness, Internationnal Joint Conference on Artificial Intelligence, Montreal, 2021, pp. 2950–2956.
- [6] J. Schulman, F. Wolski, P. Dhariwal, A. Radford, O. Klimov, Proximal policy optimization algorithms (2017) Accessed 13 Aug 2018.
- [7] R. Bellman, Dynamic programming, Science 153 (3731) (1966) 34–37
- [8] L. P. Kaelbling, M. L. Littman, A. R. Cassandra, Planning and acting in partially observable stochastic domains, Artificial Intelligence 101 (1-2) (1998) 99–134.
Details
Primary Language
English
Subjects
Reinforcement Learning
Journal Section
Research Article
Publication Date
June 30, 2026
Submission Date
April 3, 2026
Acceptance Date
June 16, 2026
Published in Issue
Year 2026 Volume: 12 Number: 2