Research Article

Effective Exploration via Intrinsic Motivation in Reinforcement Learning

Volume: 12 Number: 2 June 30, 2026

Effective Exploration via Intrinsic Motivation in Reinforcement Learning

Abstract

Reinforcement learning agents often struggle in sparse-reward environments where feedback is limited and appears only after a sequence of correct actions. In partialobservable navigation tasks, simple exploration strategies are often insufficient. This study investigates intrinsic motivation mechanisms, specifically focusing on the “Don’t Do What Doesn’t Matter” (DoWhaM) method, which rewards rare but effective actions. To address its limitations in spatial tasks, we propose Area-aware DoWhaM Adaptation (ADA). This method extends action-usefulness with spatial novelty bonuses to encourage expanding the visible area. We evaluate ADA against DoWhaM and a Count-Based baselines in various MiniGrid environments. Results indicate that ADA improves sample efficiency in the early stages of training. In dynamic environments where the layout changes in every episode, ADA significantly outperforms the Count-Based baseline and learns faster than DoWhaM. These findings suggest that combining action-usefulness with spatial novelty provides a robust heuristic for exploration in procedurally generated tasks.

Keywords

Ethical Statement

No approval from the Board of Ethics is required.

References

  1. [1] R. S. Sutton, A. G. Barto, Reinforcement learning: An introduction, Adaptive Computation and Machine Learning, MIT Press, 1998.
  2. [2] M. Chevalier-Boisvert, B. Dai, M. Towers, R. de Lazcano, L. Willems, S. Lahlou, S. Pal, P. S. Castro, J. Terry, Minigrid & Miniworld: Modular & Customizable Reinforcement Learning Environments for Goal-Oriented Tasks, Proceedings of the Conference on Neural Information Processing Systems, New Orleans, Louisiana, 2023, pp. 1–12.
  3. [3] P. Ladosz, L. Weng, M. Kim, H. Oh, Exploration in deep reinforcement learning: A survey, Information Fusion 85, (2022) 1–22.
  4. [4] Y. Burda, H. Edwards, A. Storkey, O. Klimov, Exploration by random network distillation, Seventh International Conference on Learning Representations, New Orleans, 2019, pp. 1–17.
  5. [5] M. Seurin, F. Strub, P. Preux, O. Pietquin, Don’t Do What Doesn’t Matter: Intrinsic motivation with action usefulness, Internationnal Joint Conference on Artificial Intelligence, Montreal, 2021, pp. 2950–2956.
  6. [6] J. Schulman, F. Wolski, P. Dhariwal, A. Radford, O. Klimov, Proximal policy optimization algorithms (2017) Accessed 13 Aug 2018.
  7. [7] R. Bellman, Dynamic programming, Science 153 (3731) (1966) 34–37
  8. [8] L. P. Kaelbling, M. L. Littman, A. R. Cassandra, Planning and acting in partially observable stochastic domains, Artificial Intelligence 101 (1-2) (1998) 99–134.

Details

Primary Language

English

Subjects

Reinforcement Learning

Journal Section

Research Article

Publication Date

June 30, 2026

Submission Date

April 3, 2026

Acceptance Date

June 16, 2026

Published in Issue

Year 2026 Volume: 12 Number: 2

APA
Eren, B., & Demir, A. (2026). Effective Exploration via Intrinsic Motivation in Reinforcement Learning. Journal of Advanced Research in Natural and Applied Sciences, 12(2), 171-192. https://doi.org/10.28979/jarnas.1922504
AMA
1.Eren B, Demir A. Effective Exploration via Intrinsic Motivation in Reinforcement Learning. JARNAS. 2026;12(2):171-192. doi:10.28979/jarnas.1922504
Chicago
Eren, Berkay, and Alper Demir. 2026. “Effective Exploration via Intrinsic Motivation in Reinforcement Learning”. Journal of Advanced Research in Natural and Applied Sciences 12 (2): 171-92. https://doi.org/10.28979/jarnas.1922504.
EndNote
Eren B, Demir A (June 1, 2026) Effective Exploration via Intrinsic Motivation in Reinforcement Learning. Journal of Advanced Research in Natural and Applied Sciences 12 2 171–192.
IEEE
[1]B. Eren and A. Demir, “Effective Exploration via Intrinsic Motivation in Reinforcement Learning”, JARNAS, vol. 12, no. 2, pp. 171–192, June 2026, doi: 10.28979/jarnas.1922504.
ISNAD
Eren, Berkay - Demir, Alper. “Effective Exploration via Intrinsic Motivation in Reinforcement Learning”. Journal of Advanced Research in Natural and Applied Sciences 12/2 (June 1, 2026): 171-192. https://doi.org/10.28979/jarnas.1922504.
JAMA
1.Eren B, Demir A. Effective Exploration via Intrinsic Motivation in Reinforcement Learning. JARNAS. 2026;12:171–192.
MLA
Eren, Berkay, and Alper Demir. “Effective Exploration via Intrinsic Motivation in Reinforcement Learning”. Journal of Advanced Research in Natural and Applied Sciences, vol. 12, no. 2, June 2026, pp. 171-92, doi:10.28979/jarnas.1922504.
Vancouver
1.Berkay Eren, Alper Demir. Effective Exploration via Intrinsic Motivation in Reinforcement Learning. JARNAS. 2026 Jun. 1;12(2):171-92. doi:10.28979/jarnas.1922504

 

 

 

TR Dizin 20466
 

 

SAO/NASA Astrophysics Data System (ADS)    34270

                                                   American Chemical Society-Chemical Abstracts Service CAS    34922 

 

DOAJ 32869

EBSCO 32870

Scilit 30371                        

SOBİAD 20460

 

29804 JARNAS is licensed under a Creative Commons Attribution-NonCommercial 4.0 International Licence (CC BY-NC).