EN
Improving Sample Efficiency of Reinforcement Learning Control Using Autoencoders
Abstract
This study presents a novel approach for improving the sample efficiency of reinforcement learning (RL) control of dynamic systems by utilizing autoencoders. The main objective of this research is to investigate the effectiveness of autoencoders in enhancing the learning process and improving the resulting policies in RL control problems. In literature most applications use only autoencoder’s latent space while learning. This approach can cause loss of information, difficulty in interpreting latent space, difficulty in handling dynamic environments and outdated representation. In this study, proposed novel approach overcomes these problems and enhances sample efficiency using both states and their latent space while learning. The methodology consists of two main steps. First, a denoising-contractive autoencoder is developed and implemented for RL control problems, with a specific focus on its applicability to state representation and feature extraction. The second step involves training a Deep Reinforcement Learning algorithm using the augmented states generated by the autoencoder. The algorithm is compared against a baseline Deep Q-Network (DQN) algorithm in the LunarLander environment, where observations from the environment are subject to Gaussian noise.
Keywords
References
- R. S. Sutton and A. G. Barto, Reinforcement learning: An introduction. MIT press, 2018.
- V. Mnih, K. Kavukcuoglu, D. Silver, et al., “Humanlevel control through deep reinforcement learning,” nature, vol. 518, no. 7540, pp. 529–533, 2015.
- D. Silver, J. Schrittwieser, K. Simonyan, et al., “Mastering the game of go without human knowledge,” nature, vol. 550, no. 7676, pp. 354–359, 2017.
- G. Dulac-Arnold, D. Mankowitz, and T. Hester, Challenges of real-world reinforcement learning, 2019. arXiv: 1904.12901 [cs.LG].
- R. S. Sutton, “Dyna, an integrated architecture for learning, planning, and reacting,” ACM Sigart Bulletin, vol. 2, no. 4, pp. 160–163, 1991.
- K. Chua, R. Calandra, R. McAllister, and S. Levine, “Deep reinforcement learning in a handful of trials using probabilistic dynamics models,” Advances in neural information processing systems, vol. 31, 2018.
- R. Agarwal, C. Liang, D. Schuurmans, and M. Norouzi, “Learning to generalize from sparse and underspecified rewards,” in International conference on machine learning, PMLR, 2019, pp. 130–140.
- T. Nguyen, T. M. Luu, T. Vu, and C. D. Yoo, “Sample-efficient reinforcement learning representation learning with curiosity contrastive forward dynamics model,” in 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), IEEE, 2021, pp. 3471–3477.
Details
Primary Language
English
Subjects
Artificial Intelligence (Other), Control Engineering, Mechatronics and Robotics (Other)
Journal Section
Research Article
Publication Date
July 20, 2024
Submission Date
May 9, 2024
Acceptance Date
May 15, 2024
Published in Issue
Year 2024 Volume: 1 Number: 1
APA
Er, B., & Doğan, M. (2024). Improving Sample Efficiency of Reinforcement Learning Control Using Autoencoders. ITU Computer Science AI and Robotics, 1(1), 39-48. https://izlik.org/JA28FE73FT
AMA
1.Er B, Doğan M. Improving Sample Efficiency of Reinforcement Learning Control Using Autoencoders. ITU Computer Science AI and Robotics. 2024;1(1):39-48. https://izlik.org/JA28FE73FT
Chicago
Er, Burak, and Mustafa Doğan. 2024. “Improving Sample Efficiency of Reinforcement Learning Control Using Autoencoders”. ITU Computer Science AI and Robotics 1 (1): 39-48. https://izlik.org/JA28FE73FT.
EndNote
Er B, Doğan M (July 1, 2024) Improving Sample Efficiency of Reinforcement Learning Control Using Autoencoders. ITU Computer Science AI and Robotics 1 1 39–48.
IEEE
[1]B. Er and M. Doğan, “Improving Sample Efficiency of Reinforcement Learning Control Using Autoencoders”, ITU Computer Science AI and Robotics, vol. 1, no. 1, pp. 39–48, July 2024, [Online]. Available: https://izlik.org/JA28FE73FT
ISNAD
Er, Burak - Doğan, Mustafa. “Improving Sample Efficiency of Reinforcement Learning Control Using Autoencoders”. ITU Computer Science AI and Robotics 1/1 (July 1, 2024): 39-48. https://izlik.org/JA28FE73FT.
JAMA
1.Er B, Doğan M. Improving Sample Efficiency of Reinforcement Learning Control Using Autoencoders. ITU Computer Science AI and Robotics. 2024;1:39–48.
MLA
Er, Burak, and Mustafa Doğan. “Improving Sample Efficiency of Reinforcement Learning Control Using Autoencoders”. ITU Computer Science AI and Robotics, vol. 1, no. 1, July 2024, pp. 39-48, https://izlik.org/JA28FE73FT.
Vancouver
1.Burak Er, Mustafa Doğan. Improving Sample Efficiency of Reinforcement Learning Control Using Autoencoders. ITU Computer Science AI and Robotics [Internet]. 2024 Jul. 1;1(1):39-48. Available from: https://izlik.org/JA28FE73FT