Reinforcement Learning-Based Self-Healing Framework in IoT Sensor Networks
Abstract
The fast increase of devices in Internet of Things (IoT) networks, which is projected to grow to over 21 billion devices by 2025, will require more sophisticated management paradigms, as current rule-based and reactive frameworks cannot support networks of this scale. Rule-based systems are capable of processing only of predefined failure patterns and when it comes to complex situations like simultaneous multiple failures and cascading failures, they cannot work. The proposed paper suggests a self-healing framework of reinforcement learning (RL) with respect to an IoT sensor network. The original innovation of the framework is that it brings the MAPE-K cycle (Monitor-Analyze-Plan-Execute over Knowledge) framework, the fundamental reference model of autonomous computing, to which a learning element is added to form the MAPE-K+L model. This aspect provides the system to be able to enhance its policy gradually through learning on the past failures. The proposed framework has been tested in a custom Python/Gymnasium simulation framework, with six failure modes (single node failure, sensor drift, gateway failure, concurrent multiple failure, network congestion, cascading failure) in cluster topology networks, using 50 to 500 nodes. The Q-Learning and Deep Q-Network (DQN) agents were fully contrasted with random (RND) and rule-based (RB) baselines. The Q-Learning agent in the multiple failure scenario decreased the mean recovery time (MTTR) by 51.9 and 32.8 percent relative to random selection and rule-based approach respectively (p<0.001, Cohen d=1.424). The DQN agent had the best cumulative reward and the most stable performance in the cascading failure case; scalability experiments proved that DQN can work with a stable performance even in the 500-node networks.
Keywords
References
- Adeniyi, O., Sadiq, A. S., Pillai, P., Taheir, M. A., & Kaiwartya, O. (2023). Proactive self-healing approaches in mobile edge computing: A systematic literature review. Computers, 12(3), 63. https://doi.org/10.3390/computers12030063
- Albrecht, S. V., Christianos, F., & Schäfer, L. (2024). Multi-Agent Reinforcement Learning: Foundations and Modern Approaches. MIT Press.
- Aldrini, J., Chihi, I., & Sidhom, L. (2024). Fault diagnosis and self-healing for smart manufacturing: a review. Journal of Intelligent Manufacturing, 35(6), 2441-2473. https://doi.org/10.1007/s10845-023-02165-6
- Alhanaf, A. S., Balik, H. H., & Farsadi, M. (2023). Intelligent fault detection and classification schemes for smart grids based on deep neural networks. Energies, 16(22), 7680. https://doi.org/10.3390/en16227680
- Aliu, O. G., Imran, A., Imran, M. A., & Evans, B. (2013). A survey of self organisation in future cellular networks. IEEE Communications Surveys & Tutorials, 15(1), 336-361. https://doi.org/10.1109/SURV.2012.021312.00116
- Aminikhanghahi, S., & Cook, D. J. (2017). A survey of methods for time series change point detection. Knowledge and Information Systems, 51(2), 339-367. https://doi.org/10.1007/s10115-016-0987-z
- Asghar, M. Z., Nieminen, P., Hämäläinen, S., Ristaniemi, T., Imran, M. A., & Hämäläinen, T. (2017). Towards proactive context-aware self-healing for 5G networks. Computer Networks, 128, 5-13. https://doi.org/10.1016/j.comnet.2017.04.053
- Chandola, V., Banerjee, A., & Kumar, V. (2009). Anomaly detection: A survey. ACM Computing Surveys, 41(3), 15. https://doi.org/10.1145/1541880.1541882
Details
Primary Language
English
Subjects
Cyberphysical Systems and Internet of Things, System and Network Security, Network Engineering
Journal Section
Research Article
Authors
Tunahan Timuçin
*
0000-0003-0332-4118
Türkiye
Publication Date
June 30, 2026
Submission Date
March 24, 2026
Acceptance Date
April 22, 2026
Published in Issue
Year 2026 Volume: 13 Number: 2