Research Article

Reinforcement Learning-Based Self-Healing Framework in IoT Sensor Networks

Volume: 13 Number: 2 June 30, 2026

Reinforcement Learning-Based Self-Healing Framework in IoT Sensor Networks

Abstract

The fast increase of devices in Internet of Things (IoT) networks, which is projected to grow to over 21 billion devices by 2025, will require more sophisticated management paradigms, as current rule-based and reactive frameworks cannot support networks of this scale. Rule-based systems are capable of processing only of predefined failure patterns and when it comes to complex situations like simultaneous multiple failures and cascading failures, they cannot work. The proposed paper suggests a self-healing framework of reinforcement learning (RL) with respect to an IoT sensor network. The original innovation of the framework is that it brings the MAPE-K cycle (Monitor-Analyze-Plan-Execute over Knowledge) framework, the fundamental reference model of autonomous computing, to which a learning element is added to form the MAPE-K+L model. This aspect provides the system to be able to enhance its policy gradually through learning on the past failures. The proposed framework has been tested in a custom Python/Gymnasium simulation framework, with six failure modes (single node failure, sensor drift, gateway failure, concurrent multiple failure, network congestion, cascading failure) in cluster topology networks, using 50 to 500 nodes. The Q-Learning and Deep Q-Network (DQN) agents were fully contrasted with random (RND) and rule-based (RB) baselines. The Q-Learning agent in the multiple failure scenario decreased the mean recovery time (MTTR) by 51.9 and 32.8 percent relative to random selection and rule-based approach respectively (p<0.001, Cohen d=1.424). The DQN agent had the best cumulative reward and the most stable performance in the cascading failure case; scalability experiments proved that DQN can work with a stable performance even in the 500-node networks.

Keywords

References

  1. Adeniyi, O., Sadiq, A. S., Pillai, P., Taheir, M. A., & Kaiwartya, O. (2023). Proactive self-healing approaches in mobile edge computing: A systematic literature review. Computers, 12(3), 63. https://doi.org/10.3390/computers12030063
  2. Albrecht, S. V., Christianos, F., & Schäfer, L. (2024). Multi-Agent Reinforcement Learning: Foundations and Modern Approaches. MIT Press.
  3. Aldrini, J., Chihi, I., & Sidhom, L. (2024). Fault diagnosis and self-healing for smart manufacturing: a review. Journal of Intelligent Manufacturing, 35(6), 2441-2473. https://doi.org/10.1007/s10845-023-02165-6
  4. Alhanaf, A. S., Balik, H. H., & Farsadi, M. (2023). Intelligent fault detection and classification schemes for smart grids based on deep neural networks. Energies, 16(22), 7680. https://doi.org/10.3390/en16227680
  5. Aliu, O. G., Imran, A., Imran, M. A., & Evans, B. (2013). A survey of self organisation in future cellular networks. IEEE Communications Surveys & Tutorials, 15(1), 336-361. https://doi.org/10.1109/SURV.2012.021312.00116
  6. Aminikhanghahi, S., & Cook, D. J. (2017). A survey of methods for time series change point detection. Knowledge and Information Systems, 51(2), 339-367. https://doi.org/10.1007/s10115-016-0987-z
  7. Asghar, M. Z., Nieminen, P., Hämäläinen, S., Ristaniemi, T., Imran, M. A., & Hämäläinen, T. (2017). Towards proactive context-aware self-healing for 5G networks. Computer Networks, 128, 5-13. https://doi.org/10.1016/j.comnet.2017.04.053
  8. Chandola, V., Banerjee, A., & Kumar, V. (2009). Anomaly detection: A survey. ACM Computing Surveys, 41(3), 15. https://doi.org/10.1145/1541880.1541882

Details

Primary Language

English

Subjects

Cyberphysical Systems and Internet of Things, System and Network Security, Network Engineering

Journal Section

Research Article

Publication Date

June 30, 2026

Submission Date

March 24, 2026

Acceptance Date

April 22, 2026

Published in Issue

Year 2026 Volume: 13 Number: 2

APA
Timuçin, T. (2026). Reinforcement Learning-Based Self-Healing Framework in IoT Sensor Networks. Gazi University Journal of Science Part A: Engineering and Innovation, 13(2), 764-783. https://doi.org/10.54287/gujsa.1914900
AMA
1.Timuçin T. Reinforcement Learning-Based Self-Healing Framework in IoT Sensor Networks. GU J Sci, Part A. 2026;13(2):764-783. doi:10.54287/gujsa.1914900
Chicago
Timuçin, Tunahan. 2026. “Reinforcement Learning-Based Self-Healing Framework in IoT Sensor Networks”. Gazi University Journal of Science Part A: Engineering and Innovation 13 (2): 764-83. https://doi.org/10.54287/gujsa.1914900.
EndNote
Timuçin T (June 1, 2026) Reinforcement Learning-Based Self-Healing Framework in IoT Sensor Networks. Gazi University Journal of Science Part A: Engineering and Innovation 13 2 764–783.
IEEE
[1]T. Timuçin, “Reinforcement Learning-Based Self-Healing Framework in IoT Sensor Networks”, GU J Sci, Part A, vol. 13, no. 2, pp. 764–783, June 2026, doi: 10.54287/gujsa.1914900.
ISNAD
Timuçin, Tunahan. “Reinforcement Learning-Based Self-Healing Framework in IoT Sensor Networks”. Gazi University Journal of Science Part A: Engineering and Innovation 13/2 (June 1, 2026): 764-783. https://doi.org/10.54287/gujsa.1914900.
JAMA
1.Timuçin T. Reinforcement Learning-Based Self-Healing Framework in IoT Sensor Networks. GU J Sci, Part A. 2026;13:764–783.
MLA
Timuçin, Tunahan. “Reinforcement Learning-Based Self-Healing Framework in IoT Sensor Networks”. Gazi University Journal of Science Part A: Engineering and Innovation, vol. 13, no. 2, June 2026, pp. 764-83, doi:10.54287/gujsa.1914900.
Vancouver
1.Tunahan Timuçin. Reinforcement Learning-Based Self-Healing Framework in IoT Sensor Networks. GU J Sci, Part A. 2026 Jun. 1;13(2):764-83. doi:10.54287/gujsa.1914900