Deep Reinforcement Learning for Simulation-Based Determination of COVID-19 Pandemic Mitigation Policies

Mahmut Lütfullah Özbilen; Emre Eğriboz; Ruşen Halepmollası; İsmail Bilgen; Mehmet Haklıdır

Short Report

Deep Reinforcement Learning for Simulation-Based Determination of COVID-19 Pandemic Mitigation Policies

Year 2021, Volume: 1 Issue: 2, 29 - 38, 30.09.2021

Mahmut Lütfullah Özbilen Emre Eğriboz Ruşen Halepmollası İsmail Bilgen Mehmet Haklıdır

Abstract

A global health emergency has been declared by WHO at the beginning of 2020 based on the increasing number of cases in the COVID-19 epidemic. Governments around the world have taken unprecedented measures. However, there is no guarantee that the measures taken are best to mitigate the effect of pandemic. We investigate the impact of government policies regarding interventions on deaths related to the COVID-19 and mitigation of the economic decline. In a simulation environment, we use Reinforcement Learning (RL) to explore the optimal policies to prevent COVID-19 outbreak. We use a specific simulator called PandemicSimulator which has detailed abilities to simulate spread of disease and people interactions at different locations. The simulator is utilized to train RL agents to take mitigation policies with minimum economic damage of the pandemic without exceeding the hospital capacity. We use Deep Q Networks to train the RL agent. We compare the performance of the our agent’s policy with the policy applied by the United Kingdom in terms of critical patients, deaths and economic damage. Results show that policies improved by the RL agent can help decision makers in the pandemic mitigation policies.

Keywords

reinforcement learning , deep learning , COVID-19 , pandemic

References

1. Aleta, A., Martin-Corral, D., y Piontti, A. P., Ajelli, M., Litvinova, M., Chinazzi, M., Dean, N. E., Halloran, M. E., Longini Jr, I. M., Merler, S., et al. (2020). Modelling the impact of testing, contact tracing and house- hold quarantine on second waves of covid-19. Nature Human Behaviour, 4 (9), 964–971.
2. Awasthi, R., Guliani, K. K., Bhatt, A., Gill, M. S., Nagori, A., Kumaraguru, P., & Sethi, T. (2020). Vacsim: Learning effective strategies for covid19 vaccine distribution using reinforcement learning. arXiv preprint arXiv:2009.06602.
3. Arregui, S., Aleta, A., Sanz, J., & Moreno, Y. (2018). Projecting social contact matrices to different demographic structures. PLoS computational biology, 14 (12), e1006638.
4. Anderson, R. M., Heesterbeek, H., Klinkenberg, D., & Hollingsworth, T.D. (2020) How will country-based mitigation measures influence thecourse of the covid-19 epidemic? The lancet, 395 (10228), 931–934. Bednarski, B. P., Singh, A. D., & Jones, W. M. (2020). On collaborative re- inforcement learning to optimize the redistribution of critical medical supplies throughout the covid-19 pandemic. Journal of the American Medical Informatics Association.
5. Dervisoglu, H., Bilgen, I, Halepmollasi, R., Can, B., Haklidir, M., (2021) Unfairness of Deep Learning Methods Arising Gender Bias in Covid-19 Diagnosis of Medical Images. Artificial Intelligence Theory and Application, 2, (Special Issue), 81-94.
6. Elgin, C., Basbug, G., & Yalaman, A. (2020). Economic policy responses to a pandemic: Developing the covid-19 economic stimulus index. Covid Economics, 1 (3), 40–53.M.L. Ozbilen et al. Artificial Intelligence Theory and Applications: 3 (2021) 29-38
7. Gottesman, O., Johansson, F., Komorowski, M., Faisal, A., Sontag, D., Doshi- Velez, F., & Celi, L. A. (2019). Guidelines for reinforcement learning in healthcare. Nature medicine, 25 (1), 16–18.
8. Hu, C., Lovejoy, W. S., & Shafer, S. L. (1994). Comparison of some control strategies for threecompartment pk/pd models. Journal of Pharma- cokinetics and Biopharmaceutics, 22 (6), 525–550.
9. Jaiswal, A. K., Tiwari, P., Kumar, S., Gupta, D., Khanna, A., & Rodrigues,J. J. (2019). Identifying pneumonia in chest x-rays: A deep learning approach. Measurement, 145, 511–518.
10. Kompella*, V., Capobianco*, R., Jong, S., Browne, J., Fox, S., Meyers, L., Wurman, P., & Stone, P. (2020). Reinforcement learning for optimization of covid- 19 mitigation policies.
11. Li, C. Y., Liang, X., Hu, Z., & Xing, E. P. (2018). Hybrid retrieval-generation reinforced agent for medical image report generation. arXiv preprint arXiv:1805.08298.
12. Liu, C. (2020). A microscopic epidemic model and pandemic prediction using multiagent reinforcement learning. arXiv preprint arXiv:2004.12959.
13. Martin-Calvo, D., Aleta, A., Pentland, A., Moreno, Y., & Moro, E. (2020) Effectiveness of social distancing strategies for protecting a community from a pandemic with a data driven contact network based on census and real-world mobility data. Complex. Dig.
14. Max Roser, E. O.-O., Hannah Ritchie, & Hasell, J. (2020). Coronavirus pan- demic (covid-19) [https://ourworldindata.org/coronavirus]. Our World in Data. 15. Mnih, V., Kavukcuoglu, K., Silver, D., Graves, A., Antonoglou, I.,Wierstra, D., & Riedmiller, M. A. (2013). Playing atari with deep reinforcement learning. CoRR, abs/1312.5602. http://arxiv.org/abs/1312.5602
16. Najar, O., (2021) Hypothetical Framework For Early Detection of Covid19 From Symptomatic Information by Using Deep Learning. Artificial Intelligence Theory and Application, 2, (Special Issue), 115-121.
17. Organization, W. H. et al. (2020). Coronavirus disease 2019 (covid-19): Situation report, 82. Ozbilen, M., Egriboz, E., Halepmollasi, R., Bilgen, I, Haklidir, M., (2021) A Deep Reinforcement Learning Approach to Explore Optimal Policies for Covid-19 Pandemic Mitigation: Preliminary Analysis. Artificial Intelligence Theory and Application, 2, (Special Issue).
18. Schaefer, A. J., Bailey, M. D., Shechter, S. M., & Roberts, M. S. (2005). Modeling medical treatment using markov decision processes. Operations research and health care (pp. 593–612). Springer.
19. Taubenberger, J. K., & Morens, D. M. (2006). 1918 influenza: The mother of all pandemics. Revista Biomedica, 17 (1), 69–79.
20. Yu, C., Liu, J., & Nemati, S. (2019). Reinforcement learning in healthcare: A survey. arXiv preprint arXiv:1908.08796.

There are 19 citations in total.

Details

Primary Language	English
Subjects	Engineering, Clinical Sciences
Journal Section	Research Articles
Authors	Mahmut Lütfullah Özbilen This is me Emre Eğriboz This is me Ruşen Halepmollası This is me İsmail Bilgen This is me Mehmet Haklıdır This is me
Publication Date	September 30, 2021
Published in Issue	Year 2021 Volume: 1 Issue: 2

Cite

APA	Özbilen, M. L., Eğriboz, E., Halepmollası, R., … Bilgen, İ. (2021). Deep Reinforcement Learning for Simulation-Based Determination of COVID-19 Pandemic Mitigation Policies. Artificial Intelligence Theory and Applications, 1(2), 29-38.

Download Cover Image

Article Files

Full Text