Research Article

A k-mer based metaheuristic approach for detecting COVID-19 variants

Volume: 14 Number: 1 March 23, 2023
EN TR

A k-mer based metaheuristic approach for detecting COVID-19 variants

Abstract

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) belongs to coronaviridae family and a change in the genetic sequence of SARS-CoV-2 is named as a mutation that causes to variants of SARS-CoV-2. In this paper, we propose a novel and efficient method to predict SARS-CoV-2 variants of concern from whole human genome sequences. In this method, we describe 16 dinucleotide and 64 trinucleotide features to differentiate SARS-CoV-2 variants of concern. The efficacy of the proposed features is proved by using four classifiers, k-nearest neighbor, support vector machines, multilayer perceptron, and random forest. The proposed method is evaluated on the dataset including 223,326 complete human genome sequences including recently designated variants of concern, Alpha, Beta, Gamma, Delta, and Omicron variants. Experimental results present that overall accuracy for detecting SARS-CoV-2 variants of concern remarkably increases when trinucleotide features rather than dinucleotide features are used. Furthermore, we use the whale optimization algorithm, which is a state-of-the-art method for reducing the number of features and choosing the most relevant features. We select 44 trinucleotide features out of 64 to differentiate SARS-CoV-2 variants with acceptable accuracy as a result of the whale optimization method. Experimental results indicate that the SVM classifier with selected features achieves about 99% accuracy, sensitivity, specificity, precision on average. The proposed method presents an admirable performance for detecting SARS-CoV-2 variants.

Keywords

References

  1. [1] Volz, E., Mishra, S., Chand, M., Barrett, J. C., & al., R. J. et. (2021). Assessing transmissibility of SARS-CoV-2 lineage B.1.1.7 in England. Nature, 593(7858), 266–269. doi:10.1038/s41586-021-03470-x
  2. [2] Lauring, A. S., & Malani, P. N. (09 2021). Variants of SARS-CoV-2. JAMA, 326(9), 880–880. doi:10.1001/jama.2021.14181
  3. [3] Tegally, H., Wilkinson, E., Giovanetti, M., & al., A. I. et. (2021). Detection of a SARS-CoV-2 variant of concern in South Africa. Nature, 592(7854), 438–443. doi:10.1038/s41586-021-03402-9
  4. [4] Sabino, E. C., Buss, L. F., Carvalho, M. P. S., & al., E. (2021). Resurgence of COVID-19 in Manaus, Brazil, despite high seroprevalence. The Lancet, 397(10273), 452–455. doi:10.1016/s0140-6736(21)00183-5
  5. [5] Mlcochova, P., Kemp, S. A., Dhar, M. S., & al., G. P. et. (2021). SARS-CoV-2 B.1.617.2 Delta variant replication and immune evasion. Nature, 599(7883), 114–119. doi:10.1038/s41586-021-03944-y
  6. [6] Sahoo, J. P., & Samal, K. C. (2021). World on alert: WHO designated south African new COVID strain (Omicron/B.1.1.529) as a variant of concern. Biotica Research Today, 3(11), 1086–1088.
  7. [7] Jiang, X., Coffee, M., Bari, A., Wang, J., Jiang, X., Huang, J., … Huang, Y. (2020). Towards an Artificial Intelligence Framework for Data-Driven Prediction of Coronavirus Clinical Severity. Computers, Materials $\&$ Continua, 62(3), 537–551. doi:10.32604/cmc.2020.010691
  8. [8] Zoabi, Y., Deri-Rozov, S., & Shomron, N. (2021). Machine learning-based prediction of COVID-19 diagnosis based on symptoms. Npj Digital Medicine, 4(1), 3. doi:10.1038/s41746-020-00372-6

Details

Primary Language

English

Subjects

-

Journal Section

Research Article

Publication Date

March 23, 2023

Submission Date

October 27, 2022

Acceptance Date

February 3, 2023

Published in Issue

Year 2023 Volume: 14 Number: 1

APA
Arslan, H. (2023). A k-mer based metaheuristic approach for detecting COVID-19 variants. Dicle Üniversitesi Mühendislik Fakültesi Mühendislik Dergisi, 14(1), 17-26. https://doi.org/10.24012/dumf.1195600
AMA
1.Arslan H. A k-mer based metaheuristic approach for detecting COVID-19 variants. DUJE. 2023;14(1):17-26. doi:10.24012/dumf.1195600
Chicago
Arslan, Hilal. 2023. “A K-Mer Based Metaheuristic Approach for Detecting COVID-19 Variants”. Dicle Üniversitesi Mühendislik Fakültesi Mühendislik Dergisi 14 (1): 17-26. https://doi.org/10.24012/dumf.1195600.
EndNote
Arslan H (March 1, 2023) A k-mer based metaheuristic approach for detecting COVID-19 variants. Dicle Üniversitesi Mühendislik Fakültesi Mühendislik Dergisi 14 1 17–26.
IEEE
[1]H. Arslan, “A k-mer based metaheuristic approach for detecting COVID-19 variants”, DUJE, vol. 14, no. 1, pp. 17–26, Mar. 2023, doi: 10.24012/dumf.1195600.
ISNAD
Arslan, Hilal. “A K-Mer Based Metaheuristic Approach for Detecting COVID-19 Variants”. Dicle Üniversitesi Mühendislik Fakültesi Mühendislik Dergisi 14/1 (March 1, 2023): 17-26. https://doi.org/10.24012/dumf.1195600.
JAMA
1.Arslan H. A k-mer based metaheuristic approach for detecting COVID-19 variants. DUJE. 2023;14:17–26.
MLA
Arslan, Hilal. “A K-Mer Based Metaheuristic Approach for Detecting COVID-19 Variants”. Dicle Üniversitesi Mühendislik Fakültesi Mühendislik Dergisi, vol. 14, no. 1, Mar. 2023, pp. 17-26, doi:10.24012/dumf.1195600.
Vancouver
1.Hilal Arslan. A k-mer based metaheuristic approach for detecting COVID-19 variants. DUJE. 2023 Mar. 1;14(1):17-26. doi:10.24012/dumf.1195600

Cited By