Araştırma Makalesi

Using transfer learning models for DNA sequence similarity via fCGR method

Cilt: 14 Sayı: 2 15 Nisan 2025
PDF İndir
EN TR

Using transfer learning models for DNA sequence similarity via fCGR method

Abstract

Similarity analysis of DNA sequences is a critical issue for understanding evolutionary relationships and identifying genetic mutations. Since traditional alignment-based methods have high computational costs, this study investigated the applicability of transfer learning models for alignment-independent DNA similarity analysis. DNA sequences were visualized with the Frequency Chaos Game Representation (fCGR) method and feature extraction was performed with ResNet50, EfficientNetB0, and MobileNet models. Three similarity metrics such as cosine similarity, Euclidean distance, and correlation and four different hierarchical clustering methods were compared. The results show that cosine similarity metric reflects genetic similarities better. MobileNet provided the highest accuracy rate with its lightweight structure and efficient feature extraction. Feature vectors visualized with PCA exhibited strong clustering tendencies and were in agreement with reference phylogenetic trees. The study demonstrates the applicability of transfer learning in genetic analyses and shows that scalable and biologically meaningful analyses can be performed.

Keywords

Kaynakça

  1. Z. D. Stephens et al., Big Data: Astronomical or Genomical?, PLoS Biol, 13, 7, p. e1002195, 2015. https://doi.org/10.1371/JOURNAL.PBIO.1002195.
  2. S. B. Needleman and C. D. Wunsch, A general method applicable to the search for similarities in the amino acid sequence of two proteins, J Mol Biol, 48, 3, 443–453, Mar. 1970. https://doi.org/10.1016/0022-2836(70)90057-4.
  3. S. F. Altschul, W. Gish, W. Miller, E. W. Myers, and D. J. Lipman, Basic local alignment search tool, J Mol Biol, 215, 3, 403–410, Oct. 1990. https://doi.org/10.1016/S0022-2836(05)80360-2.
  4. S. Vinga and J. Almeida, Alignment-free sequence comparison—a review, Bioinformatics, 19, 4, 513–523, Mar. 2003. https://doi.org/10.1093/BIOINFORMATICS/BTG005
  5. H. F. Löchel and D. Heider, Chaos game representation and its applications in bioinformatics, Comput Struct Biotechnol J, 19, 6263–6271, Jan. 2021. https://doi.org/10.1016/J.CSBJ.2021.11.008.
  6. M. Yousef and J. Allmer, Deep learning in bioinformatics, Turkish Journal of Biology, 47, 6, p. 366, 2023. https://doi.org/10.55730/1300-0152.2671.
  7. H. Gunasekaran, K. Ramalakshmi, A. Rex Macedo Arokiaraj, S. D. Kanmani, C. Venkatesan, and C. S. G. Dhas, Analysis of DNA Sequence Classification Using CNN and Hybrid Models, Comput Math Methods Med, 2021, 1, p. 1835056, Jan. 2021. https://doi.org/10.1155/2021/1835056.
  8. A. Zielezinski, S. Vinga, J. Almeida, and W. M. Karlowski, Alignment-free sequence comparison: benefits, applications, and tools, Genome Biology 2017 18:1, 18, 1, 1–17, Oct. 2017. https://doi.org/10.1186/S13059-017-1319-7.

Ayrıntılar

Birincil Dil

İngilizce

Konular

Derin Öğrenme , Veri Mühendisliği ve Veri Bilimi

Bölüm

Araştırma Makalesi

Erken Görünüm Tarihi

17 Mart 2025

Yayımlanma Tarihi

15 Nisan 2025

Gönderilme Tarihi

29 Ekim 2024

Kabul Tarihi

5 Şubat 2025

Yayımlandığı Sayı

Yıl 2025 Cilt: 14 Sayı: 2

Kaynak Göster

APA
Delibaş, E. (2025). Using transfer learning models for DNA sequence similarity via fCGR method. Niğde Ömer Halisdemir Üniversitesi Mühendislik Bilimleri Dergisi, 14(2), 516-531. https://doi.org/10.28948/ngumuh.1575701
AMA
1.Delibaş E. Using transfer learning models for DNA sequence similarity via fCGR method. NÖHÜ Müh. Bilim. Derg. 2025;14(2):516-531. doi:10.28948/ngumuh.1575701
Chicago
Delibaş, Emre. 2025. “Using transfer learning models for DNA sequence similarity via fCGR method”. Niğde Ömer Halisdemir Üniversitesi Mühendislik Bilimleri Dergisi 14 (2): 516-31. https://doi.org/10.28948/ngumuh.1575701.
EndNote
Delibaş E (01 Nisan 2025) Using transfer learning models for DNA sequence similarity via fCGR method. Niğde Ömer Halisdemir Üniversitesi Mühendislik Bilimleri Dergisi 14 2 516–531.
IEEE
[1]E. Delibaş, “Using transfer learning models for DNA sequence similarity via fCGR method”, NÖHÜ Müh. Bilim. Derg., c. 14, sy 2, ss. 516–531, Nis. 2025, doi: 10.28948/ngumuh.1575701.
ISNAD
Delibaş, Emre. “Using transfer learning models for DNA sequence similarity via fCGR method”. Niğde Ömer Halisdemir Üniversitesi Mühendislik Bilimleri Dergisi 14/2 (01 Nisan 2025): 516-531. https://doi.org/10.28948/ngumuh.1575701.
JAMA
1.Delibaş E. Using transfer learning models for DNA sequence similarity via fCGR method. NÖHÜ Müh. Bilim. Derg. 2025;14:516–531.
MLA
Delibaş, Emre. “Using transfer learning models for DNA sequence similarity via fCGR method”. Niğde Ömer Halisdemir Üniversitesi Mühendislik Bilimleri Dergisi, c. 14, sy 2, Nisan 2025, ss. 516-31, doi:10.28948/ngumuh.1575701.
Vancouver
1.Emre Delibaş. Using transfer learning models for DNA sequence similarity via fCGR method. NÖHÜ Müh. Bilim. Derg. 01 Nisan 2025;14(2):516-31. doi:10.28948/ngumuh.1575701