Araştırma Makalesi
BibTex RIS Kaynak Göster

DNA dizileri için sezgisel yöntemlere dayalı çoklu dizi hizalama yaklaşımlarının karşılaştırılması

Yıl 2026, Cilt: 41 Sayı: 1 , 479 - 494 , 31.03.2026
https://doi.org/10.17341/gazimmfd.1610635
https://izlik.org/JA88UP94XJ

Öz

Biyoenformatik, biyolojik verileri kavramsallaştırmak ve aralarındaki ilişkileri anlamak için matematik, bilgi işlem ve istatistikten faydalanan bir bilim dalıdır. Artan biyolojik veri hacmi, dizi hizalama problemini daha karmaşık hale getirerek manuel çözümleri imkânsız kılmıştır. Bu nedenle, otomatik hesaplama sistemleri geliştirilmiştir. Dizi hizalaması, çiftli ve çoklu olmak üzere iki kategoriye ayrılır. Bu çalışma, genetik algoritma (GA), diferansiyel evrim (DE) ve simüle edilmiş tavlama (SA) algoritmaları gibi meta-sezgisel yöntemleri kullanarak çoklu dizi hizalama problemine odaklanmaktadır. GA, DE, GASA ve DESA algoritmaları önerilmiş ve DNA dizileri üzerinde Needleman-Wunsch algoritmasıyla hizalamalar yapılmıştır. Deneyler, GA'nın en iyi çalışma zamanına sahip olduğunu, DE ve DESA'nın ise daha uzun sürdüğünü göstermiştir. GA, DE ve DESA, GASA'ya göre daha yüksek hizalama puanları üretmiştir. Önerilen algoritmalar, Clustal algoritmasına kıyasla çoğu veri kümesinde daha iyi hizalama puanları sunarken, Clustal en hızlı çalışmayı gerçekleştirmiştir. Ayrıca, önerilen yöntemlerin aynı uzunluktaki diziler üzerinde mevcut algoritmalardan üstün performans gösterdiği tespit edilmiştir. Özellikle dört veri kümesinde (trnN-GUU_rps12, rrn4.5_rps12, rrn5_rps12, psbT_pbf1) önerilen algoritmalar daha yüksek puanlar elde etmiştir.

Destekleyen Kurum

Ege Üniversitesi Bilimsel Araştırma Projeleri Koordinatörlüğü

Proje Numarası

FOA-2020-20981

Teşekkür

Bu çalışma, Ege Üniversitesi Bilimsel Araştırma Projeleri Koordinatörlüğü tarafından "Cicer ve Lens Türlerinin Kloroplast DNA Dizilerinin Yeni Nesil Dizileme ile Dizilenmesi ve Genom Organizasyonlarının Belirlenerek Karşılaştırmalı Genom Analizlerinin Yapılması" başlıklı proje kapsamında "FO A-2020-20981" proje numarası ile maddi olarak desteklenmiştir.

Kaynakça

  • 1. Cohen, J., Bioinformatics-an introduction for computer scientists, ACM Comput. Surv. ,36 (2), 122-158, 2004.
  • 2. Luscombe N.M., Greenbaum D., Gerstein M., What is bioinformatics? An introduction and overview, Yearbook of medical informatics, 10 (1), 83-100, 2001.
  • 3. Karcioglu A.A., Bulut H., Improving hash-q exact string-matching algorithm with perfect hashing for DNA sequences, Computers in Biology and Medicine,131, 104292, 2021.
  • 4. Karcıoğlu A.A, Bulut H., Q-gram hash comparison based multiple exact string matching algorithm for DNA sequences, Journal of the Faculty of Engineering and Architecture of Gazi University, 38 (2), 2023.
  • 5. Karcioglu A.A., Bulut H., The WM-q multiple exact string matching algorithm for DNA sequences, Computers in Biology and Medicine, 136, 104656, 2021.
  • 6. Karcioglu A.A., Bulut H., q-frame hash comparison based exact string matching algorithms for DNA sequences, Concurrency and Computation: Practice and Experience, 34 (9), 2022.
  • 7. Botta M., Negro G., Multiple sequence alignment with genetic algorithms, Computational Intelligence Methods for Bioinformatics and Biostatistics, Springer, Berlin, Germany, 206-214, 2009.
  • 8. Haque W., Aravind A., Reddy B., Pairwise sequence alignment algorithms: a survey, Information Science, Technology and Applications Conference, 96-103, 2009.
  • 9. Lee Z.J., Su S.F., Chuang C.C., Liu K.H., Genetic algorithm with ant colony optimization (GA-ACO) for multiple sequence alignment, Applied Soft Computing, 8 (1), 55-78, 2008.
  • 10. Zhu X., Li K., Salah A., A data parallel strategy for aligning multiple biological sequences on multi-core computers, Computers in biology and medicine, 43 (4), 350-361, 2013.
  • 11. Liu X., Yang X., Wang C., Yao Y., Dai Q., Number of distinct sequence alignments with k-match and match sections, Computers in biology and medicine, 63, 287-292, 2015.
  • 12. Naorem L. D., Sharma N., Raghava G. P., A web server for predicting and scanning of IL-5 inducing peptides using alignment-free and alignment-based method,Computers in Biology and Medicine, 158, 106864, 2023.
  • 13. Pais F.S.M., Ruy P.D.C., Oliveira G., Coimbra R.S., Assessing the efficiency of multiple sequence alignment programs, Algorithms for Molecular Biology, 9 (1), 4, 2014.
  • 14. Aktan M.N., Bulut H., Metaheuristic task scheduling algorithms for cloud computing environments, Concurrency and Computation: Practice and Experience, 34 (9), 2022.
  • 15. Çavga S. H., Performance of neural networks and heuristic models for disease prediction from liver enzymes: Application to biochemistry device output, Journal of the Faculty of Engineering and Architecture of Gazi University, 39 (4), 2263-2270, 2024
  • 16. Kaya F., Conker Ç., Hybrid input shaper design and genetic algorithm-based multi-objective optimization for elimination of residual vibrations at specific frequencies in flexible systems, Journal of the Faculty of Engineering and Architecture of Gazi University (Advanced Online Publication), 2541-2552, 2025.
  • 17. Arıkan M., Hybrid simulated annealing–tabu search algorithms for solving U-shaped type-2 assembly line balancing problems with workload smoothing objective, Journal of the Faculty of Engineering and Architecture of Gazi University, 39 (3), 1457–1472, 2024.
  • 18. Yalcin A., Deliktas D., Genetic algorithm based on weighted goal programming for doctor rostering problem, Journal of the Faculty of Engıneerıng and Archıtecture of Gazı Unıversıty, 39 (4), 2567-2585, 2024.
  • 19. Erdirik H., Karcıoğlu A. A., Tanyolaç B., Bulut H., Meta-Sezgisel Tabanlı Clustal-SA Algoritmasını Kullanarak DNA Sekanslarında Çoklu Dizi Hizalama, Journal of the Institute of Science and Technology, 14 (2), 544-562, 2024.
  • 20. Bucak, İ. Ö., Uslan, V., Sequence alignment from the perspective of stochastic optimization: a Survey, Turkish Journal of Electrical Engineering and Computer Sciences, 19 (1), 157-173, 2011.
  • 21. Nalbantoğlu O.U., Dynamic Programming, Multiple Sequence Alignment Methods, Methods in Molecular Biology, 1079, Russell D.J., Springer, 3–27, 2014.
  • 22. Karadimitriou K., Kraft D.H., Genetic algorithms and the multiple sequence alignment problem in biology, Second Annual Molecular Biology and Biotechnology Conference, Baton Rouge, 1–7, 1996.
  • 23. Onwubolu G., Davendra D., Scheduling flow shops using differential evolution algorithm, European Journal of Operational Research, 171 (2), 674-692, 2006.
  • 24. Major Differences. Difference between Global and Local Sequence Alignment. https://www.majordifferences.com/2016/05/difference-between-global-and-local.html. Erişim tarihi: 28.12.2024.
  • 25. Likic V., The Needleman–Wunsch Algorithm for Sequence Alignment, Lecture Notes, 7th Melbourne Bioinformatics Course, Molecular Science and Biotechnology Institute, University of Melbourne, 1–46, 2008.
  • 26. Needleman S. B., Wunsch C. D., A general method applicable to the search for similarities in the amino acid sequence of two proteins, Journal of Molecular Biology, 48 (3), 443–453, 1970.
  • 27. Diamantis S., Charissi A., Comparison of Multiple Sequence Alignment Programs, MSc Bioinformatics Report, National and Kapodistrian University of Athens, 2005.
  • 28. Smith T.F., Waterman M.S., Identification of common molecular subsequences, Journal of Molecular Biology, 147 (1), 195–197, 1981.
  • 29. Thompson J.D., Higgins D.G., Gibson T.J., CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice, Nucleic Acids Research, 22 (22), 4673–4680, 1994.
  • 30. Doğan H., Otu H.H., Objective Functions, Multiple Sequence Alignment Methods, Methods in Molecular Biology, Cilt 1079, Editör: Russell D.J., Springer, 45–58, 2014.
  • 31. Sastry K., Goldberg D., Kendall G., Genetic Algorithms, Search Methodologies: Introductory Tutorials in Optimization and Decision Support Techniques, Editör: Burke E.K., Kendall G., Springer, 97–125, 2005.
  • 32. Mirjalili S., Evolutionary Algorithms and Neural Networks, Studies in Computational Intelligence, Springer, Berlin, 780, 2019.
  • 33. Karaboğa D., Ökdem S., A simple and global optimization algorithm for engineering problems: differential evolution algorithm, Turkish Journal of Electrical Engineering and Computer Sciences, 12 (1), 53–60, 2004.
  • 34. Aarts E.H., Van Laarhoven P.J., Simulated annealing: a pedestrian review of the theory and some applications, Pattern Recognition Theory and Applications, Springer, Berlin, 179–192, 1987. 35. Edgar R.C., Batzoglou S., Multiple sequence alignment, Current Opinion in Structural Biology, 16 (3), 368–373, 2006.

Comparison of multiple sequence alignment approaches based on heuristic methods for DNA sequences

Yıl 2026, Cilt: 41 Sayı: 1 , 479 - 494 , 31.03.2026
https://doi.org/10.17341/gazimmfd.1610635
https://izlik.org/JA88UP94XJ

Öz

Bioinformatics utilizes mathematics, computing, and statistics to analyze biological data and establish relationships among them. As biological data volumes increase, the sequence alignment problem becomes more complex, rendering manual solutions impractical and necessitating automated computational systems. Sequence alignment is categorized into pairwise and multiple sequence alignment. This study addresses the multiple sequence alignment problem using meta-heuristic approaches, specifically genetic algorithms (GA), differential evolution (DE), and simulated annealing (SA). Four algorithmic variations—GA, DE, GASA, and DESA—are proposed, reordering DNA sequences with the Needleman-Wunsch algorithm to produce diverse alignments. Experimental results show that GA, DE, and DESA achieved higher alignment scores than GASA, with GA exhibiting the fastest runtime, while DE and DESA had longer runtimes than both GA and GASA. Compared to the Clustal algorithm, the proposed algorithms generally achieved better alignment scores across most datasets, though Clustal demonstrated the fastest runtime. Specifically, for four datasets (trnN-GUU_rps12, rrn4.5_rps12, rrn5_rps12, psbT_pbf1), the proposed algorithms outperformed Clustal in alignment scores. Additionally, the proposed methods proved more effective on datasets with sequences of equal length compared to existing alignment algorithms. These findings highlight the potential of the proposed algorithms in improving multiple sequence alignment outcomes.

Destekleyen Kurum

Ege University Scientific Research Projects Coordinatorship

Proje Numarası

FOA-2020-20981

Teşekkür

This study was financially supported by Ege University Scientific Research Projects Coordination Office under the project titled "Sequencing of Chloroplast DNA Sequences of Cicer and Lens Species by Next Generation Sequencing and Conducting Comparative Genome Analyses by Determining their Genome Organization" with the project number "FO A-2020-20981".

Kaynakça

  • 1. Cohen, J., Bioinformatics-an introduction for computer scientists, ACM Comput. Surv. ,36 (2), 122-158, 2004.
  • 2. Luscombe N.M., Greenbaum D., Gerstein M., What is bioinformatics? An introduction and overview, Yearbook of medical informatics, 10 (1), 83-100, 2001.
  • 3. Karcioglu A.A., Bulut H., Improving hash-q exact string-matching algorithm with perfect hashing for DNA sequences, Computers in Biology and Medicine,131, 104292, 2021.
  • 4. Karcıoğlu A.A, Bulut H., Q-gram hash comparison based multiple exact string matching algorithm for DNA sequences, Journal of the Faculty of Engineering and Architecture of Gazi University, 38 (2), 2023.
  • 5. Karcioglu A.A., Bulut H., The WM-q multiple exact string matching algorithm for DNA sequences, Computers in Biology and Medicine, 136, 104656, 2021.
  • 6. Karcioglu A.A., Bulut H., q-frame hash comparison based exact string matching algorithms for DNA sequences, Concurrency and Computation: Practice and Experience, 34 (9), 2022.
  • 7. Botta M., Negro G., Multiple sequence alignment with genetic algorithms, Computational Intelligence Methods for Bioinformatics and Biostatistics, Springer, Berlin, Germany, 206-214, 2009.
  • 8. Haque W., Aravind A., Reddy B., Pairwise sequence alignment algorithms: a survey, Information Science, Technology and Applications Conference, 96-103, 2009.
  • 9. Lee Z.J., Su S.F., Chuang C.C., Liu K.H., Genetic algorithm with ant colony optimization (GA-ACO) for multiple sequence alignment, Applied Soft Computing, 8 (1), 55-78, 2008.
  • 10. Zhu X., Li K., Salah A., A data parallel strategy for aligning multiple biological sequences on multi-core computers, Computers in biology and medicine, 43 (4), 350-361, 2013.
  • 11. Liu X., Yang X., Wang C., Yao Y., Dai Q., Number of distinct sequence alignments with k-match and match sections, Computers in biology and medicine, 63, 287-292, 2015.
  • 12. Naorem L. D., Sharma N., Raghava G. P., A web server for predicting and scanning of IL-5 inducing peptides using alignment-free and alignment-based method,Computers in Biology and Medicine, 158, 106864, 2023.
  • 13. Pais F.S.M., Ruy P.D.C., Oliveira G., Coimbra R.S., Assessing the efficiency of multiple sequence alignment programs, Algorithms for Molecular Biology, 9 (1), 4, 2014.
  • 14. Aktan M.N., Bulut H., Metaheuristic task scheduling algorithms for cloud computing environments, Concurrency and Computation: Practice and Experience, 34 (9), 2022.
  • 15. Çavga S. H., Performance of neural networks and heuristic models for disease prediction from liver enzymes: Application to biochemistry device output, Journal of the Faculty of Engineering and Architecture of Gazi University, 39 (4), 2263-2270, 2024
  • 16. Kaya F., Conker Ç., Hybrid input shaper design and genetic algorithm-based multi-objective optimization for elimination of residual vibrations at specific frequencies in flexible systems, Journal of the Faculty of Engineering and Architecture of Gazi University (Advanced Online Publication), 2541-2552, 2025.
  • 17. Arıkan M., Hybrid simulated annealing–tabu search algorithms for solving U-shaped type-2 assembly line balancing problems with workload smoothing objective, Journal of the Faculty of Engineering and Architecture of Gazi University, 39 (3), 1457–1472, 2024.
  • 18. Yalcin A., Deliktas D., Genetic algorithm based on weighted goal programming for doctor rostering problem, Journal of the Faculty of Engıneerıng and Archıtecture of Gazı Unıversıty, 39 (4), 2567-2585, 2024.
  • 19. Erdirik H., Karcıoğlu A. A., Tanyolaç B., Bulut H., Meta-Sezgisel Tabanlı Clustal-SA Algoritmasını Kullanarak DNA Sekanslarında Çoklu Dizi Hizalama, Journal of the Institute of Science and Technology, 14 (2), 544-562, 2024.
  • 20. Bucak, İ. Ö., Uslan, V., Sequence alignment from the perspective of stochastic optimization: a Survey, Turkish Journal of Electrical Engineering and Computer Sciences, 19 (1), 157-173, 2011.
  • 21. Nalbantoğlu O.U., Dynamic Programming, Multiple Sequence Alignment Methods, Methods in Molecular Biology, 1079, Russell D.J., Springer, 3–27, 2014.
  • 22. Karadimitriou K., Kraft D.H., Genetic algorithms and the multiple sequence alignment problem in biology, Second Annual Molecular Biology and Biotechnology Conference, Baton Rouge, 1–7, 1996.
  • 23. Onwubolu G., Davendra D., Scheduling flow shops using differential evolution algorithm, European Journal of Operational Research, 171 (2), 674-692, 2006.
  • 24. Major Differences. Difference between Global and Local Sequence Alignment. https://www.majordifferences.com/2016/05/difference-between-global-and-local.html. Erişim tarihi: 28.12.2024.
  • 25. Likic V., The Needleman–Wunsch Algorithm for Sequence Alignment, Lecture Notes, 7th Melbourne Bioinformatics Course, Molecular Science and Biotechnology Institute, University of Melbourne, 1–46, 2008.
  • 26. Needleman S. B., Wunsch C. D., A general method applicable to the search for similarities in the amino acid sequence of two proteins, Journal of Molecular Biology, 48 (3), 443–453, 1970.
  • 27. Diamantis S., Charissi A., Comparison of Multiple Sequence Alignment Programs, MSc Bioinformatics Report, National and Kapodistrian University of Athens, 2005.
  • 28. Smith T.F., Waterman M.S., Identification of common molecular subsequences, Journal of Molecular Biology, 147 (1), 195–197, 1981.
  • 29. Thompson J.D., Higgins D.G., Gibson T.J., CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice, Nucleic Acids Research, 22 (22), 4673–4680, 1994.
  • 30. Doğan H., Otu H.H., Objective Functions, Multiple Sequence Alignment Methods, Methods in Molecular Biology, Cilt 1079, Editör: Russell D.J., Springer, 45–58, 2014.
  • 31. Sastry K., Goldberg D., Kendall G., Genetic Algorithms, Search Methodologies: Introductory Tutorials in Optimization and Decision Support Techniques, Editör: Burke E.K., Kendall G., Springer, 97–125, 2005.
  • 32. Mirjalili S., Evolutionary Algorithms and Neural Networks, Studies in Computational Intelligence, Springer, Berlin, 780, 2019.
  • 33. Karaboğa D., Ökdem S., A simple and global optimization algorithm for engineering problems: differential evolution algorithm, Turkish Journal of Electrical Engineering and Computer Sciences, 12 (1), 53–60, 2004.
  • 34. Aarts E.H., Van Laarhoven P.J., Simulated annealing: a pedestrian review of the theory and some applications, Pattern Recognition Theory and Applications, Springer, Berlin, 179–192, 1987. 35. Edgar R.C., Batzoglou S., Multiple sequence alignment, Current Opinion in Structural Biology, 16 (3), 368–373, 2006.
Toplam 34 adet kaynakça vardır.

Ayrıntılar

Birincil Dil Türkçe
Konular Veri Yapıları ve Algoritmalar, İnformetrik
Bölüm Araştırma Makalesi
Yazarlar

Hatice Erdirik 0000-0002-5816-4804

Abdullah Ammar Karcıoğlu 0000-0002-0907-751X

Bahattin Tanyolaç 0000-0002-4368-0988

Hasan Bulut 0000-0002-4872-5698

Proje Numarası FOA-2020-20981
Gönderilme Tarihi 31 Aralık 2024
Kabul Tarihi 6 Ocak 2026
Yayımlanma Tarihi 31 Mart 2026
DOI https://doi.org/10.17341/gazimmfd.1610635
IZ https://izlik.org/JA88UP94XJ
Yayımlandığı Sayı Yıl 2026 Cilt: 41 Sayı: 1

Kaynak Göster

APA Erdirik, H., Karcıoğlu, A. A., Tanyolaç, B., & Bulut, H. (2026). DNA dizileri için sezgisel yöntemlere dayalı çoklu dizi hizalama yaklaşımlarının karşılaştırılması. Gazi Üniversitesi Mühendislik Mimarlık Fakültesi Dergisi, 41(1), 479-494. https://doi.org/10.17341/gazimmfd.1610635
AMA 1.Erdirik H, Karcıoğlu AA, Tanyolaç B, Bulut H. DNA dizileri için sezgisel yöntemlere dayalı çoklu dizi hizalama yaklaşımlarının karşılaştırılması. GUMMFD. 2026;41(1):479-494. doi:10.17341/gazimmfd.1610635
Chicago Erdirik, Hatice, Abdullah Ammar Karcıoğlu, Bahattin Tanyolaç, ve Hasan Bulut. 2026. “DNA dizileri için sezgisel yöntemlere dayalı çoklu dizi hizalama yaklaşımlarının karşılaştırılması”. Gazi Üniversitesi Mühendislik Mimarlık Fakültesi Dergisi 41 (1): 479-94. https://doi.org/10.17341/gazimmfd.1610635.
EndNote Erdirik H, Karcıoğlu AA, Tanyolaç B, Bulut H (01 Mart 2026) DNA dizileri için sezgisel yöntemlere dayalı çoklu dizi hizalama yaklaşımlarının karşılaştırılması. Gazi Üniversitesi Mühendislik Mimarlık Fakültesi Dergisi 41 1 479–494.
IEEE [1]H. Erdirik, A. A. Karcıoğlu, B. Tanyolaç, ve H. Bulut, “DNA dizileri için sezgisel yöntemlere dayalı çoklu dizi hizalama yaklaşımlarının karşılaştırılması”, GUMMFD, c. 41, sy 1, ss. 479–494, Mar. 2026, doi: 10.17341/gazimmfd.1610635.
ISNAD Erdirik, Hatice - Karcıoğlu, Abdullah Ammar - Tanyolaç, Bahattin - Bulut, Hasan. “DNA dizileri için sezgisel yöntemlere dayalı çoklu dizi hizalama yaklaşımlarının karşılaştırılması”. Gazi Üniversitesi Mühendislik Mimarlık Fakültesi Dergisi 41/1 (01 Mart 2026): 479-494. https://doi.org/10.17341/gazimmfd.1610635.
JAMA 1.Erdirik H, Karcıoğlu AA, Tanyolaç B, Bulut H. DNA dizileri için sezgisel yöntemlere dayalı çoklu dizi hizalama yaklaşımlarının karşılaştırılması. GUMMFD. 2026;41:479–494.
MLA Erdirik, Hatice, vd. “DNA dizileri için sezgisel yöntemlere dayalı çoklu dizi hizalama yaklaşımlarının karşılaştırılması”. Gazi Üniversitesi Mühendislik Mimarlık Fakültesi Dergisi, c. 41, sy 1, Mart 2026, ss. 479-94, doi:10.17341/gazimmfd.1610635.
Vancouver 1.Hatice Erdirik, Abdullah Ammar Karcıoğlu, Bahattin Tanyolaç, Hasan Bulut. DNA dizileri için sezgisel yöntemlere dayalı çoklu dizi hizalama yaklaşımlarının karşılaştırılması. GUMMFD. 01 Mart 2026;41(1):479-94. doi:10.17341/gazimmfd.1610635