EN
TR
A Hybrid Metaheuristic based Feature Selection Framework for In-silico Mutagenicity Prediction
Öz
Mutagenicity is both a toxic risk to humans and an indicator of carcinogenicity. Hence, estimating mutagenicity in the early stages of drug design is crucial to minimize last-stage failures and withdrawals in drug discovery. Recently, in-silico methods have started to play critical and essential roles in the drug development process because they are low cost and low effort procedures. This study aims to predict mutagenicity of chemicals using in-silico methods. To achieve this goal, a two-phased flexible framework was proposed: 1) searching the effective and representative descriptors subset with Butterfly Optimization Algorithm (BOA) and Particle Swarm Optimization and 2) predicting mutagenicity of chemicals by the selected descriptor using gradient boosted tree-based ensemble methods. The study used two datasets: one including 8167 compounds for descriptor selection and modelling, and another containing 716 external compounds to validate the efficacy of our models. The datasets comprise 162 descriptors calculated using PaDEL. The results of both the cross-validation and the external data showed that descriptors reduced by nearly one-third by BOA (51 descriptors) yielded similar or slightly better predictive results than results obtained with the entire data set. The accuracy range attained by the proposed approach using BOA is approximately 91.9% to 97.91% for the external set and 83.35% to 86.47% for the test set. This research contributes that using optimization techniques for improving early drug design and minimizing risks in drug discovery can be considered as a valuable insights and advances in the field of drug toxicity prediction, based on the findings.
Anahtar Kelimeler
Kaynakça
- Algamal, ZY, Qasim, MK, Lee, MH and Ali, HTM. 2020. High-dimensional QSAR/QSPR classification modelling based on improving pigeon optimization algorithm. Chemom. Intell. Lab. Syst, 206:104170, doi:10.1016/ j.chemolab.2020.104170.
- Arora, S, Singh, S. 2019. Butterfly optimization algorithm: a novel approach for global optimization. Soft Comput. 23, 715–34 doi:10.1007/ s00500-018-3102-4.
- Bakhtyari, N, Raitano, G, Benfenati, E, Martin, T and Young, D. 2013. Comparison of in silico models for prediction of mutagenicity. Carcinog. Ecotoxicol. Rev, 31(1):45–66, doi:10.1080/10590501.2013.763576
- Breiman, L. 2001. Random forests. Mach. Learn, 45:5–32. doi:10.1023/ A:1010933404324.
- Çakmak Pehlivanlı, A. and Çakmak, G. 2022. Genotoksik etkiyi belirlemeye yönelik in-silico yaklaşımlar. In Genetik Toksikoloji (Genetic Toxicology), ed. F. Ünal and D. Yüzbaşıoğlu, 475–92. Ankara: Nobel.
- Cariello, NF, Wilson, JD, Britt, BH, Wedd, DJ, Burlinson, B and Gombar, V. 2002. Comparison of the computer programs DEREK and TOPKAT to predict bacterial mutagenicity. Mutagenesis 17(4):321-9, doi:10.1093/mutage/17.4.321.
- Chen, T and Guestrin, C. 2016. XGBoost. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 785–94, doi:10.1145/2939672.2939785.
- Chu, CSM, Simpson, JD, O’Neill, PM and Berry, NG. 2021. Machine learning predicting Ames mutagenicity of small molecules. Journal of Molecular Graphics and Modelling, 109. doi:10.1016/j.jmgm.2021.108011.
Ayrıntılar
Birincil Dil
İngilizce
Konular
Makine Öğrenme (Diğer), Veri Yönetimi ve Veri Bilimi (Diğer)
Bölüm
Araştırma Makalesi
Yayımlanma Tarihi
26 Eylül 2024
Gönderilme Tarihi
14 Nisan 2024
Kabul Tarihi
20 Ağustos 2024
Yayımlandığı Sayı
Yıl 2024 Cilt: 7 Sayı: 2
APA
Yılmaz, Ö., Gumustas, E., & Çakmak Pehlivanlı, A. (2024). A Hybrid Metaheuristic based Feature Selection Framework for In-silico Mutagenicity Prediction. Journal of Intelligent Systems: Theory and Applications, 7(2), 116-128. https://doi.org/10.38016/jista.1468153
AMA
1.Yılmaz Ö, Gumustas E, Çakmak Pehlivanlı A. A Hybrid Metaheuristic based Feature Selection Framework for In-silico Mutagenicity Prediction. jista. 2024;7(2):116-128. doi:10.38016/jista.1468153
Chicago
Yılmaz, Özlem, Enis Gumustas, ve Ayça Çakmak Pehlivanlı. 2024. “A Hybrid Metaheuristic based Feature Selection Framework for In-silico Mutagenicity Prediction”. Journal of Intelligent Systems: Theory and Applications 7 (2): 116-28. https://doi.org/10.38016/jista.1468153.
EndNote
Yılmaz Ö, Gumustas E, Çakmak Pehlivanlı A (01 Eylül 2024) A Hybrid Metaheuristic based Feature Selection Framework for In-silico Mutagenicity Prediction. Journal of Intelligent Systems: Theory and Applications 7 2 116–128.
IEEE
[1]Ö. Yılmaz, E. Gumustas, ve A. Çakmak Pehlivanlı, “A Hybrid Metaheuristic based Feature Selection Framework for In-silico Mutagenicity Prediction”, jista, c. 7, sy 2, ss. 116–128, Eyl. 2024, doi: 10.38016/jista.1468153.
ISNAD
Yılmaz, Özlem - Gumustas, Enis - Çakmak Pehlivanlı, Ayça. “A Hybrid Metaheuristic based Feature Selection Framework for In-silico Mutagenicity Prediction”. Journal of Intelligent Systems: Theory and Applications 7/2 (01 Eylül 2024): 116-128. https://doi.org/10.38016/jista.1468153.
JAMA
1.Yılmaz Ö, Gumustas E, Çakmak Pehlivanlı A. A Hybrid Metaheuristic based Feature Selection Framework for In-silico Mutagenicity Prediction. jista. 2024;7:116–128.
MLA
Yılmaz, Özlem, vd. “A Hybrid Metaheuristic based Feature Selection Framework for In-silico Mutagenicity Prediction”. Journal of Intelligent Systems: Theory and Applications, c. 7, sy 2, Eylül 2024, ss. 116-28, doi:10.38016/jista.1468153.
Vancouver
1.Özlem Yılmaz, Enis Gumustas, Ayça Çakmak Pehlivanlı. A Hybrid Metaheuristic based Feature Selection Framework for In-silico Mutagenicity Prediction. jista. 01 Eylül 2024;7(2):116-28. doi:10.38016/jista.1468153
Cited By
Makine Öğrenmesi Algoritmalarının Performansını Artırmak için Metasezgisel Algoritma Tabanlı Öznitelik Seçimi
Fırat Üniversitesi Mühendislik Bilimleri Dergisi
https://doi.org/10.35234/fumbd.1632540