EN
TR
Improving Drug-Target Interaction Prediction Using SVM-SMOTE: A Solution for Imbalanced Dataset
Abstract
Drug–target interaction (DTI) prediction is a critical step in the drug discovery process, as experimental methods are often time-consuming and expensive. Machine learning techniques have emerged as effective alternatives for this task. However, DTI datasets commonly suffer from severe class imbalance, where the number of true interactions is significantly lower than negative ones—posing a serious challenge for model training. This study proposes an effective framework for DTI prediction. The model utilizes amino acid composition (AAC) and dipeptide composition (DPC) methods to extract protein features, while FP2 molecular fingerprints are used to represent drug features. To address the class imbalance problem, the SVM-SMOTE technique—an SVM-based synthetic minority oversampling method—is employed. For model training, a Linear Support Vector Machine (LSVM) algorithm is used. The proposed model was evaluated against several state-of-the-art methods using benchmark datasets, including Enzyme, GPCR, Ion Channel, and Nuclear Receptor. The results demonstrate that the proposed framework achieves superior performance. Extensive experiments were conducted at various stages of model design, using evaluation metrics such as AUC, accuracy, F1-score, and recall, all of which confirm the effectiveness of the proposed approach.
Keywords
References
- Abbasi, K., Razzaghi, P., Poso, A., Amanlou, M., Ghasemi, J. B., & Masoudi-Nejad, A. (2020). DeepCDA: deep cross-domain compound–protein affinity prediction through LSTM and convolutional neural networks. Bioinformatics, 36(17), 4633-4642.
- Ai, H., Zhang, L., Zhang, J., Cui, T., Chang, A. K., & Liu, H. (2018). Discrimination of thermophilic and mesophilic proteins using support vector machine and decision tree. Current Proteomics, 15(5), 374-383.
- Aljawazneh, H., Mora, A. M., García-Sánchez, P., & Castillo-Valdivieso, P. A. (2021). Comparing the performance of deep learning methods to predict companies’ financial failure. IEEE Access, 9, 97010-97038.
- Alpay, B. A., Gosink, M., & Aguiar, D. (2022). Evaluating molecular fingerprint-based models of drug side effects against a statistical control. Drug Discovery Today, 27(11), 103364.
- An, Q., & Yu, L. (2021). A heterogeneous network embedding framework for predicting similarity-based drug-target interactions. Briefings in bioinformatics, 22(6), bbab275.
- Atta Mills, E. F. E., Deng, Z., Zhong, Z., & Li, J. (2024). Data-driven prediction of soccer outcomes using enhanced machine and deep learning techniques. Journal of Big Data, 11(1), 170.
- Azlim Khan, A. K., & Ahamed Hassain Malim, N. H. (2023). Comparative studies on resampling techniques in machine learning and deep learning models for drug-target interaction prediction. Molecules, 28(4), 1663.
- Bagherian, M., Kim, R. B., Jiang, C., Sartor, M. A., Derksen, H., & Najarian, K. (2021). Coupled matrix–matrix and coupled tensor–matrix completion methods for predicting drug–target interactions. Briefings in bioinformatics, 22(2), 2161-2171.
Details
Primary Language
English
Subjects
Machine Learning (Other), Artificial Intelligence (Other)
Journal Section
Research Article
Authors
Publication Date
June 30, 2025
Submission Date
March 20, 2025
Acceptance Date
May 27, 2025
Published in Issue
Year 2025 Volume: 9 Number: 1
APA
Naghib Zadeh, S., Ecevit Satı, Z., & Ghanbari Sorkhi, A. (2025). Improving Drug-Target Interaction Prediction Using SVM-SMOTE: A Solution for Imbalanced Dataset. International Journal of Management Information Systems and Computer Science, 9(1), 10-28. https://doi.org/10.33461/uybisbbd.1661593
AMA
1.Naghib Zadeh S, Ecevit Satı Z, Ghanbari Sorkhi A. Improving Drug-Target Interaction Prediction Using SVM-SMOTE: A Solution for Imbalanced Dataset. UYBISBBD. 2025;9(1):10-28. doi:10.33461/uybisbbd.1661593
Chicago
Naghib Zadeh, Sara, Zümrüt Ecevit Satı, and Ali Ghanbari Sorkhi. 2025. “Improving Drug-Target Interaction Prediction Using SVM-SMOTE: A Solution for Imbalanced Dataset”. International Journal of Management Information Systems and Computer Science 9 (1): 10-28. https://doi.org/10.33461/uybisbbd.1661593.
EndNote
Naghib Zadeh S, Ecevit Satı Z, Ghanbari Sorkhi A (June 1, 2025) Improving Drug-Target Interaction Prediction Using SVM-SMOTE: A Solution for Imbalanced Dataset. International Journal of Management Information Systems and Computer Science 9 1 10–28.
IEEE
[1]S. Naghib Zadeh, Z. Ecevit Satı, and A. Ghanbari Sorkhi, “Improving Drug-Target Interaction Prediction Using SVM-SMOTE: A Solution for Imbalanced Dataset”, UYBISBBD, vol. 9, no. 1, pp. 10–28, June 2025, doi: 10.33461/uybisbbd.1661593.
ISNAD
Naghib Zadeh, Sara - Ecevit Satı, Zümrüt - Ghanbari Sorkhi, Ali. “Improving Drug-Target Interaction Prediction Using SVM-SMOTE: A Solution for Imbalanced Dataset”. International Journal of Management Information Systems and Computer Science 9/1 (June 1, 2025): 10-28. https://doi.org/10.33461/uybisbbd.1661593.
JAMA
1.Naghib Zadeh S, Ecevit Satı Z, Ghanbari Sorkhi A. Improving Drug-Target Interaction Prediction Using SVM-SMOTE: A Solution for Imbalanced Dataset. UYBISBBD. 2025;9:10–28.
MLA
Naghib Zadeh, Sara, et al. “Improving Drug-Target Interaction Prediction Using SVM-SMOTE: A Solution for Imbalanced Dataset”. International Journal of Management Information Systems and Computer Science, vol. 9, no. 1, June 2025, pp. 10-28, doi:10.33461/uybisbbd.1661593.
Vancouver
1.Sara Naghib Zadeh, Zümrüt Ecevit Satı, Ali Ghanbari Sorkhi. Improving Drug-Target Interaction Prediction Using SVM-SMOTE: A Solution for Imbalanced Dataset. UYBISBBD. 2025 Jun. 1;9(1):10-28. doi:10.33461/uybisbbd.1661593
