Research Article

An Empirical Comparison of Machine Learning Algorithms for Predicting Breast Cancer

Volume: 3 Number: 0 December 31, 2019
EN

An Empirical Comparison of Machine Learning Algorithms for Predicting Breast Cancer

Abstract

According to recent statistics, breast cancer is one of the most prevalent cancers among women in the world. It represents the majority of new cancer cases and cancer-related deaths. Early diagnosis is very important, as it becomes fatal unless detected and treated in early stages. With the latest advances in artificial intelligence and machine learning (ML), there is a great potential to diagnose breast cancer by using structured data. In this paper, we conduct an empirical comparison of 10 popular machine learning models for the prediction of breast cancer. We used well known Wisconsin Breast Cancer Dataset (WBCD) to train the models and employed advanced accuracy metrics for comparison. Experimental results show that all models demonstrate superior accuracy, while Support Vector Machines (SVM) had slightly better performance than other methods. Logistic Regression, K-Nearest Neighbors and Neural Networks also proved to be strong classifiers for predicting breast cancer.

Keywords

References

  1. Agarap, A. F. M. (2018). On breast cancer detection: an application of machine learning algorithms on the wisconsin diagnostic dataset. Paper presented at the Proceedings of the 2nd International Conference on Machine Learning and Soft Computing.
  2. Akay, M. F. (2009). Support vector machines combined with feature selection for breast cancer diagnosis. Expert Systems with Applications, 36(2), 3240-3247. American Cancer Society. (2018). "Cancer Facts & Figures 2018". Atlanta, American Cancer Society.
  3. Boser, B. E., Guyon, I. M., & Vapnik, V. N. (1992). A training algorithm for optimal margin classifiers. Paper presented at the Proceedings of the fifth annual workshop on Computational learning theory.
  4. Breiman, L. (2001). Random forests. Machine learning, 45(1), 5-32.
  5. Breiman, L., Friedman, J., Olshen, R., & Stone, C. (1984). Classification and regression trees. Wadsworth Int. Group, 37(15), 237-251.
  6. Chen, T., & Guestrin, C. (2016). Xgboost: A scalable tree boosting system. Paper presented at the Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining.
  7. Chen, T., He, T., Benesty, M., Khotilovich, V., & Tang, Y. (2015). Xgboost: extreme gradient boosting. R package version 0.4-2, 1-4.
  8. Clark, P., & Niblett, T. (1989). The CN2 induction algorithm. Machine learning, 3(4), 261-283.

Details

Primary Language

English

Subjects

Engineering

Journal Section

Research Article

Authors

Hamit Taner Ünal This is me

Publication Date

December 31, 2019

Submission Date

November 11, 2019

Acceptance Date

December 26, 2019

Published in Issue

Year 2019 Volume: 3 Number: 0

APA
Basciftci, F., & Ünal, H. T. (2019). An Empirical Comparison of Machine Learning Algorithms for Predicting Breast Cancer. Bilge International Journal of Science and Technology Research, 3, 9-20. https://doi.org/10.30516/bilgesci.645067
AMA
1.Basciftci F, Ünal HT. An Empirical Comparison of Machine Learning Algorithms for Predicting Breast Cancer. bilgesci. 2019;3:9-20. doi:10.30516/bilgesci.645067
Chicago
Basciftci, Fatih, and Hamit Taner Ünal. 2019. “An Empirical Comparison of Machine Learning Algorithms for Predicting Breast Cancer”. Bilge International Journal of Science and Technology Research 3 (December): 9-20. https://doi.org/10.30516/bilgesci.645067.
EndNote
Basciftci F, Ünal HT (December 1, 2019) An Empirical Comparison of Machine Learning Algorithms for Predicting Breast Cancer. Bilge International Journal of Science and Technology Research 3 9–20.
IEEE
[1]F. Basciftci and H. T. Ünal, “An Empirical Comparison of Machine Learning Algorithms for Predicting Breast Cancer”, bilgesci, vol. 3, pp. 9–20, Dec. 2019, doi: 10.30516/bilgesci.645067.
ISNAD
Basciftci, Fatih - Ünal, Hamit Taner. “An Empirical Comparison of Machine Learning Algorithms for Predicting Breast Cancer”. Bilge International Journal of Science and Technology Research 3 (December 1, 2019): 9-20. https://doi.org/10.30516/bilgesci.645067.
JAMA
1.Basciftci F, Ünal HT. An Empirical Comparison of Machine Learning Algorithms for Predicting Breast Cancer. bilgesci. 2019;3:9–20.
MLA
Basciftci, Fatih, and Hamit Taner Ünal. “An Empirical Comparison of Machine Learning Algorithms for Predicting Breast Cancer”. Bilge International Journal of Science and Technology Research, vol. 3, Dec. 2019, pp. 9-20, doi:10.30516/bilgesci.645067.
Vancouver
1.Fatih Basciftci, Hamit Taner Ünal. An Empirical Comparison of Machine Learning Algorithms for Predicting Breast Cancer. bilgesci. 2019 Dec. 1;3:9-20. doi:10.30516/bilgesci.645067

Cited By