Research Article

Disease prognosis using machine learning algorithms based on new clinical dataset

Volume: 65 Number: 1 June 3, 2023
EN

Disease prognosis using machine learning algorithms based on new clinical dataset

Abstract

Today, artificial intelligence-based solutions are produced to facilitate human life in almost every field. The healthcare sector is one of the sectors which took advantage of these solutions. Due to reasons such as the world’s ever-expanding population, ongoing epidemics, and the emergence of new disease types, it is becoming increasingly difficult for a patient to benefit from health services quickly and to make an accurate diagnosis. At this juncture, artificial intelligence reduces the patient density in hospitals, enables patients to access accurate information, and allows medical students to practice by seeing new cases. In this study, a new and reliable dataset was created with disease information obtained from various sources under the supervision of a specialist medical doctor. Then, new patient histories were added to the dataset used in the previous study, the experiments were repeated with the same algorithms, and the accuracy score comparison was presented. The created dataset includes 2006 unique patient histories, 358 symptoms, and 141 diseases and we think it will be a valuable dataset for researchers who make developments using machine learning in the field of healthcare. Various machine learning algorithms have been used in the training process to predict diseases belonging to different branches of medicine, such as diabetes, bronchial asthma, and covid. Besides, Support Vector Machine, Naive Bayes, K-Nearest Neighbors, Multilayer Perceptron, Decision Tree, and Random Forest algorithms, we also studied popular boosting algorithms such as XGBoost and LightGBM. All algorithms were validated with cross-validation and performance comparisons were made with different performance metrics such as accuracy, precision, recall, and f1-score. It is also the first study to achieve an accuracy score of 99.33% with a dataset that involves a greater number of diseases than the datasets used in the studies examined.

Keywords

References

  1. Our World in Data, (2022). Available: https://ourworldindata.org/births-and-deaths/. [Accessed: December 2022].
  2. Cantalay, P. J., Uçan, O. N., Zontul, M., Diagnosis of breast cancer from X-ray images using deep learning methods, J. Ponte, 77 (6), (2021), https://doi.org/10.21506/j.ponte.2021.6.1.
  3. Wang, Y., Yang, F., Zhang, J., Yue, X., Liu, S., Application of artificial intelligence based on deep learning in breast cancer screening and imaging diagnosis, Neural Comput. Applic., (2021), 9637–9647, https://doi.org/10.1007/s00521-021-05728-x.
  4. Mobark, N., Hamad, S., Rida, S. Z., CoroNet: Deep neural network-based end-to-end training for breast cancer diagnosis, Appl. Sci., 12 (14), (2022), 7080, https://doi.org/10.3390/app12147080.
  5. Manishkumar, S. H. and Saranya, P., Detection and classification of breast cancer from mammogram images using adaptive deep learning technique, 2022 6th Int'l Conf. on Dev., Circ. Syst., (2022), 327-331, https://doi.org/10.1109/ICDCS54290.2022.9780770.
  6. Reddy, K. V. V., Elamvazuthi, I., Aziz, A. A., Paramasivam, S., Chua, H. N., Pranavanand, S., Heart disease risk prediction using machine learning classifiers with attribute evaluators, Appl. Sci., 11 (18) (2021), 8352, https://doi.org/10.3390/app11188352.
  7. Bharti, R., Khamparia, A., Shabaz, M., Dhiman, G., Pande, S., Singh, P., Prediction of heart disease using a combination of machine learning and deep learning, Comp. Intell. Neurosci., (2021), 8387680, https://doi.org/10.1155/2021/8387680.
  8. Mehmood, A., Iqbal, M., Mehmood, Z. et al., Prediction of heart disease using deep convolutional neural networks, Arab. J. for Sci. Eng., 46 (2021), 3409–3422, https://doi.org/10.1007/s13369-020-05105-1.

Details

Primary Language

English

Subjects

Engineering

Journal Section

Research Article

Early Pub Date

May 17, 2023

Publication Date

June 3, 2023

Submission Date

December 9, 2022

Acceptance Date

April 11, 2023

Published in Issue

Year 2023 Volume: 65 Number: 1

APA
Çolak, M., Tümer Sivri, T., Pervan Akman, N., Berkol, A., & Ekici, Y. (2023). Disease prognosis using machine learning algorithms based on new clinical dataset. Communications Faculty of Sciences University of Ankara Series A2-A3 Physical Sciences and Engineering, 65(1), 52-68. https://doi.org/10.33769/aupse.1215962
AMA
1.Çolak M, Tümer Sivri T, Pervan Akman N, Berkol A, Ekici Y. Disease prognosis using machine learning algorithms based on new clinical dataset. Commun.Fac.Sci.Univ.Ank.Series A2-A3: Phys.Sci. and Eng. 2023;65(1):52-68. doi:10.33769/aupse.1215962
Chicago
Çolak, Melike, Talya Tümer Sivri, Nergis Pervan Akman, Ali Berkol, and Yahya Ekici. 2023. “Disease Prognosis Using Machine Learning Algorithms Based on New Clinical Dataset”. Communications Faculty of Sciences University of Ankara Series A2-A3 Physical Sciences and Engineering 65 (1): 52-68. https://doi.org/10.33769/aupse.1215962.
EndNote
Çolak M, Tümer Sivri T, Pervan Akman N, Berkol A, Ekici Y (June 1, 2023) Disease prognosis using machine learning algorithms based on new clinical dataset. Communications Faculty of Sciences University of Ankara Series A2-A3 Physical Sciences and Engineering 65 1 52–68.
IEEE
[1]M. Çolak, T. Tümer Sivri, N. Pervan Akman, A. Berkol, and Y. Ekici, “Disease prognosis using machine learning algorithms based on new clinical dataset”, Commun.Fac.Sci.Univ.Ank.Series A2-A3: Phys.Sci. and Eng., vol. 65, no. 1, pp. 52–68, June 2023, doi: 10.33769/aupse.1215962.
ISNAD
Çolak, Melike - Tümer Sivri, Talya - Pervan Akman, Nergis - Berkol, Ali - Ekici, Yahya. “Disease Prognosis Using Machine Learning Algorithms Based on New Clinical Dataset”. Communications Faculty of Sciences University of Ankara Series A2-A3 Physical Sciences and Engineering 65/1 (June 1, 2023): 52-68. https://doi.org/10.33769/aupse.1215962.
JAMA
1.Çolak M, Tümer Sivri T, Pervan Akman N, Berkol A, Ekici Y. Disease prognosis using machine learning algorithms based on new clinical dataset. Commun.Fac.Sci.Univ.Ank.Series A2-A3: Phys.Sci. and Eng. 2023;65:52–68.
MLA
Çolak, Melike, et al. “Disease Prognosis Using Machine Learning Algorithms Based on New Clinical Dataset”. Communications Faculty of Sciences University of Ankara Series A2-A3 Physical Sciences and Engineering, vol. 65, no. 1, June 2023, pp. 52-68, doi:10.33769/aupse.1215962.
Vancouver
1.Melike Çolak, Talya Tümer Sivri, Nergis Pervan Akman, Ali Berkol, Yahya Ekici. Disease prognosis using machine learning algorithms based on new clinical dataset. Commun.Fac.Sci.Univ.Ank.Series A2-A3: Phys.Sci. and Eng. 2023 Jun. 1;65(1):52-68. doi:10.33769/aupse.1215962

Cited By

Communications Faculty of Sciences University of Ankara Series A2-A3 Physical Sciences and Engineering licensed under a Creative Commons Attribution 4.0 International License.

Creative Commons License