Research Article

Hybrid Analytic Method for Missing Data Imputation in Medical Big Data

Volume: 5 Number: 2 January 16, 2023
EN

Hybrid Analytic Method for Missing Data Imputation in Medical Big Data

Abstract

Compared to other traditional datasets, medical data has several hidden challenges. In fact, the possibility of missing values for certain attributes presents a great dispute for data mining researchers to make correct medical decisions. In this paper, a hybrid scheme combining the k-means method and regression analysis is proposed. A combination of these two analytical methods allows to find the best distributional model of numerical data in space and helps to predict missing data. Applied to medical data (diabetes dataset), the proposed model predicts the values with a minor error rate, which is considered very satisfactory.

Keywords

References

  1. [1] Nazari, Elham, Mohammad Hasan Shahriari, and Hamed Tabesh. "BigData analysis in healthcare: apache hadoop, apache spark and apache flink." Frontiers in Health Informatics 8.1 (2019): 14.
  2. [2] Palanisamy, Venketesh, and Ramkumar Thirunavukarasu. "Implications of big data analytics in developing healthcare frameworks–A review." Journal of King Saud University-Computer and Information Sciences 31.4 (2019): 415-425.
  3. [3] Kumar, Sunil, and Maninder Singh. "Big data analytics for healthcare industry: impact, applications, and tools." Big data mining and analytics 2.1 (2018): 48-57.
  4. [4] Bahri, Safa, et al. "Big data for healthcare: a survey." IEEE access 7 (2018): 7397-7408.
  5. [5] Bennett, Derrick A. "How can I deal with missing data in my study?." Australian and New Zealand journal of public health 25.5 (2001): 464-469.
  6. [6] Graham, John W. "Missing data: Analysis and design". Springer Science & Business Media, (2012).
  7. [7] Mack, Christina, Zhaohui Su, and Daniel Westreich. "Managing missing data in patient registries: addendum to registries for evaluating patient outcomes: a user’s guide." (2018).
  8. [8] Rubin, Donald B. "Inference and missing data." Biometrika 63.3 (1976): 581-592.

Details

Primary Language

English

Subjects

Software Engineering (Other)

Journal Section

Research Article

Authors

Mohamed Ilyes Bourahdoun This is me
Algeria

Bilel Boudjahem This is me
Algeria

Publication Date

January 16, 2023

Submission Date

May 18, 2022

Acceptance Date

October 31, 2022

Published in Issue

Year 2022 Volume: 5 Number: 2

APA
Benhamza, K., Benhamıda, N., Bourahdoun, M. I., & Boudjahem, B. (2023). Hybrid Analytic Method for Missing Data Imputation in Medical Big Data. International Journal of Informatics and Applied Mathematics, 5(2), 1-11. https://doi.org/10.53508/ijiam.1118198
AMA
1.Benhamza K, Benhamıda N, Bourahdoun MI, Boudjahem B. Hybrid Analytic Method for Missing Data Imputation in Medical Big Data. IJIAM. 2023;5(2):1-11. doi:10.53508/ijiam.1118198
Chicago
Benhamza, Karima, Nadjette Benhamıda, Mohamed Ilyes Bourahdoun, and Bilel Boudjahem. 2023. “Hybrid Analytic Method for Missing Data Imputation in Medical Big Data”. International Journal of Informatics and Applied Mathematics 5 (2): 1-11. https://doi.org/10.53508/ijiam.1118198.
EndNote
Benhamza K, Benhamıda N, Bourahdoun MI, Boudjahem B (January 1, 2023) Hybrid Analytic Method for Missing Data Imputation in Medical Big Data. International Journal of Informatics and Applied Mathematics 5 2 1–11.
IEEE
[1]K. Benhamza, N. Benhamıda, M. I. Bourahdoun, and B. Boudjahem, “Hybrid Analytic Method for Missing Data Imputation in Medical Big Data”, IJIAM, vol. 5, no. 2, pp. 1–11, Jan. 2023, doi: 10.53508/ijiam.1118198.
ISNAD
Benhamza, Karima - Benhamıda, Nadjette - Bourahdoun, Mohamed Ilyes - Boudjahem, Bilel. “Hybrid Analytic Method for Missing Data Imputation in Medical Big Data”. International Journal of Informatics and Applied Mathematics 5/2 (January 1, 2023): 1-11. https://doi.org/10.53508/ijiam.1118198.
JAMA
1.Benhamza K, Benhamıda N, Bourahdoun MI, Boudjahem B. Hybrid Analytic Method for Missing Data Imputation in Medical Big Data. IJIAM. 2023;5:1–11.
MLA
Benhamza, Karima, et al. “Hybrid Analytic Method for Missing Data Imputation in Medical Big Data”. International Journal of Informatics and Applied Mathematics, vol. 5, no. 2, Jan. 2023, pp. 1-11, doi:10.53508/ijiam.1118198.
Vancouver
1.Karima Benhamza, Nadjette Benhamıda, Mohamed Ilyes Bourahdoun, Bilel Boudjahem. Hybrid Analytic Method for Missing Data Imputation in Medical Big Data. IJIAM. 2023 Jan. 1;5(2):1-11. doi:10.53508/ijiam.1118198

International Journal of Informatics and Applied Mathematics