Middle East Technical University Project Grant (No: 11401)
Multidimensional datasets in healthcare and life sciences often reflect temporal variations, but are often incomplete, complicating the analysis, and reducing statistical accuracy. To address missing data, imputation techniques are widely used, with machine learning algorithms like random forest and k-nearest neighbors and nonparametric methods such as spline and linear interpolation among the common approaches. This study examines electromyography data, a time-series biomedical data set, by evaluating 11 imputation methods in four datasets. We introduce four approaches, normal ratio, weighted normal ratio, expectation maximization, and Gibbs sampling, and assess each for accuracy and computational efficiency. Two scenarios were simulated: unaltered and down-sampled data, each with scattered and intermittent missingness. The comparative assessment emphasizes the notable precision of the expectation maximization method, with the random forest emerging as a robust alternative. Moreover, the normal ratio and weighted normal ratio methods demonstrate computational efficiency akin to mean and median imputation while improving accuracy. We also address cyclic data, a critical factor for improving accuracy. Using Fourier transformation, spline, and autoregressive models, we propose pattern-based and sinusoidal-based approaches to improve imputation. Results indicate that pattern-based improves accuracy, while sinusoidal-based offers efficiency, particularly for k-nearest neighbors.
The ethical statement is not needed in the analyses. The data are taken from public databases.
The study was supported by the Middle East Technical University Project Grant (No: 11401).
Middle East Technical University Project Grant (No: 11401)
The authors would like to thank Prof. Dr. Fikret Ari for his insightful comments on the EMG data analysis. Furthermore, both authors thank the editor and anonymous referees for their insightful comments, which improve the readability of the paper and the assessments of the findings.
| Primary Language | English |
|---|---|
| Subjects | Statistical Analysis, Applied Statistics |
| Journal Section | Research Article |
| Authors | |
| Project Number | Middle East Technical University Project Grant (No: 11401) |
| Early Pub Date | October 1, 2025 |
| Publication Date | October 29, 2025 |
| Submission Date | May 2, 2025 |
| Acceptance Date | August 26, 2025 |
| Published in Issue | Year 2025 Volume: 54 Issue: 5 |