EN
The imputation of missingness in cyclic and non-cyclic Electromyography(EMG) signaling data
Abstract
Multidimensional datasets in healthcare and life sciences often reflect temporal variations, but are often incomplete, complicating the analysis, and reducing statistical accuracy. To address missing data, imputation techniques are widely used, with machine learning algorithms like random forest and k-nearest neighbors and nonparametric methods such as spline and linear interpolation among the common approaches. This study examines electromyography data, a time-series biomedical data set, by evaluating 11 imputation methods in four datasets. We introduce four approaches, normal ratio, weighted normal ratio, expectation maximization, and Gibbs sampling, and assess each for accuracy and computational efficiency. Two scenarios were simulated: unaltered and down-sampled data, each with scattered and intermittent missingness. The comparative assessment emphasizes the notable precision of the expectation maximization method, with the random forest emerging as a robust alternative. Moreover, the normal ratio and weighted normal ratio methods demonstrate computational efficiency akin to mean and median imputation while improving accuracy. We also address cyclic data, a critical factor for improving accuracy. Using Fourier transformation, spline, and autoregressive models, we propose pattern-based and sinusoidal-based approaches to improve imputation. Results indicate that pattern-based improves accuracy, while sinusoidal-based offers efficiency, particularly for k-nearest neighbors.
Keywords
Supporting Institution
The study was supported by the Middle East Technical University Project Grant (No: 11401).
Project Number
Middle East Technical University Project Grant (No: 11401)
Ethical Statement
The ethical statement is not needed in the analyses. The data are taken from public databases.
Thanks
The authors would like to thank Prof. Dr. Fikret Ari for his insightful comments on the EMG data analysis. Furthermore, both authors thank the editor and anonymous referees for their insightful comments, which improve the readability of the paper and the assessments of the findings.
References
- [1] Z. C. Lipton, D. C. Kale and R. Wetzel, Modeling missing data in clinical time series with RNNs, Mach. Learn. Healthc. 56(56), 253–270, 2016.
- [2] C. Yozgatligil, S. Aslan, C. Iyigun and I. Batmaz, Comparison of missing value imputation methods in time series: the case of Turkish meteorological data, Theor. Appl. Climatol. 112, 143–167, 2013.
- [3] M. E. Magnello, Karl Pearson and the establishment of mathematical statistics, Int. Stat. Rev. 77(1), 3–29, 2009.
- [4] D. Millett, Hans Berger: From psychic energy to the EEG, Perspect. Biol. Med. 44(4), 522–542, 2001.
- [5] R. A. Fisher, The use of multiple measurements in taxonomic problems, Ann. Eugen. 7(2), 179–188, 1936.
- [6] A. H. Herring and J. G. Ibrahim, Likelihood-based methods for missing covariates in the Cox proportional hazards model, J. Am. Stat. Assoc. 96(453), 292–302, 2001.
- [7] D. B. Chaffin, Surface electromyography frequency analysis as a diagnostic tool, J. Occup. Med. 11(3), 109–115, 1969.
- [8] T. R. Ten Have, Statistical and design issues in geriatric psychiatric research, in: Annu. Rev. Gerontol. Geriatr. 19, 37–52, Springer, 1999.
Details
Primary Language
English
Subjects
Statistical Analysis, Applied Statistics
Journal Section
Research Article
Early Pub Date
October 1, 2025
Publication Date
October 29, 2025
Submission Date
May 2, 2025
Acceptance Date
August 26, 2025
Published in Issue
Year 2025 Volume: 54 Number: 5
APA
Sarasir, F., & Purutcuoglu, V. (2025). The imputation of missingness in cyclic and non-cyclic Electromyography(EMG) signaling data. Hacettepe Journal of Mathematics and Statistics, 54(5), 2036-2067. https://doi.org/10.15672/hujms.1689242
AMA
1.Sarasir F, Purutcuoglu V. The imputation of missingness in cyclic and non-cyclic Electromyography(EMG) signaling data. Hacettepe Journal of Mathematics and Statistics. 2025;54(5):2036-2067. doi:10.15672/hujms.1689242
Chicago
Sarasir, Fatemeh, and Vilda Purutcuoglu. 2025. “The Imputation of Missingness in Cyclic and Non-Cyclic Electromyography(EMG) Signaling Data”. Hacettepe Journal of Mathematics and Statistics 54 (5): 2036-67. https://doi.org/10.15672/hujms.1689242.
EndNote
Sarasir F, Purutcuoglu V (October 1, 2025) The imputation of missingness in cyclic and non-cyclic Electromyography(EMG) signaling data. Hacettepe Journal of Mathematics and Statistics 54 5 2036–2067.
IEEE
[1]F. Sarasir and V. Purutcuoglu, “The imputation of missingness in cyclic and non-cyclic Electromyography(EMG) signaling data”, Hacettepe Journal of Mathematics and Statistics, vol. 54, no. 5, pp. 2036–2067, Oct. 2025, doi: 10.15672/hujms.1689242.
ISNAD
Sarasir, Fatemeh - Purutcuoglu, Vilda. “The Imputation of Missingness in Cyclic and Non-Cyclic Electromyography(EMG) Signaling Data”. Hacettepe Journal of Mathematics and Statistics 54/5 (October 1, 2025): 2036-2067. https://doi.org/10.15672/hujms.1689242.
JAMA
1.Sarasir F, Purutcuoglu V. The imputation of missingness in cyclic and non-cyclic Electromyography(EMG) signaling data. Hacettepe Journal of Mathematics and Statistics. 2025;54:2036–2067.
MLA
Sarasir, Fatemeh, and Vilda Purutcuoglu. “The Imputation of Missingness in Cyclic and Non-Cyclic Electromyography(EMG) Signaling Data”. Hacettepe Journal of Mathematics and Statistics, vol. 54, no. 5, Oct. 2025, pp. 2036-67, doi:10.15672/hujms.1689242.
Vancouver
1.Fatemeh Sarasir, Vilda Purutcuoglu. The imputation of missingness in cyclic and non-cyclic Electromyography(EMG) signaling data. Hacettepe Journal of Mathematics and Statistics. 2025 Oct. 1;54(5):2036-67. doi:10.15672/hujms.1689242