Research Article

Performance of imputation techniques: A comprehensive simulation study using the transformer model

Volume: 43 Number: 1 February 28, 2025
EN

Performance of imputation techniques: A comprehensive simulation study using the transformer model

Abstract

This study addresses the critical challenge of handling missing data in time series analysis, which is maintaining the accuracy and reliability of financial forecasting and other predictive models. The study aims to assess various imputation techniques’ and estimation methods’ performance. The purpose of using imputed data is to enhance the robustness and accuracy of time series analyses, especially when dealing with incomplete datasets. We compared eight different imputation methods to identify the most effective approach. We also compared the performance of the Transformer model, Autoregressive Integrated Moving Average, and Generalized Autoregressive Conditional Heteroskedasticity methods in time series analysis using both complete and imputed datasets. The study employed a comprehensive approach, utilizing the Transformer model, Autoregressive Integrated Moving Average, and Generalized Autoregressive Conditional Heteroskedasticity for time series analysis. Eight imputation methods—last observation carried forward, next observation carried backward, mean imputation, linear interpolation, seasonal decomposition, moving average, regression imputation, and Kalman filtering—were evaluated. Monte Carlo simulations and an application were conducted on generated and real data-driven datasets with different proportions of missing data to assess the performance of these methods. The findings suggest that imputation techniques, such as mean imputation, considered conventional, and Kalman filtering, can significantly en-hance the accuracy of time series models, particularly when integrated with innovative models like the Transformer. Moreover, the last observation carried forward, seasonal decomposition, and moving average did not provide better results in any scenario. Simulation-based synthetic data and application-based real data also revealed that the Transformer model outperformed traditional methods in scenarios with complete data (the original dataset) and new datasets generated through imputation at different rates. The results obtained from the real data-driven application support the findings from the simulation results. In addition to the simulation findings, the application results show that mean imputation performs well in cases with low levels of imputation, while Kalman filtering proves more successful when imputing a high proportion of missing data. This work goes beyond previous studies by systematically comparing a wide range of imputation methods within a unified framework, incorporating both traditional and modern time series models. A comprehensive evaluation of estimation techniques and imputation strategies applicable to time series analysis is presented, exploring appropriate combinations of estimation methods and imputation techniques.

Keywords

References

  1. REFERENCES
  2. [1] Hyndman RJ, Athanasopoulos G. Forecasting: principles and practice. 2nd ed. Melbourne: OTexts; 2018. [CrossRef]
  3. [2] Engle RF. GARCH 101: The use of ARCH/GARCH models in applied econometrics. J Econ Perspect 2001;15:157168. [CrossRef]
  4. [3] Zhao L, Wen X, Wang Y, Shao Y. A novel hybrid model of ARIMA-MCC and CKDE-GARCH for urban short-term traffic flow prediction. IET Intell Transp Syst 2022;16:206217. [CrossRef]
  5. [4] Devianto D, Yollanda M, Maiyastri M, Yanuar F. The soft computing FFNN method for adjusting heteroscedasticity on the time series model of currency exchange rate. Front Appl Math Stat 2023;9:1045218. [CrossRef]
  6. [5] Chandola A, Pandey RM, Agarwal R, Rathour L, Mishra VN. On some properties and applications of the generalized m-parameter Mittag-Leffler function. Adv Math Model Appl 2022;7:130145.
  7. [6] Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, et al. Attention is all you need. In: 31st Conference on Neural Information Processing Systems (NIPS 2017), Long Beach, CA, USA. 2017:5998-6008.
  8. [7] Efimova O, Serletis A. Energy markets volatility modeling using GARCH. Energy Econ 2014;43:264273. [CrossRef]

Details

Primary Language

English

Subjects

Building Technology

Journal Section

Research Article

Publication Date

February 28, 2025

Submission Date

May 6, 2024

Acceptance Date

October 3, 2024

Published in Issue

Year 2025 Volume: 43 Number: 1

APA
Yenilmez, İ. (2025). Performance of imputation techniques: A comprehensive simulation study using the transformer model. Sigma Journal of Engineering and Natural Sciences, 43(1), 199-212. https://izlik.org/JA82JY92UB
AMA
1.Yenilmez İ. Performance of imputation techniques: A comprehensive simulation study using the transformer model. SIGMA. 2025;43(1):199-212. https://izlik.org/JA82JY92UB
Chicago
Yenilmez, İsmail. 2025. “Performance of Imputation Techniques: A Comprehensive Simulation Study Using the Transformer Model”. Sigma Journal of Engineering and Natural Sciences 43 (1): 199-212. https://izlik.org/JA82JY92UB.
EndNote
Yenilmez İ (February 1, 2025) Performance of imputation techniques: A comprehensive simulation study using the transformer model. Sigma Journal of Engineering and Natural Sciences 43 1 199–212.
IEEE
[1]İ. Yenilmez, “Performance of imputation techniques: A comprehensive simulation study using the transformer model”, SIGMA, vol. 43, no. 1, pp. 199–212, Feb. 2025, [Online]. Available: https://izlik.org/JA82JY92UB
ISNAD
Yenilmez, İsmail. “Performance of Imputation Techniques: A Comprehensive Simulation Study Using the Transformer Model”. Sigma Journal of Engineering and Natural Sciences 43/1 (February 1, 2025): 199-212. https://izlik.org/JA82JY92UB.
JAMA
1.Yenilmez İ. Performance of imputation techniques: A comprehensive simulation study using the transformer model. SIGMA. 2025;43:199–212.
MLA
Yenilmez, İsmail. “Performance of Imputation Techniques: A Comprehensive Simulation Study Using the Transformer Model”. Sigma Journal of Engineering and Natural Sciences, vol. 43, no. 1, Feb. 2025, pp. 199-12, https://izlik.org/JA82JY92UB.
Vancouver
1.İsmail Yenilmez. Performance of imputation techniques: A comprehensive simulation study using the transformer model. SIGMA [Internet]. 2025 Feb. 1;43(1):199-212. Available from: https://izlik.org/JA82JY92UB

IMPORTANT NOTE: JOURNAL SUBMISSION LINK https://eds.yildiz.edu.tr/sigma/