Short-Term High-Frequency Temperature Forecasting Using Hybrid Stacking Ensemble
Abstract
Weather forecasting is valuable for agriculture through crop production, renewable resource management such as solar or wind farms, and disaster management for preparation and response. It enables an understanding of future conditions so that decisions can be made at the right time. To make well-founded decisions about the long-term use of limited resources, it is essential to know accurately when these resources will be available. In this research, we used a high-frequency multivariate sensor dataset recorded every 10 minutes from Max Planck Institute weather station. This dataset includes 20 atmospheric parameters, such as temperature, humidity, pressure, and wind direction. We developed an appropriate preprocessing pipeline to prepare raw data for analysis. Our pipeline included converting timestamps into date/time format, engineering circular wind-direction features, interpolating missing values, splitting the training and test sets chronologically, and normalizing the features using Min--Max scaling. Using the processed dataset, we compared time-series models (SARIMA and Prophet) with tree-based machine learning models (Random Forest, XGBoost, and CatBoost), as well as a hybrid stacking ensemble that combined the strengths of these models. Performance was assessed on a 7-day test set using Mean Absolute Error (MAE), Root Mean Square Error (RMSE), Mean Absolute Percentage Error (MAPE), and \(R^2\). Our results showed that the Hybrid Stacking Ensemble was the most effective overall model, achieving \(\mathrm{MAE}=0.255\,^{\circ}\mathrm{C}\), \(\mathrm{RMSE}=0.329\,^{\circ}\mathrm{C}\), and \(R^2=0.949\). It significantly outperformed both the individual machine learning models and the baseline models, such as SARIMA and Prophet. Hybrid Stacking Ensemble reduced the error of the best-performing individual model by \(26\%\), while maintaining near-perfect linear agreement with the observed temperatures.
Keywords
High-resolution weather sensing, Data-driven weather modeling, Machine learning, Hybrid stacking ensemble learning, Multivariate climate time series
References
- M. H. F. da Silva, G. F. de Jesus, C. Nascimento, et al., Exploring quantum machine learning for weather forecasting, Braz. J. Phys., 56(1) (2026), Article ID 22. https://doi.org/10.1007/s13538-025-01941-4
- T. P. Agyekum, P. Antwi-Agyei, A. J. Dougill, The contribution of weather forecast information to agriculture, water, and energy sectors in East and West Africa: A systematic review, Front. Environ. Sci., 10 (2022), Article ID 935696. https://doi.org/10.3389/fenvs.2022.935696
- W. Atwa, A. A. Almazroi, N. Ayub, Reliable renewable energy forecasting for climate change mitigation, PeerJ Comput. Sci., 10 (2024), Article ID e2067. https://doi.org/10.7717/peerj-cs.2067
- A. C. Gonc¸alves, X. Costoya, R. Nieto, et al., Extreme weather events on energy systems: A comprehensive review on impacts, mitigation, and adaptation measures, Sustainable Energy Res., 11(1) (2024), Article ID 4. https://doi.org/10.1186/s40807-023-00097-6
- J. A. Segovia, J. F. Toaquiza, J. R. Llanos, et al., Meteorological variables forecasting system using machine learning and open-source software, Electronics, 12(4) (2023), Article ID 1007. https://doi.org/10.3390/electronics12041007
- R. Meenal, D. Binu, K. C. Ramya, et al., Weather forecasting for renewable energy system: A review, Arch. Comput. Methods Eng., 29 (2022), 2875–2891. https://doi.org/10.1007/s11831-021-09695-3
- K. Lagouvardos, V. Kotroni, A. Bezes, et al., The automatic weather stations NOANN network of the National Observatory of Athens: Operation and database, Geosci. Data J., 4(1) (2017), 4–16. https://doi.org/10.1002/gdj3.44
- H. Lira, L. Mart´ı, N. Sanchez-Pi, A graph neural network with spatio-temporal attention for multi-sources time series data: An application to frost forecast, Sensors, 22(4) (2022), Article ID 1486. https://doi.org/10.3390/s22041486
- R. Tsela, S. Maladaki, S. Kolios, An integrated, automated and modular approach for real-time weather monitoring of surface meteorological variables and short-range forecasting using machine learning, Environ. Model. Softw., 183 (2025), Article ID 106203. https://doi.org/10.1016/j.envsoft.2024.106203
- V. Papastefanopoulos, P. Linardatos, T. Panagiotakopoulos, et al., Multivariate time-series forecasting: A review of deep learning methods in internet of things applications to smart cities, Smart Cities, 6(5) (2023), 2519–2552. https://doi.org/10.3390/smartcities6050114
