Comparative Assessment of Machine Learning and Statistical Models for Precipitation Forecasting in Semi-Arid Regions
Abstract
Accurate precipitation forecasting is essential for water resource management, agricultural planning, and climate resilience, particularly in semiarid regions such as Konya Province, Türkiye, which receives approximately 300 to 350 mm of rainfall annually and is highly vulnerable to drought. This study presents a systematic comparative evaluation of eight forecasting models applied to monthly precipitation data spanning the period from 1958 to 2025. The models evaluated include SARIMAX, LASSO, CatBoost, LightGBM, XGBoost, Multilayer Perceptron (MLP), Long Short Term Memory (LSTM), and a hybrid LSTM and XGBoost framework. Model inputs comprise six key meteorological variables, namely temperature, humidity, evaporation, sunshine duration, wind speed, and precipitation, supplemented by sinusoidal seasonal encodings and lagged or rolling features depending on the model structure. A chronological 70/10/20 train, validation, and test split was applied to all machine learning models, while SARIMAX used the full dataset with exogenous regressors. Performance was evaluated using seven complementary metrics: R2, RMSE, MAE, NSE, MAPE, KGE, and PBIAS. The results reveal a clear three tier performance hierarchy. The hybrid LSTM and XGBoost framework achieved the highest accuracy (R2 = 0.945, RMSE = 5.46 mm), demonstrating that combining temporal feature extraction with nonlinear residual learning substantially improves predictive performance. Tree based ensemble models, including CatBoost, LightGBM, and XGBoost, formed an intermediate tier with R2 values around 0.70, effectively capturing seasonal variability but tending to underestimate extreme rainfall events. Linear and standalone deep learning models, including SARIMAX, LASSO, MLP, and LSTM, constituted the weakest tier. These findings confirm that hybrid architectures offer the most reliable framework for precipitation forecasting in semiarid regions, with direct implications for sustainable water management and climate adaptation planning.
Keywords
References
- Griggs, D. J., & Noguer, M. (2002). Climate change 2001: The scientific basis. Weather, 57, 267–269. https://doi.org/10.1256/004316502320517344
- Asadi, M., & Karami, M. (2021). Modeling of relative humidity trends in Iran. Modeling Earth Systems and Environment, 8, 1035–1045. https://doi.org/10.1007/s40808-021-01093-9
- Pasquini, L., van Aardenne, L., Godsmark, C. N., et al. (2020). Emerging climate change-related public health challenges in Africa: A case study of the heat-health vulnerability of informal settlement residents in Dar es Salaam, Tanzania. Science of the Total Environment, 747, 141355. https://doi.org/10.1016/j.scitotenv.2020.141355
- Kogo, B. K., Kumar, L., & Koech, R. (2021). Climate change and variability in Kenya: A review of impacts on agriculture and food security. Environment, Development and Sustainability, 23, 23–43. https://doi.org/10.1007/s10668-020-00589-1
- Ridwan, W. M., Sapitang, M., Aziz, A., et al. (2021). Rainfall forecasting model using machine learning methods: Case study Terengganu, Malaysia. Ain Shams Engineering Journal, 12, 1651–1663. https://doi.org/10.1016/j.asej.2020.09.011
- Acar, R., Katipoğlu, O. M., & Çırağ, B. (2026). Hybrid VMD-EMD Enhanced Machine Learning Models for Daily Reference Evapotranspiration Forecasting. Engineering Perspective, 6(3), 354–373. https://doi.org/10.64808/engineeringperspective.1906862
- Rezaei Aderyani, F., Mousavi, S. J., & Jafari, F. (2022). Short-term rainfall forecasting using machine learning-based approaches of PSO-SVR, LSTM and CNN. Journal of Hydrology, 614, 128463. https://doi.org/10.1016/j.jhydrol.2022.128463
- Deger, İ. H. (2026). Flood Frequency and Temporal Variability Analysis of Yeşilırmak Basin Under Homogeneity and Change Point Criteria. Engineering Perspective, 6(2), 185–209. https://doi.org/10.64808/engineeringperspective.1896337
Details
Primary Language
English
Subjects
Water Resources Engineering, Civil Engineering (Other)
Journal Section
Research Article
Publication Date
May 22, 2026
Submission Date
March 23, 2026
Acceptance Date
May 21, 2026
Published in Issue
Year 2026 Volume: 6 Number: 3