Reliable daily precipitation prediction is pivotal for agricultural water management in semi-arid regions, where water scarcity and climate variability raise operational risk. This study develops and evaluates machine-learning approaches for precipitation occurrence in Konya, Türkiye - one of the country’s key agricultural basins - using 44 years (1980–2024) of meteorological data. We examine three data strategies: (i) European Centre for Medium-Range Weather Forecasts (ECMWF) Reanalysis v5 (ERA5) reanalysis features alone, (ii) National Aeronautics and Space Administration (NASA) Prediction Of Worldwide Energy Resources (POWER) observations alone, and (iii) a combined strategy that merges ERA5 predictors with the NASA rain/no-rain target. Five model families are compared— Random Forest (RF), Extreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (LightGBM), Categorical Boosting (CatBoost), and a three-layer Long Short-Term Memory (LSTM) on analysis-ready tabular datasets. All approaches use advanced feature engineering (cyclical seasonality, lags, rolling windows) and a deployment-minded post-processing step that scans decision thresholds from 0.10 to 0.90 to maximize F1‑score. Performance is assessed with a chronological train/test split; F1‑score is the primary metric, complemented by Area Under the Receiver Operating Characteristic Curve (AUC-ROC), precision, recall, confusion matrices, and calibration. Results show that the combined strategy with CatBoost delivers the highest skill (F1‑score: 84.57%; AUC-ROC: 94.37%), confirming that pairing rich ERA5 features with the quality-controlled NASA target improves performance relative to single-source approaches. Ensemble tree-based methods consistently outperform the LSTM baseline on this daily, tabular classification task. Threshold optimization raises F1‑score by about 1–5% across models, and calibration indicates that predicted probabilities closely match observed frequencies. For semi-arid farming in Konya, calibrated daily probabilities of precipitation can be converted into simple decision rules (e.g., skip irrigation when the predicted probability exceeds a locally selected threshold), supporting irrigation scheduling, fertilizer timing, and harvest planning. The workflow is computationally efficient and reproducible, built on globally available ERA5 and NASA POWER resources, and readily transferable to other semi-arid basins with minor re-tuning. Findings highlight a practical path to trustworthy, probability-based rainfall guidance for agricultural water management. Feature importance analysis highlights seasonality, humidity, temperature ranges, pressure tendencies, and wind extremes as leading signals.
Agricultural water management ERA5 Machine learning Model calibration NASA POWER Precipitation prediction
| Primary Language | English |
|---|---|
| Subjects | Agricultural Systems Analysis and Modelling, Agricultural Hydrology, Agricultural Water Management, Agro-Ecosystem Function and Prediction |
| Journal Section | Research Article |
| Authors | |
| Submission Date | November 4, 2025 |
| Acceptance Date | December 16, 2025 |
| Publication Date | December 28, 2025 |
| Published in Issue | Year 2025 Volume: 9 Issue: Special |
Abstracting & Indexing Services
© International Journal of Agriculture, Environment and Food Sciences
All content published by the journal is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0).
This license allows others to share and adapt the material for non-commercial purposes, provided proper attribution is given to the original work.
Authors retain the copyright of their articles and grant the journal the right of first publication under an open-access model
Web: dergipark.org.tr/jaefs E-mail: editorialoffice@jaefs.com Phone: +90 850 309 59 27