Research Article

Data division effect on machine learning performance for prediction of streamflow

Volume: 13, Issue: 4, 3 January 2023

Abstract


Accurate estimation of streamflow plays an important role in water resources management, disaster preparedness and early warning, reservoir operation, and the sizing of hydraulic structures. In this study, the Extreme Gradient Boosting (XGBoost) and K-Nearest Neighbours (KNN) algorithms are used to estimate streamflow. To identify the most appropriate model, each algorithm was evaluated both in its raw (untuned) form and with optimized parameters. Several training-test split ratios were also tried in order to investigate which data division gives the most effective results. For this purpose, the data were divided into 60-40, 70-30, 80-20, and 90-10 training-test ratios, and the model results were compared. The models were compared using the Root Mean Square Error (RMSE), Mean Absolute Error (MAE), and Coefficient of Determination (R²). The analysis showed that the most suitable model for monthly streamflow estimation was the optimized XGBoost algorithm with a 60-40 training-test split. The outputs obtained are a useful resource for decision-makers in water resources planning and in flood and drought management.
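The comparison described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the synthetic monthly series `flows`, the helper names, the chronological (unshuffled) split, and the training-mean baseline that stands in for XGBoost and KNN. R² is computed here as 1 - SS_res/SS_tot, which is one common definition; the paper may use a different one.

```python
import math


def chronological_split(series, train_percent):
    """Split a time-ordered series into (train, test) without shuffling."""
    if not 0 < train_percent < 100:
        raise ValueError("train_percent must be between 1 and 99")
    cut = len(series) * train_percent // 100
    return series[:cut], series[cut:]


def rmse(observed, predicted):
    """Root Mean Square Error."""
    squared = [(o - p) ** 2 for o, p in zip(observed, predicted)]
    return math.sqrt(sum(squared) / len(squared))


def mae(observed, predicted):
    """Mean Absolute Error."""
    return sum(abs(o - p) for o, p in zip(observed, predicted)) / len(observed)


def r2(observed, predicted):
    """Coefficient of determination, assumed here to be 1 - SS_res / SS_tot."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot


# Hypothetical data: ten years of synthetic monthly flows with a seasonal cycle.
flows = [50.0 + 30.0 * math.sin(2.0 * math.pi * m / 12.0) for m in range(120)]

results = {}
for train_percent in (60, 70, 80, 90):
    train, test = chronological_split(flows, train_percent)
    # Placeholder model: predict the training mean for every test month.
    # A real study would fit XGBoost or KNN on `train` at this point.
    baseline = sum(train) / len(train)
    predicted = [baseline] * len(test)
    results[train_percent] = (
        rmse(test, predicted),
        mae(test, predicted),
        r2(test, predicted),
    )
    print(train_percent, 100 - train_percent, results[train_percent])
```

Replacing the placeholder baseline with a fitted model and reading `results` side by side gives the kind of per-ratio comparison the study reports, with lower RMSE and MAE and higher R² indicating a better split.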

Details

Primary Language

English

Subjects

-

Section

Research Article

Publication Date

3 January 2023

Submission Date

7 August 2022

Acceptance Date

25 November 2022

Published in Issue

Year 2022 Volume: 13 Issue: 4

Cite

IEEE
[1] O. M. Katipoğlu, “Data division effect on machine learning performance for prediction of streamflow”, DÜMF MD, vol. 13, no. 4, pp. 653–660, Jan. 2023, doi: 10.24012/dumf.1158748.

All articles published by DUJE are licensed under the Creative Commons Attribution 4.0 International License. This permits anyone to copy, redistribute, remix, transmit, and adapt the work, provided that the original work and source are appropriately credited.