Research Article

Prediction of Water Quality’s pH value using Random Forest and LightGBM Algorithms

Volume: 11 Number: 1 March 28, 2025
EN TR

Prediction of Water Quality’s pH value using Random Forest and LightGBM Algorithms

Abstract

This study aims to compare Random Forest Regression and LightGBM algorithms for the prediction of pH value, which is an important parameter in water quality assessment. The performance of both algorithms is evaluated with metrics such as RMSE, R-squared and AUC (Area Under Curve). The results show that the LightGBM algorithm outperforms Random Forest (0.84) with an AUC value of 0.86 and provides better prediction accuracy, especially on large and complex datasets. These findings demonstrate the applicability of machine learning techniques in environmental monitoring processes and their potential for effective management of water quality. The results highlight the superiority of the LightGBM algorithm in solving environmental problems such as pH prediction, but also provide suggestions for more comprehensive approaches. The application of hybrid modeling techniques, generalizable analyses with datasets from different water sources, and the development of real-time monitoring systems are suggested to extend the findings of the study. This study contributes to the literature by demonstrating the importance of machine learning algorithms in environmental monitoring and water quality management.

Keywords

References

  1. Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5–32. https://doi.org/10.1023/A:1010933404324
  2. Elsenety, M. M., Mohamed, M. B. I., Sultan, M. E., & Elsayed, B. A. (2022). Facile and highly precise pH-value estimation using common pH paper based on machine learning techniques and supported mobile devices. Scientific Reports, 12(22584). https://doi.org/10.1038/s41598-022-27054-5
  3. Ganapa, J. R., Choudari, S., & Rao, M. K. (2024). Gold price prediction using random forest regression. Educational Administration: Theory and Practice, 30(1), 1052–1055. https://doi.org/10.53555/kuey.v30i1.5928
  4. Gao, B., & Balyan, V. (2022). Construction of a financial default risk prediction model based on the LightGBM algorithm. Journal of Intelligent Systems, 31(767–779). https://doi.org/10.1515/jisys-2022-0036
  5. Iyer, S., Kaushik, S., & Nandal, P. (2023). Water quality prediction using machine learning. Manav Rachna International Journal of Engineering and Technology, 10(1), 59-68. https://doi.org/10.58864/mrijet.2023.10.1.8
  6. Kaggle, https://www.kaggle.com/datasets/somasreemajumder/waterdataset , (30.12.2024).
  7. Karaatlı, M., Helvacıoğlu, Ö. C., Ömürbek, N., & Tokgöz, G. (2012). Yapay sinir ağlari yöntemi ile otomobil satiş tahmini. Uluslararası Yönetim İktisat ve İşletme Dergisi, 8(17), 87-100.
  8. Koranga, M., et al. (2022). Machine learning algorithms for water quality prediction for Nanital Lake, Uttarakhand. International Journal of Advanced Research, 10(2), 103-114.

Details

Primary Language

English

Subjects

Ecology (Other)

Journal Section

Research Article

Publication Date

March 28, 2025

Submission Date

January 29, 2025

Acceptance Date

March 24, 2025

Published in Issue

Year 2025 Volume: 11 Number: 1

APA
Budak, İ. (2025). Prediction of Water Quality’s pH value using Random Forest and LightGBM Algorithms. MEMBA Su Bilimleri Dergisi, 11(1), 42-49. https://doi.org/10.58626/memba.1667338
AMA
1.Budak İ. Prediction of Water Quality’s pH value using Random Forest and LightGBM Algorithms. MEMBA Su Bilimleri Dergisi. 2025;11(1):42-49. doi:10.58626/memba.1667338
Chicago
Budak, İbrahim. 2025. “Prediction of Water Quality’s PH Value Using Random Forest and LightGBM Algorithms”. MEMBA Su Bilimleri Dergisi 11 (1): 42-49. https://doi.org/10.58626/memba.1667338.
EndNote
Budak İ (March 1, 2025) Prediction of Water Quality’s pH value using Random Forest and LightGBM Algorithms. MEMBA Su Bilimleri Dergisi 11 1 42–49.
IEEE
[1]İ. Budak, “Prediction of Water Quality’s pH value using Random Forest and LightGBM Algorithms”, MEMBA Su Bilimleri Dergisi, vol. 11, no. 1, pp. 42–49, Mar. 2025, doi: 10.58626/memba.1667338.
ISNAD
Budak, İbrahim. “Prediction of Water Quality’s PH Value Using Random Forest and LightGBM Algorithms”. MEMBA Su Bilimleri Dergisi 11/1 (March 1, 2025): 42-49. https://doi.org/10.58626/memba.1667338.
JAMA
1.Budak İ. Prediction of Water Quality’s pH value using Random Forest and LightGBM Algorithms. MEMBA Su Bilimleri Dergisi. 2025;11:42–49.
MLA
Budak, İbrahim. “Prediction of Water Quality’s PH Value Using Random Forest and LightGBM Algorithms”. MEMBA Su Bilimleri Dergisi, vol. 11, no. 1, Mar. 2025, pp. 42-49, doi:10.58626/memba.1667338.
Vancouver
1.İbrahim Budak. Prediction of Water Quality’s pH value using Random Forest and LightGBM Algorithms. MEMBA Su Bilimleri Dergisi. 2025 Mar. 1;11(1):42-9. doi:10.58626/memba.1667338

Cited By

Founded in 2013 as the "Menba Kastamonu University Faculty of Fisheries Journal," our journal continues to be published as the "MEMBA Journal of Water Sciences."
-----------
MEMBA Journal of Water Sciences is an international, peer-reviewed, open-access scientific journal published by Kastamonu University. The journal aims to encourage the publication of fundamental and applied scientific research related to aquatic sciences and water resources, strengthen interdisciplinary scientific communication, and increase knowledge in this field. The journal began publishing continuously in 2026 and only accepts original articles, short notes, technical notes, reports, and reviews in English.

MEMBA Water Sciences Journal
TRDizin, SOBIAD, ASCI, CAB Direct, Google Scholar, Paperity, Asosindex, Academic Journal Index, CNKI Scholar
----------
Submitting a plagiarism similarity report is mandatory when uploading an article to our journal, and the plagiarism similarity rate in this report must be below 30%. Authors must submit this report when uploading their article.
Articles uploaded to our journal must include abstracts in both Turkish and English.