Research Article

An Application with Python Software for the Classification of Chemical Data

Number: 1 August 15, 2023
EN

An Application with Python Software for the Classification of Chemical Data

Abstract

Nowadays, much data can be generated and stored by chemical analyses. It is possible to evaluate these data, to reveal the relationships between them, and to make predictions with new data measured based on these relationships thanks to data mining algorithms. Monitoring the treatment processes and providing the necessary controls for environmental studies are based on the continuous determination of wastewater and activated sludge characteristics. The main criteria for determining the properties of wastewater are biochemical oxygen demand (BOD5), chemical oxygen demand (COD), total organic carbon (TOC), and dissolved oxygen (DO). Among these parameters, BOD5 measurement takes 5 days, while the others can be measured within 1-2 hours at most. Since BOD5 values can be mathematically correlated with other parameters, estimating them in a short time will provide a great advantage in terms of process control. In this study, a data set was created by measuring the specified parameters from 334 samples taken from a treatment plant for statistical evaluation, and the interactions of the parameters in this data set with each other were analyzed by the decision tree method. Thus, by considering the weighted effects of the parameters, it was tried to predict the probable BOD5 value of an unknown sample. The algorithm selected for this data mining study was modeled with PYTHON software and the performance of the algorithm in the estimation of the BOD5 parameter depending on other parameters was examined by extracting decision tree rules.

Keywords

References

  1. Activestate. (2022). How to Classify Data In Python using Scikit-learn. Retrieved May 3, 2023, from https://www. activestate.com/resources/quick-reads/how-to-classify-data-in-python/ google scholar
  2. Alan, A., & Karabatak, B. (2020). Veri Seti - Sınıflandırma İlişkisinde Performansa Etki Eden Faktörlerin Değerlendirilmesi, Fırat Üniversitesi Mühendislik Bilimleri Dergisi, 32(2), 531-540. google scholar
  3. Amazon, (2016). Retrieved May 3, 2023, from https://www.amazon.com/Hach-8505700-Measurement-Luminescent-Dissolved/dp/B00R3EGHJ4 google scholar
  4. Anaconda. (2022). anaconda/packages/python. https://anaconda.org/anaconda/python/anaconda/packages/ (python3.10.6) google scholar
  5. Çelik, M. (2009). Veri Madenciliğinde Kullanılan Sınıflandırma Yöntemleri ve Bir Uygulama [Yüksek Lisans Tezi]. İstanbul Üniversitesi Sosyal Bilimler Enstitüsü Ekonometri Ana Bilim Dalı. google scholar
  6. Çınar, A. (2019). Veri Madenciliğinde Sınıflandırma Algoritmalarının Performans Değerlendirmesi ve R Dili ile Bir Uygulama, Marmara Üniversitesi Öneri Dergisi, 14(51), 90-111. google scholar
  7. Doğan, O. (2017). Ücretsiz Veri Madenciliği Araçları ve Türkiyede Bilinirlikleri Üzerine Bir Araştırma, Ege Stratejik Araştırmalar Dergisi, 8(1), 77-93. google scholar
  8. Eltem, R. (2001). Atık Sular ve Arıtım, Ege Üniversitesi Fen Fakültesi Yayınları, 172 google scholar

Details

Primary Language

English

Subjects

Software Engineering (Other)

Journal Section

Research Article

Publication Date

August 15, 2023

Submission Date

March 17, 2023

Acceptance Date

May 3, 2023

Published in Issue

Year 2023 Number: 1

APA
Ertürk, G., & Akpolat, O. (2023). An Application with Python Software for the Classification of Chemical Data. Journal of Data Applications, 1, 49-68. https://doi.org/10.26650/JODA.1264915
AMA
1.Ertürk G, Akpolat O. An Application with Python Software for the Classification of Chemical Data. Journal of Data Applications. 2023;(1):49-68. doi:10.26650/JODA.1264915
Chicago
Ertürk, Gonca, and Oğuz Akpolat. 2023. “An Application With Python Software for the Classification of Chemical Data”. Journal of Data Applications, no. 1: 49-68. https://doi.org/10.26650/JODA.1264915.
EndNote
Ertürk G, Akpolat O (August 1, 2023) An Application with Python Software for the Classification of Chemical Data. Journal of Data Applications 1 49–68.
IEEE
[1]G. Ertürk and O. Akpolat, “An Application with Python Software for the Classification of Chemical Data”, Journal of Data Applications, no. 1, pp. 49–68, Aug. 2023, doi: 10.26650/JODA.1264915.
ISNAD
Ertürk, Gonca - Akpolat, Oğuz. “An Application With Python Software for the Classification of Chemical Data”. Journal of Data Applications. 1 (August 1, 2023): 49-68. https://doi.org/10.26650/JODA.1264915.
JAMA
1.Ertürk G, Akpolat O. An Application with Python Software for the Classification of Chemical Data. Journal of Data Applications. 2023;:49–68.
MLA
Ertürk, Gonca, and Oğuz Akpolat. “An Application With Python Software for the Classification of Chemical Data”. Journal of Data Applications, no. 1, Aug. 2023, pp. 49-68, doi:10.26650/JODA.1264915.
Vancouver
1.Gonca Ertürk, Oğuz Akpolat. An Application with Python Software for the Classification of Chemical Data. Journal of Data Applications. 2023 Aug. 1;(1):49-68. doi:10.26650/JODA.1264915