Research Article

Initial Seed Value Effectiveness on Performances of Data Mining Algorithms

Volume: 9 Number: 2 April 25, 2021
EN TR

Initial Seed Value Effectiveness on Performances of Data Mining Algorithms

Abstract

After 2000s, Computer capacities and features are increased and access to data made easy. However, the produced and recorded data should be meaningful. Transformation of unprocessed data into meaningful information can be done with the help of data mining. In this study, classification methods from data mining applications are studied. First, the parameters that make the results of the same data set different were investigated on 4 different data mining tools (Weka, Rapid Miner, Knime, Orange), It has been tested with 3 different algorithms (K nearest neighborhood, Naive Bayes, Random Forest). In order to evaluate the performance of the data set while creating the classification models, the data set was divided into training data and test data as 80% -20%, 70% -30% and 60-40%. The accuracy, roc and precision values was used to test the performance of the classifying data. While classifying, the effect of algorithm parameters on the results is observed. The most important of these parameters is the initial seed value. The initial seed is a value using especially in classification algorithms that determines the initial placement of the data and directly affects the result. In this respect, it is very important to determine the initial seed value correctly. In this study, initial seed values between 0 and 100 were evaluated and it was shown that the classification could change the accuracy value approximately by 5%.

Keywords

References

  1. [1] M. S. Durmuş, “Veri kümeleme algoritmalarının performansları üzerine karşılaştırmalı bir çalışma,” M.S. thesis, Fen Bilimleri Enstitüsü, Pamukkale Üniversitesi, Denizli, 2005.
  2. [2] Y.Farhang, “Face Extraction from Image based on K-Means Clustering Algorithms,” International Journal Of Advanced Computer Science And Applications, 8(9), 96-107,2017.
  3. [3] H. Kaya, K. Köymen, “Veri madenciliği kavramı ve uygulama alanları,” Doğu Anadolu bölgesi araştırmaları Dergisi, 6(2), 159-164, 2008.
  4. [4] Q. Chen, Y. Wan, X. Zhang, Y. Lei, J. Zobel, K. Verspoor, “Comparative analysis of sequence clustering methods for deduplication of biological databases,” Journal of Data and Information Quality (JDIQ), 9(3), 17, 2018.
  5. [5] M. A. ALAN, “VERİ MADENCİLİĞİ VE LİSANSÜSTÜ ÖĞRENCİ VERİLERİ ÜZERİNE BİR UYGULAMA,”Dumlupinar University Journal of Social Science/Dumlupinar Üniversitesi Soysyal Bilimler Dergisi, (33), 2012.
  6. [6] S. ÖZŞEN, R. Ceylan, “Comparison of AIS and fuzzy c-means clustering methods on the classification of breast cancer and diabetes datasets,” Turkish Journal of Electrical Engineering & Computer Sciences, 22(5), 1241-1254, 2014.
  7. [7] G. Kayakutlu, I. Duzdar, E. Mercier-Laurent, B. Sennaroglu, “Intelligent association rules for innovative SME collaboration,” IFIP International Workshop on Artificial Intelligence for Knowledge Management, Springer, Cham, 150-164, 2014.
  8. [8] A. M. Moawad, A. M. Gadallah, M. H. Kholief, “Fuzzy Ontology based Approach for Flexible Association Rules Mining,” Internatıonal Journal Of Advanced Computer Scıence And Applıcatıons, 8(5), 328-337, 2017.

Details

Primary Language

English

Subjects

Engineering

Journal Section

Research Article

Publication Date

April 25, 2021

Submission Date

October 19, 2020

Acceptance Date

February 2, 2021

Published in Issue

Year 2021 Volume: 9 Number: 2

APA
Timuçin, T., & Duzdar Argun, İ. (2021). Initial Seed Value Effectiveness on Performances of Data Mining Algorithms. Duzce University Journal of Science and Technology, 9(2), 555-567. https://doi.org/10.29130/dubited.813101
AMA
1.Timuçin T, Duzdar Argun İ. Initial Seed Value Effectiveness on Performances of Data Mining Algorithms. DUBİTED. 2021;9(2):555-567. doi:10.29130/dubited.813101
Chicago
Timuçin, Tunahan, and İrem Duzdar Argun. 2021. “Initial Seed Value Effectiveness on Performances of Data Mining Algorithms”. Duzce University Journal of Science and Technology 9 (2): 555-67. https://doi.org/10.29130/dubited.813101.
EndNote
Timuçin T, Duzdar Argun İ (April 1, 2021) Initial Seed Value Effectiveness on Performances of Data Mining Algorithms. Duzce University Journal of Science and Technology 9 2 555–567.
IEEE
[1]T. Timuçin and İ. Duzdar Argun, “Initial Seed Value Effectiveness on Performances of Data Mining Algorithms”, DUBİTED, vol. 9, no. 2, pp. 555–567, Apr. 2021, doi: 10.29130/dubited.813101.
ISNAD
Timuçin, Tunahan - Duzdar Argun, İrem. “Initial Seed Value Effectiveness on Performances of Data Mining Algorithms”. Duzce University Journal of Science and Technology 9/2 (April 1, 2021): 555-567. https://doi.org/10.29130/dubited.813101.
JAMA
1.Timuçin T, Duzdar Argun İ. Initial Seed Value Effectiveness on Performances of Data Mining Algorithms. DUBİTED. 2021;9:555–567.
MLA
Timuçin, Tunahan, and İrem Duzdar Argun. “Initial Seed Value Effectiveness on Performances of Data Mining Algorithms”. Duzce University Journal of Science and Technology, vol. 9, no. 2, Apr. 2021, pp. 555-67, doi:10.29130/dubited.813101.
Vancouver
1.Tunahan Timuçin, İrem Duzdar Argun. Initial Seed Value Effectiveness on Performances of Data Mining Algorithms. DUBİTED. 2021 Apr. 1;9(2):555-67. doi:10.29130/dubited.813101

Cited By