Review

Decision Trees in Large Data Sets

Volume: 13 Number: 1 January 18, 2021
Zeynep Çetinkaya , Fahrettin Horasan *
EN

Decision Trees in Large Data Sets

Abstract

Data mining is the process of obtaining information, which is used to identify and define the relationships between data of different qualities. One of the important problems encountered in this process is the classification process in large data sets. Extensive research has been done to find solutions to this classification problem and different solution methods have been introduced. Some decision tree algorithms are among the structures that can be used effectively in this field. In this article, various decision tree structures and algorithms used for classification process in large data sets are discussed. Along with the definitions of the algorithms, the similarities and existing differences between them were determined, their advantages and disadvantages were investigated.

Keywords

Decision trees, Decision tree algorithms, Big data sets, Scalable decision trees

References

  1. Akpınar, H. (2000). Veri Tabanlarında Bilgi Keşfi ve Veri Madenciliği, İ. U. işletme Fakültesi Dergisi, C:29, s: 1-22.
  2. Alsabti, K., Ranka, S., Singh, V. (1998). CLOUDS: A Decision Tree Classifier for Large Datasets, Electrical Engineering and Computer Science.Paper 41.
  3. Aytekin, Ç., Sütcü, C.S., Özfidan, U. (2018) Text Classıfıcatıon Vıa Decısıon Trees Algorıthm: Customer Comments Case, The Journal of International Social Research, vol:11 ss:55.
  4. Berry, M.J.A., Lınoff, G.S. (1997). Data Mining Techniques for Marketing, Sales, and Customer Relationship Management, First Edition, Wiley Publishing, 187-216.
  5. Bounsaythip, C., Rinta, R.E. (2001). Overview of Data Mining For Customer Behavior Modeling. VTT Information Technology Research Report. Ver 1. ss. 21.
  6. Breiman, L., Freidman, J.H., Olshen, R.A., Stone, C.J.(1984). Classification and Regression Trees, Monterey, CA: Wadsworth, 358 s.
  7. Chaudhuri, S., Fayyad, U., Berhardt, J. (1997). Scalable Classification over SQL Database, Technical Report MSR-TR-97-35, Microsoft Research.
  8. Chein, C. F., Chen, L. F. (2008) Data Mining to Improve Personnel Selection and Enhance Human Capital: A Case Study in High-Technology Industry, Expert Systems with Applications, vol. 34, p. 280-290.
  9. Çalış, A., Kayapınar, S., Çetinyokuş, T. (2014). Veri Madenciliğinde Karar Ağacı Algoritmaları ile Bilgisayar ve İnternet Güvenliği Üzerine Bir Uygulama, Endüstri Mühendisliği Dergisi cilt:25 sayı:3-4 s:2-19.
  10. Demirel, Ş., Y. Giray, S. (2019). Karar Ağacı Algoritmaları ve Çocuk İşçiliği Üzerine Bir Uygulama. Social Sciences Research Journal, 8 (4), 52-65.
APA
Çetinkaya, Z., & Horasan, F. (2021). Decision Trees in Large Data Sets. International Journal of Engineering Research and Development, 13(1), 140-151. https://doi.org/10.29137/umagd.763490
AMA
1.Çetinkaya Z, Horasan F. Decision Trees in Large Data Sets. IJERAD. 2021;13(1):140-151. doi:10.29137/umagd.763490
Chicago
Çetinkaya, Zeynep, and Fahrettin Horasan. 2021. “Decision Trees in Large Data Sets”. International Journal of Engineering Research and Development 13 (1): 140-51. https://doi.org/10.29137/umagd.763490.
EndNote
Çetinkaya Z, Horasan F (January 1, 2021) Decision Trees in Large Data Sets. International Journal of Engineering Research and Development 13 1 140–151.
IEEE
[1]Z. Çetinkaya and F. Horasan, “Decision Trees in Large Data Sets”, IJERAD, vol. 13, no. 1, pp. 140–151, Jan. 2021, doi: 10.29137/umagd.763490.
ISNAD
Çetinkaya, Zeynep - Horasan, Fahrettin. “Decision Trees in Large Data Sets”. International Journal of Engineering Research and Development 13/1 (January 1, 2021): 140-151. https://doi.org/10.29137/umagd.763490.
JAMA
1.Çetinkaya Z, Horasan F. Decision Trees in Large Data Sets. IJERAD. 2021;13:140–151.
MLA
Çetinkaya, Zeynep, and Fahrettin Horasan. “Decision Trees in Large Data Sets”. International Journal of Engineering Research and Development, vol. 13, no. 1, Jan. 2021, pp. 140-51, doi:10.29137/umagd.763490.
Vancouver
1.Zeynep Çetinkaya, Fahrettin Horasan. Decision Trees in Large Data Sets. IJERAD. 2021 Jan. 1;13(1):140-51. doi:10.29137/umagd.763490

Cited By

Enhancing Decision Trees for Data Stream Mining

Advances in Science, Technology and Engineering Systems Journal

https://doi.org/10.25046/aj060537