Research Article

Comparison of Serial and Parallel Programming Performance in Outlier Detection with DBSCAN Algorithm

Volume: 7 Issue: 1, 28 June 2020

Abstract

As computers have entered everyday life, the volume of digital data has grown steadily. These data can contain non-standard values (outliers) that behave differently from the rest. Detecting such values, especially in big data sets, is of great importance in fields such as security, insurance, finance, medicine and genetics. Among data mining techniques, clustering methods are frequently used for outlier detection in big data sets. DBSCAN (Density-Based Spatial Clustering of Applications with Noise), a density-based clustering algorithm that is sensitive to noisy and outlier values, is one of the most important methods for outlier detection. In this study, an application for detecting outliers was developed with the DBSCAN algorithm in the C# programming language. Two data sets of different sizes were examined and analyzed in the application. To keep the analysis time as short as possible, serial and parallel programming techniques were each applied separately. To shorten the analysis time for big data sets, members of the Parallel class in the TPL (Task Parallel Library), provided with .NET 4.0, were used. The analyses of the data sets showed that the DBSCAN algorithm produces more accurate results and is more practicable for outlier detection than the other selected algorithms. In terms of computing performance, parallel programming became more efficient as the number of data points increased.
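
The idea summarized in the abstract can be illustrated with a minimal sketch. It is written in Python rather than the C# and TPL used in the study, and it is not the authors' implementation. The function names, the sample points, and the values of `eps` and `min_pts` are all assumptions chosen for illustration. A point is reported as an outlier when DBSCAN would label it as noise, that is, when it is neither a core point nor within `eps` of a core point. The neighborhood query for each point is independent of the others, which is the step a parallel loop can split across workers.

```python
from concurrent.futures import ThreadPoolExecutor
from math import dist


def region_query(points, i, eps):
    """Return the indices of all points within eps of points[i], including i."""
    return [j for j, q in enumerate(points) if dist(points[i], q) <= eps]


def neighbors_serial(points, eps):
    """Run the region queries one after another."""
    return [region_query(points, i, eps) for i in range(len(points))]


def neighbors_parallel(points, eps, workers=4):
    """Run the same queries on a thread pool; map() keeps the input order."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(lambda i: region_query(points, i, eps),
                             range(len(points))))


def dbscan_outliers(points, eps, min_pts, neighbors_fn=neighbors_serial):
    """Return the indices of the points DBSCAN would label as noise."""
    neighbors = neighbors_fn(points, eps)
    # A core point has at least min_pts points (itself included) within eps.
    core = [len(n) >= min_pts for n in neighbors]
    # Noise: not a core point and not within eps of any core point.
    return [i for i, n in enumerate(neighbors)
            if not core[i] and not any(core[j] for j in n)]


# Illustrative data: a tight group of four points and one distant point.
points = [(0.0, 0.0), (0.0, 0.5), (0.5, 0.0), (0.5, 0.5), (10.0, 10.0)]
serial = dbscan_outliers(points, eps=1.0, min_pts=3)
parallel = dbscan_outliers(points, eps=1.0, min_pts=3,
                           neighbors_fn=neighbors_parallel)
print(serial, parallel)  # both report index 4 as the outlier
```

Both variants return the same result, so only the running time differs. In CPython, threads do not speed up CPU-bound loops like this one, so the sketch shows only how the work is partitioned and should not be read as a performance comparison.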

Supporting Institution

Süleyman Demirel University Scientific Research Projects Coordination Unit

Project Number

4199-YL1-14

Acknowledgments

We thank the Süleyman Demirel University Scientific Research Projects Coordination Unit for its support.

Details

Primary Language

English

Subjects

Engineering

Section

Research Article

Publication Date

28 June 2020

Submission Date

21 November 2019

Acceptance Date

26 April 2020

Published in Issue

Year 2020 Volume: 7 Issue: 1

Cite

APA
Yaşar, H., & Albayrak, M. (2020). Comparison of Serial and Parallel Programming Performance in Outlier Detection with DBSCAN Algorithm. Bilecik Şeyh Edebali Üniversitesi Fen Bilimleri Dergisi, 7(1), 129-140. https://doi.org/10.35193/bseufbd.649539