Performance Comparison of Association Rule Algorithms with SPMF on Automotive Industry Data
Abstract
By the recent developments about the information technologies, companies can store their data faster and easier with lower costs. All transactions (sales, current card, invoicing, etc.) performed in companies during the day combine at the end of the day to form big datasets. It is possible to extract valuable information through these datasets with data mining. And this has become more important for companies in terms of today's conditions where the competition in the market is high. In this study, a dataset of a company selling car maintenance and repair products in Turkey is used. Association Rules are applied on this dataset for determining the items which are bought together by the customers. These rules, which are calculated specifically for the company, can be used to redefine the sales and marketing strategies, to revise the storage areas efficiently, and to create sales campaigns suitable for the customers and regions. These algorithms are also called Frequent Itemset Mining Algorithms. The most recent 11 algorithms from these are applied to this dataset in order to compare the performances according to metrics like memory usage and execution times against varying support values and varying record numbers by using SPMF platform. Three different datasets are created by using the whole dataset like 6-months, 12-months and 22-months. According to the experiments, it can be said that executon times generally increases inversely with the support values as nearly all algorithms have higher execution time values for the lowest support value of 0.1. dEclat_bitset algorithm has the most efficient performance for 6-months and 12-months dataset. But Eclat algorithm can be said to be the most efficient algorithm for 0.7 and 0.3 support values; on the other hand dEclat_bitset is the most efficient algorithm for 0.3 and 0.1 support values on 22-months dataset.
Keywords
References
- [1] Gancheva, "Market basket analysis of beauty products." Master of Science in Economics and Business, Erasmus University Rotterdam, Erasmus School of Economics, Rotterdam, Netherlands, 2013.
- [2] Fayyad, Usama, Gregory Piatetsky-Shapiro, and Padhraic Smyth. "From data mining to knowledge discovery in databases." AI magazine, vol.17, no.3, pp. 37, 1996.
- [3] Erpolat, "Otomobil Yetkili Servislerinde Birliktelik Kurallarının Belirlenmesinde Apriori ve FP-Growth Algoritmalarının Karşılaştırılması," Anadolu Üniversitesi Sosyal Bilimler Dergisi, c.12, s.1, ss. 151-166, 2012.
- [4] Bala, A., Shuaibu, M. Z., KaramiLawal, Z., and Zakari, R. I. Y. "Performance Analysis of Apriori and FP-Growth Algorithms (Association Rule Mining)," Int. J. Computer Technology &Applications vol.7, no.2, pp. 279-293, 2016.
- [5] G. Yıldız Erduran, "Online müşteri şikayetlerinin veri madenciliği ile incelenmesi," Doktora tezi, İşletme Bölümü, Trakya Üniversitesi, Edirne, Türkiye, 2017.
- [6] C. Aguwa, M. H. Olya, and L. Monplaisir, "Modeling of fuzzy-based voice of customer for business decision analytics," Knowledge-Based Systems, vol. 125, pp. 136-145, 2017. [7] A. Griva, C. Bardaki, K. Pramatari, and D. Papakiriakopoulos, "Retail business analytics: Customer visit segmentation using market basket data," Expert Systems with Applications, vol. 100, pp. 1-16, 2018.
- [8] M. Postigo-Boix and J. L. Melus-Moreno, "A social model based on customers' profiles for analyzing the churning process in the mobile market of data plans," Physica a-Statistical Mechanics and Its Applications, vol. 496, pp. 571-592, 2018.
- [9] B. Doğan, A. Buldu, Ö. Demir ve B. Erol, "Sigortacılık Sektöründe Müşteri İlişki Yönetimi İçin Kümeleme Analizi." Karaelmas Fen ve Mühendislik Dergisi, c.8, s.1, ss.11-18, 2018.
Details
Primary Language
English
Subjects
Engineering
Journal Section
Research Article
Publication Date
July 31, 2019
Submission Date
June 25, 2019
Acceptance Date
July 6, 2019
Published in Issue
Year 2019 Volume: 7 Number: 3
Cited By
A market basket analysis of the US auto-repair industry
Journal of Business Analytics
https://doi.org/10.1080/2573234X.2020.1838958