Clustering-based Sales Forecasting in a Forklift Distributor
Abstract
Sales forecasting refers to the prediction of
future demand based on past data. A vast literature on sales forecasting has
accumulated due to its vital role in balancing demand and supply. Among these,
data mining has emerged as a powerful tool to facilitate sales forecasting. In
this study, we use data mining methods for accurate and reliable sales
forecasts in a forklift distributor company. Monthly sales data for 100
different types of forklifts between 1998 and 2016 are used. The proposed
forecasting methodology includes three steps. First, products with similar
sales patterns are determined using hierarchical clustering. Dynamic time
warping is applied to calculate the similarities among product sales data.
Second, features are extracted and selected for each cluster. In addition to
the features adopted from the literature, four new features are proposed to
characterize intermittency. Multivariate adaptive regression splines model is
used for feature selection. Third, support vector regression is used to predict
future sales of each product cluster. Finally, the performance of the proposed
approach is evaluated according to forecasting error and complexity. The
numerical analysis shows that the proposed approach gives reasonable accuracy
with less complexity.
Keywords
References
- Agrawal, R., Faloutsos, C., & Swami, A. (1993). Efficient similarity search in sequence databases. The Fourth International Conference Foundations of Data Organization and Algorithms, 69-84. doi:10.1007/3-540-57301-1_5
- Bala, P.K. (2012). Improving inventory performance with clustering based demand forecasts. Journal of Modelling in Management, 7(1), 23-37. doi:10.1108/17465661211208794
- Bao, Y., Wang, W., & Zhang, J. (2004). Forecasting intermittent demand by SVMs regression. IEEE International Conference on Systems, Man and Cybernetics, 1, 461–466. doi:10.1109/icsmc.2004.1398341
- Bao, Y., Wang, W., & Zou, H. (2005). SVR-based method forecasting intermittent demand for service parts inventories. Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing, 604–613. doi:10.1007/11548706_64
- Berndt, D., & Clifford, J. (1994). Using dynamic time warping to find patterns in time series. Workshop on Knowledge Knowledge Discovery in Databases, 398, 359–370.
- Biscarri, F., Monedero, I., García, A., Guerrero, J. I., & León, C. (2017). Electricity clustering framework for automatic classification of customer loads. Expert Systems with Applications, 86, 54–63. doi:10.1016/j.eswa.2017.05.049
- Box, G. E., Jenkins, G. M., Reinsel, G. C., & Ljung, G. M. (2015). Time series analysis: forecasting and control. John Wiley & Sons. Brown, R. G. (1959). Statistical forecasting for inventory control. McGraw/Hill.
- Chen, I. F., & Lu, C. J. (2017). Sales forecasting by combining clustering and machine-learning techniques for computer retailing. Neural Computing and Applications, 28(9), 2633–2647. doi:10.1007/s00521-016-2215-x
- Croston, J. D. (1972). Forecasting and stock control for intermittent demands. Operational Research Quarterly, 23(3), 289-303. doi:10.2307/3007885
- Dai, W., Chuang, Y.-Y., & Lu, C.-J. (2015). A Clustering-based sales forecasting scheme using support vector regression for computer server. Procedia Manufacturing, 2, 82–86. doi:10.1016/j.promfg.2015.07.014