TY - JOUR T1 - Automatic Detection of the Topics in Customer Complaints with Artificial Intelligence AU - Ekinci, Ekin AU - İlhan Omurca, Sevinç AU - Yakupoğlu, Enes AU - Arslan, Emirhan AU - Çapar, Berkay PY - 2021 DA - July DO - 10.17694/bajece.832274 JF - Balkan Journal of Electrical and Computer Engineering PB - MUSA YILMAZ WT - DergiPark SN - 2147-284X SP - 268 EP - 277 VL - 9 IS - 3 LA - en AB - Today, people first make their complaints and compliments on internet about a product which they use or a company they are a customer of. Therefore, when they are going to buy a new product, they first analyze the complaints made by other users of the product. These complaints play an important role in helping people make decision of purchasing or not purchasing product. It is impossible to analyze online complaints manually due to the huge data size. However, companies are still losing a lot of time by analyzing and reading thousands of complaints one by one. In this article, online text based customer complaints are analyzed with Latent Dirichlet Allocation (LDA), GenSim LDA, Mallet LDA and Gibbs Sampling for Dirichlet Multinomial Mixture model (GSDMM) and the performances of them are compared. It is observed that GSDMM gives much more successful results than LDA. The obtained topics of the complaints are presented to users with a mobile application developed in React Native. With the developed application not only the customers will be able to see the topics of complaint from the application interface but also the companies will be able to view the distribution and statistics of the topics of complaints. KW - Topic modelling KW - latent dirichlet allocation KW - gibbs sampling KW - gibbs sampling for dirichlet multinomial mixture KW - natural language processing CR - S. Prasad, “Use of Natural Language Processing to Improve Complaint Classification in Customer Complaint Management System”, Journal of Critical Reviews, Vol.7, No.14, 2020, pp.2642-2652. CR - F. Kalyoncu, E. Zeydan, İ. O. Yiğit, A. Yıldırım, “A Customer Complaint Analysis Tool for Mobile Network Operators.” 2018 IEEE/ACM Int. Conf. on Advances in Social Networks Analysis and Mining. Barcelona, Spain, 2018. CR - R. Liang, W. Guo, D. Yang, “Mining product problems from online feedback of Chinese users”, Kybernetes, Vol.46, No.3, 2017, pp.572-586. CR - K. Bastani, N. Hamed, S. Jeffrey, “Latent Dirichlet allocation (LDA) for topic modelling of the CFPB consumer complaints”, Expert System with Applications, Vol.127, 2019, pp.256-271. CR - W. Mai, M. Wei, J. Zhang, F. Yuan, “Research on Chinese text and application based on the Latent Dirichlet Allocation.” 3rd International Conference on Advanced Electronic Materials, Computers and Software Engineering. Shenzhen, China, 2020. CR - B. Atıcı, S. İlhan Omurca, E. Ekinci, “Product aspect detection in customer complaints by using latent dirichlet allocation.” 2017 International Conference on Computer Science and Engineering. Antalya, Turkey, 2017. CR - X. He, H. Xu, X. Sun, J. Deng, X. Bai, J. Li, “Optimize collapsed Gibbs sampling for biterm topic model by alias method.” 2017 International Joint Conference on Neural Networks. Anchorage, Alaska, 2017. CR - R. Albalawi, T. H. Yeap, M. Benyoucef, “Using Topic Modelling Methods for Short-Text Data: A Comparative Analysis,” Frontiers of Artificial Intelligence, Vol.3, No.42, 2020, pp.1-14. CR - E. Ekinci, S. İlhan Omurca, “An Aspect-Sentiment Pair Extraction Approach Based on Latent Dirichlet Allocation for Turkish,” International Journal of Intelligent Systems and Applications in Engineering, Vol.6, No.3, 2018, pp.209-213. CR - D. M. Blei, “Probabilistic topic models,” Communications of ACM, Vol.55, No.4, 2012, pp.77-84. CR - J. Yin, J. Wang, “A Dirichlet Multinomial Mixture Model-based Approach for Short Text Clustering.” 20th ACM SIGKDD International Joint Conference on Knowledge discovery and data mining. New York, USA, 2014. CR - C. C. Aggarwal, C. Zhai, Mining text data, Springer, 2012, pp.77-128. CR - G. Salton, C. S. Yang, C. T. Yu, “A theory of term importance in automatic text analysis”, Journal of the American Society for Information Science, Vol.26, No.1, 1975, pp.33-44. CR - K. Nigam, A. K. McCallum, S. Thrun, T. M. Mitchell, “Text classification from labeled and unlabeled documents using EM”, Machine Learning, Vol.39, No.2/3, 2000, pp.103-134. CR - I. Akef, X. Xu, J. S. Munoz Arango “Mallet vs GenSim: Topic modeling for 20 news groups report”, 2016. UR - https://doi.org/10.17694/bajece.832274 L1 - https://dergipark.org.tr/en/download/article-file/1418368 ER -