CREDIT SCORING BY ARTIFICIAL NEURAL NETWORKS BASED CROSS-ENTROPY AND FUZZY RELATIONS
Year 2019,
Volume: 37 Issue: 3, 855 - 870, 01.09.2020
Damla Ilter
Ozan Kocadaglı
Abstract
The credit scoring is one of the major activities in the banking sector. Because of growing market and increasing the loan applications, this field still continues its concern in terms of rating the applicants and assessing the credit amounts. To reduce the number of wrong decisions in the credit evaluation process, the decision makers focus on estimating more robust models. However, the traditional methods are criticized due to various pre-requisites and linear approximations in the high dimensional and excessive nonlinear cases. For this reason, artificial intelligence techniques are mostly preferred to handle the credit scoring problems accurately. This study presents an efficient procedure that is based on ANNs with cross-entropy and fuzzy relations in the context of the credit scoring. In the implementations, the proposed procedure is applied to a couple of benchmark credit scoring data sets and its performance is compared with traditional approaches.
References
- [1] D. Soydaner, O. Kocadağlı, Artificial neural networks with gradient learning algorithm for credit scoring, Istanbul University Journal of the School of Business, 44(2) (2015) 3-12.
- [2] N. C. Hsieh, An integrated data mining and behavioral scoring model for analyzing bank customers. Expert systems with applications, 27(4) (2004) 623-633.
- [3] X. Zhu, J. Li, D. Wu, H. Wang, C. Liang, Balancing accuracy, complexity and interpretability in consumer credit decision making: A C-TOPSIS classification approach, Knowledge-Based Systems, 52 (2013) 258-267.
- [4] F. L. Chen, F. C. Li, Combination of feature selection approaches with SVM in credit scoring, Expert Systems with Applications, 37(7) (2010) 4902-4909.
- [5] G. Wang, J. Hao, J. Ma, H. Jiang, A comparative assessment of ensemble learning for credit scoring, Expert Systems with Applications, 38(1) (2011) 223-230.
- [6] K. Bijak, L. C. Thomas, Does segmentation always improve model performance in credit scoring?, Expert Systems with Applications, 39(3) (2012) 2433–2442.
- [7] B. Baesens, R. Setiono, C. Mues, J. Vanthienen, Using neural network rule extraction and decision tables for credit-risk evaluation, Management Science, 49(3) (2003) 312-329.
- [8] J. R. Quinlan, C4.5: Programs for machine learning, San Francisco, CA, USA, 1993.
- [9] M. G. Tsipouras, T. P. Exarchos, D. I. Fotiadis, A methodology for automated fuzzy model generation, Fuzzy Sets and Systems, 159(23)
(2008) 3201-3220.
- [10] T. Harris, Credit scoring using the clustered support vector machine, Expert Systems with Applications, 42(2) (2015) 741–750.
- [11] J. M. Tomczak, M. Zieba, Classification restricted boltzmann machine for comprehensible credit scoring model, Expert System
Applications, 42(4) (2015) 1789– 1796.
- [12] F. C. Tsai, Combining cluster analysis with classifier ensembles to predict financial distress, Information Fusion, 16 (2014) 46-58.
- [13] A. B. Hens, M. K. Tiwari, Computational time reduction for credit scoring: An integrated approach based on support vector machine
and stratified sampling method, Expert Systems with Applications, 39(8) (2012) 6774–6781.
- [14] W. Chen, C. Ma, L. Ma, Mining the customer credit using hybrid support vector machine technique, Expert Systems with Applications,
36(4) , (2009) 7611–7616.
- [15] S. T. Lou, B. W. Cheng, C. H. Hsieh, Prediction model building with clustering-launched classification and support vector machines in
credit scoring, Expert Systems with Applications, 36(4) (2009) 7562-7566.
- [16] C. L. Huang, M. C. Chen, C. J. Wang, Credit scoring with a data mining approach based on support vector machines, Expert Systems
with Applications, 33(4) (2007) 847-856.
- [17] Z. Huang, H. C. Chen, C. J. Hsu, W. H. Chen, S. S. Wu, Credit rating analysis with support vector machines and neural networks: a
market comparative study, Decision Support Systems, 37 (2004) 543-558.
- [18] Y. Peng, G. Wang, G. Kou, Y. Shi, An empirical study of classification algorithm evaluation for financial risk prediction, Applied Soft
Computing, 11(2) (2011) 2906-2915.
- [19] H. Zhu, P. A. Beling, G. A. Overstreet, A Bayesian framework for the combination of classifier outputs, The Journal of the Operational
Research Society, 53(7) (2002) 719–727.
- [20] A. Bazmara, S. S. Donighi, Bank customer credit scoring by using fuzzy expert system, I.J. intelligent systems and applications, 11
(2014) 29-35.
- [21] J. Abellan, J. G. Catellano, A comparative study on base classifiers in ensemble methods for credit scoring, Expert Systems with
Applications, 73 (2017) 1-10.
- [22] Z. F. Liu, S. Pan, Fuzzy-Rough Instance Selection Combined with Effective Classifiers in Credit Scoring, Neural Processing Letters, 47(1)
(2018) 193-202.
- [23] M. B. Gorzalczany, F. Rudzinski, A multi-objective genetic optimization for fast, fuzzy rule-based credit classification with
balanced accuracy and interpretability, Applied Soft Computing, 40 (2016) 206-220.
- [24] J. J. Huang, G. H. Tzeng, C. S. Ong, Two-stage genetic programming (2SGP) for the credit scoring model, Applied Mathematics
and Computation, 174(2) (2006) 1039-1053.
- [25] K. Crockkett, Z. Bandar, D. Mclean, J. O’Shea, on constructing a fuzzy inference framework using crisp decision trees, Fuzzy Sets
and Systems, 157(21) (2006) 2809-2832.
- [26] M. Ala’raj, M. F. Abbod, Classifiers consensus system approach for credit scoring, Knowledge-Based Systems, 104 (2016) 89–105.
- [27] H. Xiao, Z. Xiao, Y. Wang, Ensemble classification based on supervised clustering for credit scoring. Applied Soft Computing, 43(C)
(2016) 73–86.
- [28] S. M. Sadatrasoul, M. Gholamian, M. Siami, Z. Hajimohammadi, Credit scoring in banks and financial institutions via data mining
techniques: A literature review. Journal of AI and Data Mining, 1(2) (2013) 119–129.
- [29] A. Marqués, V. García, J. S. Sánchez, Exploring the behaviour of base classifiers in credit scoring ensembles, Expert Systems with
Applications, 39(11) (2012) 10244–10250.
- [30] G. Wang, J. Ma, L. Huang, K. Xu, Two credit scoring models based on dual strategy ensemble trees, Knowledge-Based Systems, 26
(2012) 61–68.
- [31] C. Hung, J.H. Chen, A selective ensemble based on expected probabilities for bankruptcy prediction. Expert Systems with
Applications, 36(3, Part 1) (2009) 5297– 5303.
- [32] L. Nanni, A. Lumini, An experimental comparison of ensemble of classifiers for bankruptcy prediction and credit scoring, Expert
Systems with Applications, 36(2, Part2) (2009) 3028–3033.
- [33] L. Yu, S. Wang, K. K. Lai, Credit risk assessment with a multistage neural network ensemble learning approach, Expert Systems
with Applications, 34(2) (2008) 1434-1444.
- [34] O. Kocadagli, A Novel Hybrid Learning Algorithm For Full Bayesian Approach of Artificial Neural Networks, Applied Soft Computing,
35 (2015) 1 – 958.
- [35] C. Bishop, Neural networks for pattern recognition, Oxford university press, United Kingdom, (2010).
- [36] C. Ong, J. Huang, G. Tzeng, Building credit scoring models using genetic programming, Expert systems with applications, 29(1)
(2005) 41-47.
- [37] J. K. Sengupta, Measuring efficiency by a fuzzy statistical approach, Fuzzy Sets and Systems, 46(1) (1992)73-80.
- [38] O. Kocadagli, R. Langari, Classification of EEG signals for epileptic seizures using hybrid artificial neural networks based wavelet
transforms and fuzzy relations, Expert Systems with Applications, 88 (2017) 419-434.
- [39] K. K. Lai, L. Yu, S. Y. Wang, , L. G. Zhou, Credit risk analysis using a reliability-based neural network ensemble model, Lecture Notes
in Computer Science, 4132 (2006) 682-690.
- [40] O. Akbilgic, H. Bozdagan, A New Supervised Classification of Credit Approval Data via the Hybridized RBF Neural Network Model
Using Information Complexity, Data Science, Learning by Latent Structures and Knowledge Discovery, (2015) 13-27.
- [41] A. Blanco, R. Pino-Mejias, J. Lara, S. Rayo, Credit scoring models for the microfinance industry using neural networks: Evidence from
Peru, Expert systems with applications, 40 (2013) 356-364.
- [42] S. Oreski, D. Oreski, G. Oreski, Hybrid system with genetic algorithm and artificial neural networks and its application to retail
credit risk assessment, Expert systems with applications, 39 (2012) 12605-12617.
- [43] L. M. Silva, J. Marques de Sá, L. A. Alexandre Data classification with multilayer perceptrons using a generalized error function,
Neural Networks, 21 (2008) 1302 – 1310.
- [44] T. S. Lee, I. F. Chen, A two-stage hybrid credit scoring model using artificial neural networks and multivariate adaptive regression
splines, Expert systems with applications, 28(4) (2005) 743-752.
- [45] R. Malhotra, K. D. Malhotra, Evaluating consumer loans using neural networks, Omega & Science Direct, 31(2) (2003) 83-96.
- [46] D. West, Neural network credit scoring models, Computers & operations research, 27, 2000, pp. 1131-1152.
- [47] V. S. Desai, J. N. Crook, G. A. J. Overstreet, A comparison of neural networks and linear scoring models in the credit union
environment, European journal of operational research, 95 (1996) 24-37.
- [48] B. Mirkin, Mathematical classification and clustering. Kluwer Academic Publishers, 1996, pp. 74-76.
- [49] R. Saia, S. Carta, An Entropy Based Algorithm for Credit Scoring, Lecture Notes in Business Information Processing, Springer,
Cham, 268, 2016, pp. 263-276.
- [50] Q. Du, K. Nie, Z. Wang, Application of Entropy-Based Attribute Reduction and an Artificial Neural Network in Medicine: A Case
Study of Estimating Medical Care Costs Associated with Myocardial Infarction, Entropy, 16(9) (2014) 4788-4800.
- [51] H. Bozdogan, Akaike's information criterion and recent developments in information complexity, Journal of mathematical
psychology, 44(1) (2000) 62-91.
- [52] S. Geman, E. Bienenstock, R. Doursat, Neural Networks and the Bias/Variance Dilemma, Mass. Inst. Technol. 4 (1), 1992, pp. 1–58.
- [53] R. J. May, H. R. Maier, G. C. Dandy, T. M. K. G. Fernando, Non-linear variable selection for artificial neural networks using partial
mutual information. Environmental Modeling & Software, 23 (2008) 1312-1326.
- [54] M. Qi, G. P. Zhang, An investigation of model selection criteria for neural network time series forecasting, European journal of
operational research, 132 (2001) 666-680.
- [55] U. Anders, O. Korn, Model selection in neural networks. Neural networks, 12 (1999) 309-323.
- [56] N. Murata, S. Yoshizawa, S. Amari, Network information criterion-determining the number of hidden units for an artificial neural
network model. IEEE transactions on neural networks, 5(6) (1994) 865-872.
- [57] R. M. Golden, Mathematical methods for neural network analysis and design. The MIT press, England, 1996.
- [58] M. Moller, A scaled conjugate gradient algorithm for fast supervised learning, Neural networks, 6(4) (1993) 525-533.
- [59] A. Blanco, M. Delgado, I. Requena, Identification of fuzzy relational equations by fuzzy neural networks, Fuzzy Sets and Systems,
71(2) (1995) 215-226.
- [60] L. X. Wang, A Course in Fuzzy-Systems and Control, Prentice-Hall Inc, Eastbourne, 1997.
- [61] P. Liu, The fuzzy associative memory of max-min fuzzy neural network with threshold, Fuzzy Sets and Systems, 107(2) (1999) 147-157.
- [62] D. Dubois, P. Henri, Fundamentals of Fuzzy Sets, Kluwer Academic Publishers, Boston, 2000, pp. 233 – 288.
- [63] T. J. Ross, Fuzzy Logic with Engineering Applications, McGraw-Hill, Inc., Singapore, 2004.
- [64] L. Zadeh, Similarity Relations and Fuzzy Orderings, Information Sciences, 3(2) (1971) 177 – 200.
- [65] J. W. Rucklidge, Efficiently Locating Objects Using the Hausdorff Distance, International Journal of Computer Vision, 24(3) (1997)
251-270.
- [66] MachineLearningRepository,StatlogData,http://archive.ics.uci.edu/ml/datasets/Statlog+(Australian+Credit+Data).UCI.
- [67] MachineLearningRepository,StatlogData,http://archive.ics.uci.edu/ml/datasets/Statlog+(German+Credit+Data). UCI.