Crucial Challenges In Corporate Credit Risk Assessment: A Case Study

Btıssam Hajjaouı

doi:10.47495/okufbed.1340798

TR EN

Kurumsal Kredi Riski Değerlendirmesinde Önemli Zorluklar: Bir Vaka Çalışması

Öz

Bu makale, müşterinin temerrüde düşüp düşmediğini gösteren değişkeni tahmin ederek kurumsal kredi riskini değerlendirmeyi amaçlamaktadır. Bu amaçla kullanılan veri seti, Türkiye'de finans sektörünün önde gelen kuruluşlarından birinden temin edilmiştir. Genel olarak başvuru sahibinin verileri, kurumsal veriler, hissedar verileri ve başvuru sahibinin alacaklının kurumundaki kredi geçmişine atıfta bulunan 401 değişkenden oluşur. Diğerlerinden girdi değişkenlerini belirleyerek ve ardından bu girdileri inceleyerek güçlü bir şekilde ilişkili değişkenleri ve neredeyse tamamen eksik veya sıfır değerlerden oluşan değişkenleri kullanmaktan kaçınarak bu çok sayıda değişkeni azaltırız. Veri kümesindeki birçok değişkenin çok fazla eksik girişi vardır, ancak bunun haklı sebepleri vardır. Bu sorunu çözmek için, hangi değişken grubunun hangi müşteriyle ilişkili olduğunu yansıtan yedi alt küme oluşturduk. Veri seti dengesiz, yaklaşık %96 temerrüt dışı örneklerden ve onaylanmış krediler arasında yalnızca yaklaşık %4 temerrüt örneklerinden oluşuyor. Bu yazıda, eğitim setlerindeki örnekleri dengelemek için üç örnekleme tekniği kullanıyoruz; alt örnekleme, yüksek örnekleme ve sentetik azınlık yüksek örnekleme tekniği ve altı sınıflandırıcı uyguluyoruz; Rastgele Orman, Naif Bayes, Lojistik Regresyon, Destek Vektör Makinesi, Karar Ağacı ve K-En Yakın Komşu. Bu tekniklerin performansını ölçmek için, sırasıyla çoğunluk sınıfı ve azınlık sınıfının ne kadar iyi tahmin edildiğini ölçmek için duyarlılık ve özgüllük kullanırız. Sonuç olarak, eş zamanlı olarak %50'den fazla duyarlılık ve özgüllük elde ettik; burada alt örnekleme tekniği azınlık sınıfı için en iyi örnekleme tekniğiydi ve sentetik azınlık yüksek örnekleme tekniği ve yüksek örnekleme çoğunluk sınıfı için daha iyi performans gösterdi.

Anahtar Kelimeler

Crucial Challenges In Corporate Credit Risk Assessment: A Case Study

Abstract

This article aims to assess corporate credit risk by predicting the variable that indicates whether the customer has defaulted or not. The dataset used for this purpose is obtained from one of the leading institutions in the finance sector in Türkiye. It consists of 401 variables generally referring to the applicant's data, corporate data, shareholder data, and the applicant's credit history within the creditor's institution. We reduce this large number of variables by identifying the input variables from the others and then studying those inputs to avoid using strongly correlated variables and variables consisting almost entirely of missing or zero values. Many variables in the dataset have too many missing entries but for justifiable reasons. To solve this issue, we created seven subsets to reflect which group of variables relates to which customer. The dataset is imbalanced, consisting of about 96% non-default instances and only around 4% default instances among approved loans. In this paper, we use three sampling techniques to balance the instances in the training sets; under-sampling, oversampling, and synthetic minority oversampling technique, and we apply six classifiers; Random Forest, Naïve Bayes, Logistic Regression, Support Vector Machine, Decision Tree, and K-Nearest Neighbor. To measure the performance of these techniques, we use sensitivity and specificity to measure how well the majority class and minority class were respectively predicted. As a result, we simultaneously achieved greater than 50% sensitivity and specificity, where the under-sampling technique was the best sampling technique for the minority class, and the synthetic minority oversampling technique and oversampling performed better for the majority class.

Keywords

References

Abdou HA., Pointon J. Credit scoring, statistical techniques and evaluation criteria: a review of the literature. Intelligent Systems in Accounting, Finance and Management 2011; 18(2-3): 59-88.
Chawla NV., Bowyer KW., Hall LO., Kegelmeyer WP. SMOTE: synthetic minority over-sampling technique. Journal of Artificial Intelligence Research 2002; 16: 321-357.
Chen S., Webb GI., Liu L., Ma X. A novel selective naïve Bayes algorithm. Knowledge-Based Systems 2020; 192: 105361.
De Ville B. Decision trees. Wiley Interdisciplinary Reviews: Computational Statistics 2013; 5(6): 448-455.
Elreedy D., Atiya AF. A comprehensive analysis of synthetic minority oversampling technique (SMOTE) for handling class imbalance. Information Sciences 2019; 505: 32-64.
Ghosh D., Vogt A. Outliers: An evaluation of methodologies. Joint statistical meetings, July 2012, page no: 3455-3460, United States.
Hearst MA., Dumais ST., Osuna E., Platt J., Scholkopf B. Support vector machines. IEEE Intelligent Systems and Their Applications 1998; 13(4): 18-28.
Hodge V., Austin J. A survey of outlier detection methodologies. Artificial Intelligence Review 2004; 22: 85-126. Ishwaran H. The effect of splitting on random forests. Machine Learning 2015; 99(1): 75-118.

Details

Primary Language

English

Subjects

Industrial Engineering

Journal Section

Research Article

Authors

Btıssam Hajjaouı ^*
0000-0003-0851-3428
Türkiye

Publication Date

March 11, 2024

Submission Date

August 10, 2023

Acceptance Date

January 11, 2024

Published in Issue

Year 2024 Volume: 7 Number: 2

DOI

https://doi.org/10.47495/okufbed.1340798

IZ

https://izlik.org/JA39NC45PB

Cite

RIS / Bibtex

APA

Hajjaouı, B. (2024). Crucial Challenges In Corporate Credit Risk Assessment: A Case Study. Osmaniye Korkut Ata Üniversitesi Fen Bilimleri Enstitüsü Dergisi, 7(2), 834-854. https://doi.org/10.47495/okufbed.1340798

AMA

1.Hajjaouı B. Crucial Challenges In Corporate Credit Risk Assessment: A Case Study. Osmaniye Korkut Ata University Journal of The Institute of Science and Techno. 2024;7(2):834-854. doi:10.47495/okufbed.1340798

Chicago

Hajjaouı, Btıssam. 2024. “Crucial Challenges In Corporate Credit Risk Assessment: A Case Study”. Osmaniye Korkut Ata Üniversitesi Fen Bilimleri Enstitüsü Dergisi 7 (2): 834-54. https://doi.org/10.47495/okufbed.1340798.

EndNote

Hajjaouı B (March 1, 2024) Crucial Challenges In Corporate Credit Risk Assessment: A Case Study. Osmaniye Korkut Ata Üniversitesi Fen Bilimleri Enstitüsü Dergisi 7 2 834–854.

IEEE

[1]B. Hajjaouı, “Crucial Challenges In Corporate Credit Risk Assessment: A Case Study”, Osmaniye Korkut Ata University Journal of The Institute of Science and Techno, vol. 7, no. 2, pp. 834–854, Mar. 2024, doi: 10.47495/okufbed.1340798.

ISNAD

Hajjaouı, Btıssam. “Crucial Challenges In Corporate Credit Risk Assessment: A Case Study”. Osmaniye Korkut Ata Üniversitesi Fen Bilimleri Enstitüsü Dergisi 7/2 (March 1, 2024): 834-854. https://doi.org/10.47495/okufbed.1340798.

JAMA

1.Hajjaouı B. Crucial Challenges In Corporate Credit Risk Assessment: A Case Study. Osmaniye Korkut Ata University Journal of The Institute of Science and Techno. 2024;7:834–854.

MLA

Hajjaouı, Btıssam. “Crucial Challenges In Corporate Credit Risk Assessment: A Case Study”. Osmaniye Korkut Ata Üniversitesi Fen Bilimleri Enstitüsü Dergisi, vol. 7, no. 2, Mar. 2024, pp. 834-5, doi:10.47495/okufbed.1340798.

Vancouver

1.Btıssam Hajjaouı. Crucial Challenges In Corporate Credit Risk Assessment: A Case Study. Osmaniye Korkut Ata University Journal of The Institute of Science and Techno. 2024 Mar. 1;7(2):834-5. doi:10.47495/okufbed.1340798