AI Chat Paper
Note: Please note that the following content is generated by AMiner AI. SciOpen does not take any responsibility related to this content.
{{lang === 'zh_CN' ? '文章概述' : 'Summary'}}
{{lang === 'en_US' ? '中' : 'Eng'}}
Chat more with AI
PDF (474.3 KB)
Collect
Submit Manuscript AI Chat Paper
Show Outline
Outline
Show full outline
Hide outline
Outline
Show full outline
Hide outline
Open Access

Cross-Domain Credit Default Prediction via Interpretable Ensemble Transfer

Zhida Shang1Hefeng Meng1Yibowen Zhao1Ronghua Xu2Yonghui Xu1( )Lizhen Cui1
Joint SDU-NTU Centre for Artificial Intelligence Research (C-FAIR) & Software School, Shandong University, Jinan 250100, China
Business School, East China University of Political Science and Law, Shanghai 201620, China
Show Author Information

Abstract

The evaluation and prediction of credit risk have always been a research hotspot to ensure the healthy and orderly development of the credit market. Most researchers use deep learning to predict credit risk. However, when training data are too small, deep learning models often lead to overfitting. Although we have a large amount of available training data, we often cannot ensure that the data are evenly distributed, which is still not conducive to model training. In addition, deep learning is often difficult to explain, and the unexplained model is often difficult to gain the trust of users, thus reducing the usefulness of the model. To solve these problems, we propose an integrated cross-domain credit default prediction network, called Transfer Light Gradient Boosting Machine (TrLightGBM), based on interpretable integration transfer. This network considers the weight of data from different domains in training and implements cross-domain credit default prediction by adjusting the weight. The experiment shows that our method TrLightGBM not only achieves the interpretability of the model to a certain extent but also has good performance.

References

[1]
X. Hui and Y. S. Gang, Using clustering-based bagging ensemble for credit scoring, in Proc. 2011 Int. Conf. Business Management and Electronic Information, Guangzhou, China, 2011, pp. 369–371.
[2]

Y. Wang, S. Wang, and K. K. Lai, A new fuzzy support vector machine to evaluate credit risk, IEEE Trans. Fuzzy Syst., vol. 13, no. 6, pp. 820–831, 2005.

[3]
A. Lawi and F. Aziz, Classification of credit card default clients using LS-SVM ensemble, in Proc. 2018 Third Int. Conf. Informatics and Computing (ICIC), Palembang, Indonesia, 2018, pp. 1–4.
[4]

T. M. Alam, K. Shaukat, I. A. Hameed, S. Luo, M. U. Sarwar, S. Shabbir, J. Li, and M. Khushi, An investigation of credit card default prediction in the imbalanced datasets, IEEE Access, vol. 8, pp. 201173–201198, 2020.

[5]

P. Xu, Z. Ding, and M. Pan, A hybrid interpretable credit card users default prediction model based on RIPPER, Concurr. Comput., vol. 30, no. 23, p. e4445, 2018.

[6]
K. I. Alkhatib, A. I. Al-Aiad, M. H. Almahmoud, and O. N. Elayan, Credit card fraud detection based on deep neural network approach, in Proc. 2021 12th Int. Conf. Information and Communication Systems (ICICS), Valencia, Spain, 2021, pp. 153–156.
[7]
T. C. Hsu, S. T. Liou, Y. P. Wang, Y. S. Huang, and C. Lin, Enhanced recurrent neural network for combining static and dynamic features for credit card default prediction, in Proc. ICASSP 2019-2019 IEEE Int. Conf. Acoustics, Speech and Signal Processing (ICASSP), Brighton, UK, 2019, pp. 1572–1576.
[8]
J. Xiao, R. Wang, G. Teng, and Y. Hu, A transfer learning based classifier ensemble model for customer credit scoring, in Proc. 2014 Seventh Int. Joint Conf. Computational Sciences and Optimization, Beijing, China, 2014, pp. 64–68.
[9]
Y. Wang and H. Leng, Credit decision of SMEs based on improved TOPSIS and decision tree, in Proc. 2021 4th Int. Conf. Advanced Electronic Materials, Computers and Software Engineering (AEMCSE), Changsha, China, 2021, pp. 489–492.
[10]
A. A. Khine and H. W. Khin, Credit card fraud detection using online boosting with extremely fast decision tree, in Proc. 2020 IEEE Conf. Computer Applications (ICCA), Yangon, Myanmar, 2020, pp. 1–4.
[11]
C. Manapragada, G. I. Webb, and M. Salehi, Extremely fast decision tree, in Proc. 24th ACM SIGKDD Int. Conf. Knowledge Discovery & Data Mining, London, UK, 2018, pp. 1953–1962.
[12]
Y. Jiang, X. Zhou, and D. Zhang, A new approach based on a rough set and a decision tree to bank customer credit evaluation, in Proc. 2008 IEEE Int. Symp. on IT in Medicine and Education, Xiamen, China, 2008, pp. 61–65.
[13]
Y. Z. Wang and X. B. Pei, A fast algorithm for reduction based on skowron discernibility matrix, (in Chinese), Compute Science, vol. 32, no. 4, pp. 42−44, 2005.
[14]
J. R. Quinlan, C4.5: Program for Machine Learning. San Francisco, CA, USA: Morgan Kaufmann Publishers Inc., 1993.
[15]
Y. Jiang, Credit scoring model based on the decision tree and the simulated annealing algorithm, in Proc. 2009 WRI World Congress on Computer Science and Information Engineering, Los Angeles, CA, USA, 2009, pp. 18–22.
[16]
J. Lang and J. Sun, Sensitivity of decision tree algorithm to class-imbalanced bank credit risk early warning, in Proc. 2014 Seventh Int. Joint Conf. Computational Sciences and Optimization, Beijing, China, 2014, pp. 539–543.
[17]
J. Bozsik and G. Körmendi, Decision tree-based credit decision support system, in Proc. 3rd IEEE Int. Symp. on Logistics and Industrial Informatics, Budapest, Hungary, 2011, pp. 189–194.
[18]
G. Wei, S. Yingjie, and Y. X. Mu, Commercial bank credit risk evaluation method based on decision tree algorithm, in Proc. 2015 Seventh Int. Conf. Measuring Technology and Mechatronics Automation, Nanchang, China, 2015, pp. 285–288.
[19]

Y. Freund and R. E. Schapire, A decision-theoretic generalization of on-line learning and an application to boosting, J. Comput. Syst. Sci., vol. 55, no. 1, pp. 119–139, 1997.

[20]
G. Ke, Q. Meng, T. Finley, T. Wang, W. Chen, W. Ma, Q. Ye, and T. Y. Liu, LightGBM: A highly efficient gradient boosting decision treek, in Proc. 31st Int. Conf. Neural Information Processing Systems, Long Beach, CA, USA, 2017, pp. 3149–3157.
[21]
H. Feng, Ensemble learning in credit card fraud detection using boosting methods, in Proc. 2021 2nd Int. Conf. Computing and Data Science (CDS), Stanford, CA, USA, 2021, pp. 7–11.
[22]
G. Rushin, C. Stancil, M. Sun, S. Adams, and P. Beling, Horse race analysis in credit card fraud—Deep learning, logistic regression, and gradient boosted tree, in Proc. 2017 Systems and Information Engineering Design Symp. (SIEDS), Charlottesville, VA, USA, 2017, pp. 117–121.
[23]

A. A. Taha and S. J. Malebary, An intelligent approach to credit card fraud detection using an optimized light gradient boosting machine, IEEE Access, vol. 8, pp. 25579–25587, 2020.

[24]
B. Bayram, B. Köroğlu, and M. Gönen, Improving fraud detection and concept drift adaptation in credit card transactions using incremental gradient boosting trees, in Proc. 2020 19th IEEE Int. Conf. Machine Learning and Applications (ICMLA), Miami, FL, USA, 2021, pp. 545–550.
[25]

E. Ileberi, Y. Sun, and Z. Wang, Performance evaluation of machine learning methods for credit card fraud detection using SMOTE and AdaBoost, IEEE Access, vol. 9, pp. 165286–165294, 2021.

[26]
W. Dai, Q. Yang, G. R. Xue, and Y. Yu, Boosting for transfer learning, in Proc. 24th Int. Conf. Machine learning, Corvalis, OR, USA, 2007, pp. 193–200.
International Journal of Crowd Science
Pages 106-112
Cite this article:
Shang Z, Meng H, Zhao Y, et al. Cross-Domain Credit Default Prediction via Interpretable Ensemble Transfer. International Journal of Crowd Science, 2023, 7(3): 106-112. https://doi.org/10.26599/IJCS.2023.9100011

399

Views

28

Downloads

1

Crossref

1

Scopus

Altmetrics

Received: 30 December 2022
Revised: 23 May 2023
Accepted: 29 May 2023
Published: 30 September 2023
© The author(s) 2023.

The articles published in this open access journal are distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).

Return