预测无担保贷款的信用风险:机构学习方法 (Predicting Credit Risk for Unsecured Lending: A Machine Learning Approach)

Since the 1990s, there have been significant advances in the technology space and the e-Commerce area, leading to an exponential increase in demand for cashless payment solutions. This has led to increased demand for credit cards, bringing along with it the possibility of higher credit defaults and hence higher delinquency rates, over a period of time. The purpose of this research paper is to build a contemporary credit scoring model to forecast credit defaults for unsecured lending (credit cards), by employing machine learning techniques. As much of the customer payments data available to lenders, for forecasting Credit defaults, is imbalanced (skewed), on account of a limited subset of default instances, this poses a challenge for predictive modelling. In this research, this challenge is addressed by deploying Synthetic Minority Oversampling Technique (SMOTE), a proven technique to iron out such imbalances, from a given dataset. On running the research dataset through seven different machine learning models, the results indicate that the Light Gradient Boosting Machine (LGBM) Classifier model outperforms the other six classification techniques. Thus, our research indicates that the LGBM classifier model is better equipped to deliver higher learning speeds, better efficiencies and manage larger data volumes. We expect that deployment of this model will enable better and timely prediction of credit defaults for decision-makers in commercial lending institutions and banks.

翻译：自1990年代以来,技术空间和电子商业领域取得了显著进步,导致对无现金支付解决方案的需求急剧增加,从而导致对无现金支付解决方案的需求急剧增加,从而导致对信用卡的需求增加,同时在一段时间内出现更高的信用违约率和因此更高的犯罪率。本研究论文的目的是利用机器学习技术,建立一个当代信用评分模型,以预测无担保贷款(信用卡)的信用违约率。在预测信用违约率时,贷方可获得的客户支付数据中有许多数据不平衡(偏斜),这给预测性模型带来了挑战。在这一研究中,这一挑战通过使用合成少数群体过度采样技术(SMOTE)(SMOTE)(SMOTE)(SMOTE)(SMOTE)(SMOTE)(这是从给定数据集中消除这种失衡的一种经实践证明的技术)得到解决。在通过七个不同的机器学习模型管理研究数据集时,结果显示,轻重推力推导机(LGBBM)的分解模型比其他六种分类技术更完美。因此,我们的研究显示,LGBBM的升级模型和高额预测机构将更能更及时地进行更精确地预测。