In the peer to peer (P2P) lending platform, investors hope to maximize their return while minimizing the risk through a comprehensive understanding of the P2P market. A low and stable average default rate across all the borrowers denotes a healthy P2P market and provides investors more confidence in a promising investment. Therefore, having a powerful model to describe the trend of the default rate in the P2P market is crucial. Different from previous studies that focus on modeling the default rate at the individual level, in this paper, we are the first to comprehensively explore the monthly trend of the default rate at the aggregative level for the P2P data from October 2007 to January 2016 in the US. We use the long short term memory (LSTM) approach to sequentially predict the default risk of the borrowers in Lending Club, which is the largest P2P lending platform in the US. Although being first applied in modeling the P2P sequential data, the LSTM approach shows its great potential by outperforming traditionally utilized time series models in our experiments. Furthermore, incorporating the macroeconomic feature \textit{unemp\_rate} (i.e., unemployment rate) can improve the LSTM performance by decreasing RMSE on both the training and the testing datasets. Our study can broaden the applications of the LSTM algorithm by using it on the sequential P2P data and guide the investors in making investment strategies.
翻译:在同侪(P2P)贷款平台中,投资者希望通过全面了解P2P市场来最大限度地实现回报最大化,同时尽量减少风险。所有借款人的平均违约率低且稳定,意味着健康的P2P市场,使投资者对有希望的投资更有信心。因此,拥有一个强有力的模型来描述P2P市场违约率的趋势至关重要。与以往侧重于在个人层面建立违约率模型的研究不同,在本文件中,我们首先全面探讨2007年10月至2016年1月美国P2P数据在分类层面上的每月违约率趋势。我们使用长期短期记忆(LSTM)方法来连续预测贷款俱乐部借款人的违约风险,这是美国最大的P2P贷款平台。LSTM方法虽然首次用于模拟P2P相继数据,但通过在我们的实验中超越传统使用的时间序列模型模型来显示其巨大的潜力。此外,我们采用长期的宏观经济特征(Textit{NepüPrate}(i,扩大LTM) 和LS Airal 应用中的数据测试方法可以改进我们LSAS 数据库数据库的运行情况,同时在测试LSAS 上不断降低数据。