Predicting the bankruptcy risk of small and medium-sized enterprises (SMEs) is an important step for financial institutions when making decisions about loans. Existing studies in both finance and AI research fields, however, tend to only consider either the intra-risk or contagion risk of enterprises, ignoring their interactions and combinatorial effects. This study for the first time considers both types of risk and their joint effects in bankruptcy prediction. Specifically, we first propose an enterprise intra-risk encoder based on statistically significant enterprise risk indicators for its intra-risk learning. Then, we propose an enterprise contagion risk encoder based on enterprise relation information from an enterprise knowledge graph for its contagion risk embedding. In particular, the contagion risk encoder includes both the newly proposed Hyper-Graph Neural Networks and Heterogeneous Graph Neural Networks, which can model contagion risk in two different aspects, i.e. common risk factors based on hyperedges and direct diffusion risk from neighbors, respectively. To evaluate the model, we collect real-world multi-sources data on SMEs and build a novel benchmark dataset called SMEsD. We provide open access to the dataset, which is expected to further promote research on financial risk analysis. Experiments on SMEsD against twelve state-of-the-art baselines demonstrate the effectiveness of the proposed model for bankruptcy prediction.
翻译:预测中小企业的破产风险是金融机构在就贷款作出决定时采取的一个重要步骤。但是,金融和大赦国际研究领域的现有研究往往只考虑企业的风险内部风险或传染风险,忽视企业的相互作用和组合效应。本研究首次考虑风险的两种类型及其在破产预测中的共同影响。具体地说,我们首先根据具有统计意义的企业风险指标提出企业内部编码器,用于进行风险内部学习。然后,我们根据企业知识图中企业关系信息提出企业传染风险编码器,用于其传染风险的嵌入。特别是,传染风险编码器既包括新提议的超显性神经网络,也包括异质型图形神经网络,它们可以在两个不同方面作为传染风险的模型,即分别基于高端和邻国直接扩散风险的共同风险因素。为了评估模型,我们收集了有关中小企业的现实多源数据,并建立了一个称为中小企业的新型基准数据集。我们提供公开查阅数据的机会,以了解拟议的超显性神经网络和超致变形图形的网络,从而进一步推进对12个中小企业进行银行风险预测的模型进行试点研究。我们预计,将利用12个中小企业的模型,以进一步推进对12个企业进行试点的模型进行试点。