The objective of this paper is to explore how financial big data and machine learning methods can be applied to model and understand financial products. We focus on residential mortgage backed securities, resMBS, which were at the heart of the 2008 US financial crisis. These securities are contained within a prospectus and have a complex waterfall payoff structure. Multiple financial institutions form a supply chain to create prospectuses. To model this supply chain, we use unsupervised probabilistic methods, particularly dynamic topics models (DTM), to extract a set of features (topics) reflecting community formation and temporal evolution along the chain. We then provide insight into the performance of the resMBS securities and the impact of the supply chain through a series of increasingly comprehensive models. First, models at the security level directly identify salient features of resMBS securities that impact their performance. We then extend the model to include prospectus level features and demonstrate that the composition of the prospectus is significant. Our model also shows that communities along the supply chain that are associated with the generation of the prospectuses and securities have an impact on performance. We are the first to show that toxic communities that are closely linked to financial institutions that played a key role in the subprime crisis can increase the risk of failure of resMBS securities.
翻译:本文的目的是探讨如何将金融巨头数据和机器学习方法应用于金融产品的模型和理解。我们侧重于作为2008年美国金融危机核心的住宅抵押担保证券,即ResMBS。这些证券包含在计划书中,并且具有复杂的水瀑补偿结构。多个金融机构组成供应链以创造前景。为了模拟这一供应链,我们使用不受监督的概率方法,特别是动态专题模型(DTM),以提取一系列反映社区形成和时空演变的特征(专题),然后通过一系列日益全面的模型,深入了解RMBS证券的绩效和供应链的影响。首先,在安全层面的模型直接确定影响其业绩的ResMBS证券的显著特征。然后,我们扩展模型,以包括前景水平特征,并表明前景模型的构成很重要。我们的模型还表明,与创造前景和证券有关的供应链各界对业绩产生了影响。我们首先通过一系列日益全面的模型显示,有毒的证券群体在MB的证券危机中会增加金融风险。