The objective of this paper is to explore how financial big data and machine learning methods can be applied to model and understand complex financial products. We focus on residential mortgage backed securities, resMBS, that were at the heart of the 2008 US financial crisis. The securities are contained within a prospectus and have a complex payoff structure. Multiple financial institutions form a supply chain to create the prospectuses. We provide insight into the performance of the resMBS securities through a series of increasingly complex models. First, models at the security level directly identify salient features of resMBS securities that impact their performance. Second, we extend the model to include prospectus level features. We are the first to demonstrate that the composition of the prospectus is associated with the performance of securities. Finally, to develop a deeper understanding of the role of the supply chain, we use unsupervised probabilistic methods, in particular, dynamic topics models (DTM), to understand community formation and temporal evolution along the chain. A comprehensive model provides insight into the impact of DTM communities on the issuance and evolution of prospectuses, and eventually the performance of resMBS securities.
翻译:本文的目的是探讨如何将金融大数据和机器学习方法应用于模拟和理解复杂的金融产品。我们侧重于2008年美国金融危机的核心住宅抵押抵押抵押证券,即ResMBS。证券包含在计划书中,并有一个复杂的支付结构。多个金融机构组成供应链以创造前景。我们通过一系列日益复杂的模型,深入了解再抵押证券的绩效。首先,安全层面的模型直接查明了影响其业绩的再抵押证券的显著特征。第二,我们扩展了模型,以包括期货等级特征。我们首先证明,期货的构成与证券的表现相关联。最后,为了更深入了解供应链的作用,我们使用了非超强的概率性方法,特别是动态专题模型(DTM),以了解链条下社区形成和时间演变情况。一个全面模型可以深入了解DTM社区对期货的发行和演变的影响,并最终揭示了再抵押证券的性能。