Life assurance companies typically possess a wealth of data covering multiple systems and databases. These data are often used for analyzing the past and for describing the present. Taking account of the past, the future is mostly forecasted by traditional statistical methods. So far, only a few attempts were undertaken to perform estimations by means of machine learning approaches. In this work, the individual contract cancellation behavior of customers within two partial stocks is modeled by the aid of various classification methods. Partial stocks of private pension and endowment policy are considered. We describe the data used for the modeling, their structured and in which way they are cleansed. The utilized models are calibrated on the basis of an extensive tuning process, then graphically evaluated regarding their goodness-of-fit and with the help of a variable relevance concept, we investigate which features notably affect the individual contract cancellation behavior.
翻译:生命保障公司通常拥有涉及多个系统和数据库的大量数据,这些数据往往用于分析过去和描述现在。考虑到过去,大部分预测未来都是传统的统计方法。迄今为止,仅进行了几次尝试,通过机器学习方法进行估计。在这项工作中,两个部分库存中的客户个人取消合同的行为以各种分类方法的辅助方法为模型。考虑部分私人养恤金和捐赠政策库存。我们描述了用于建模的数据、其结构以及净化它们的方式。所使用模型是根据广泛的调校过程加以校准的,然后用图形来评价其优劣之处,并借助一个可变关联的概念,我们调查其中明显影响个别合同取消行为的特点。