The task of modeling claim severities is addressed when data is not consistent with the classical regression assumptions. This framework is common in several lines of business within insurance and reinsurance, where catastrophic losses or heterogeneous sub-populations result in data difficult to model. Their correct analysis is required for pricing insurance products, and some of the most prevalent recent specifications in this direction are mixture-of-experts models. This paper proposes a regression model that generalizes the latter approach to the phase-type distribution setting. More specifically, the concept of mixing is extended to the case where an entire Markov jump process is unobserved and where states can communicate with each other. The covariates then act on the initial probabilities of such underlying chain, which play the role of expert weights. The basic properties of such a model are computed in terms of matrix functionals, and denseness properties are derived, demonstrating their flexibility. An effective estimation procedure is proposed, based on the EM algorithm and multinomial logistic regression, and subsequently illustrated using simulated and real-world datasets. The increased flexibility of the proposed models does not come at a high computational cost, and the motivation and interpretation are equally transparent to simpler MoE models.
翻译:如果数据与古典回归假设不一致,则处理索赔分离的建模任务。这个框架在保险和再保险内部的若干业务线上很常见,因为灾难性损失或混杂子人口导致数据难以建模。对保险产品定价需要进行正确的分析,而这方面的最近一些最普遍的规格则是专家混合模型。本文提议了一个回归模型,将后一种方法概括到阶段型分布设置中。更具体地说,混合的概念扩大到整个Markov跳跃过程没有观测到,而且各国可以相互交流。共同变量随后根据这种基础链的初始概率采取行动,这种链条的作用是专家权重。这种模型的基本特性是按矩阵功能计算,密度特性是生成的,显示了它们的灵活性。根据EM算法和多数值物流回归,提出了有效的估算程序,随后用模拟和真实世界数据集加以说明。拟议模型的更大灵活性并不是在高计算成本上产生的,更简单的模型是透明的和解释。