This paper addresses the task of modeling severity losses using segmentation when the distribution of the data does not fall into the usual regression frameworks. This is not an uncommon situation in lines of business such as third party liability insurance, where heavy-tails and multimodality often hamper a direct statistical analysis. We propose to use regression models based on phase-type distributions, regressing on their underlying inhomogeneous Markov intensity using an extension of the EM algorithm. These models are tailored for survival analysis, but are shown to be well suited for loss data modeling as well. Contrary to the common belief that the distribution of a model is irrelevant if we only wish to estimate the mean, we show that correct segmentation can be obscured or revealed depending on the quality of the baseline distribution.
翻译:本文论述在数据分布不纳入通常回归框架时使用分块法来模拟严重性损失的任务,在第三方责任保险等业务中,这种情况并非罕见,因为在第三方责任保险中,重尾和多式联运往往妨碍直接的统计分析。我们提议使用基于阶段类型分布的回归模型,利用EM算法扩展其内在的不相容的Markov强度,回归。这些模型是为生存分析而定制的,但被证明也非常适合损失数据建模。与通常的看法相反,即如果我们只想要估计平均值,模型的分布就无关。我们表明,根据基线分布的质量,正确的分块可被模糊或暴露。