Claim frequency data in insurance records the number of claims on insurance policies during a finite period of time. Given that insurance companies operate with multiple lines of insurance business where the claim frequencies on different lines of business are often correlated, multivariate count modeling with dependence for claim frequency is therefore essential. Due in part to the operation of bonus-malus systems, claims data in automobile insurance are often characterized by an excess of common zeros. This feature is referred to as multivariate zero-inflation. In this paper, we establish two ways of dealing with this feature. The first is to use a multivariate zero-inflated model, where we artificially augment the probability of common zeros based on standard multivariate count distributions. The other is to apply a multivariate zero-modified model, which deals with the common zeros and the number of claims incurred in each line, given that at least one claim occurs separately. A comprehensive comparative analysis of several models under these two frameworks is conducted using the data of an automobile insurance portfolio from a major insurance company in Spain. A less common situation in insurance is the absence of some common zeros resulting from incomplete records. This feature of these data is known as multivariate zero-deflation. In this case, our proposed multivariate zero-modified model still works, as shown by the second empirical study.
翻译:保险中的索赔频率数据记录了在一定时期内保险保单索赔的数量。鉴于保险公司经营的保险业务涉及多个保险业务,不同业务中的索赔频率往往相互关联,因此,必须采用多变计数模式和依赖索赔频率的模式。部分由于奖金-综合系统的运作,汽车保险中的索赔数据通常具有共同零数以上的特点。这一特点被称为多变零通货膨胀。在本文中,我们确定了处理这一特征的两种方法。首先,采用多变零膨胀模式,我们人为地增加基于标准多变数计数分布的通用零的概率。另一个是采用多变零模式,处理每行的通用零数和索赔数目,因为至少有一项索赔是分开提出的。在这两个框架内对若干模式进行全面比较分析,使用西班牙一家大保险公司的汽车保险组合数据进行。保险中较不常见的情况是缺乏基于标准多变数计数计数点计数分布的通用零概率概率。根据不全数记录而人为地增加一些通用零概率的概率。这个模型的特征是零变数模型,目前作为零变数研究的模型显示的模型。