Accuracy and interpretability of a (non-life) insurance pricing model are essential qualities to ensure fair and transparent premiums for policy-holders, that reflect their risk. In recent years, the classification and regression trees (CARTs) and their ensembles have gained popularity in the actuarial literature, since they offer good prediction performance and are relatively easily interpretable. In this paper, we introduce Bayesian CART models for insurance pricing, with a particular focus on claims frequency modelling. Additionally to the common Poisson and negative binomial (NB) distributions used for claims frequency, we implement Bayesian CART for the zero-inflated Poisson (ZIP) distribution to address the difficulty arising from the imbalanced insurance claims data. To this end, we introduce a general MCMC algorithm using data augmentation methods for posterior tree exploration. We also introduce the deviance information criterion (DIC) for the tree model selection. The proposed models are able to identify trees which can better classify the policy-holders into risk groups. Some simulations and real insurance data will be discussed to illustrate the applicability of these models.
翻译:保险(非终身)定价模式的准确性和可解释性是确保投保人公平和透明的保费、反映其风险的基本品质。近年来,分类和回归树及其集合在精算文献中越来越受欢迎,因为这些树具有良好的预测性,而且相对容易解释。在本文件中,我们介绍了贝叶西亚保险定价CART模式,特别侧重于索赔频率建模。除了用于索赔频率的普瓦森和负双向(NB)分布外,我们还为零膨胀Poisson(ZIP)分布采用了Bayesian CART,以解决因保险索赔数据不平衡而产生的困难。为此,我们采用一般MCMC算法,使用数据增强方法进行后树勘探。我们还采用了树模型选择的偏离信息标准(DIC)。拟议的模型能够确定哪些树木能够更好地将投保人分类为风险群体。将讨论一些模拟和真实保险数据,以说明这些模型的适用性。</s>