Mixture distributions with dynamic weights are an efficient way of modeling loss data characterized by heavy tails. However, maximum likelihood estimation of this family of models is difficult, mostly because of the need to evaluate numerically an intractable normalizing constant. In such a setup, simulation-based estimation methods are an appealing alternative. The approximate maximum likelihood estimation (AMLE) approach is employed. It is a general method that can be applied to mixtures with any component densities, as long as simulation is feasible. The focus is on the dynamic lognormal-generalized Pareto distribution, and the Cram\'er - von Mises distance is used to measure the discrepancy between observed and simulated samples. After deriving the theoretical properties of the estimators, a hybrid procedure is developed, where standard maximum likelihood is first employed to determine the bounds of the uniform priors required as input for AMLE. Simulation experiments and two real-data applications suggest that this approach yields a major improvement with respect to standard maximum likelihood estimation.
翻译:基于Cramér-von Mises距离的近似最大似然算法的无监督混合估计
翻译后的摘要:
混合分布与动态权重是建模重尾损失数据的有效方法。然而,这种模型族的最大似然估计很难,主要是因为需要数值地评估无法计算的归一化常数。在这种情况下,基于仿真的估计方法成为一种有吸引力的替代方案。本文采用近似最大似然估计方法,它是一种通用方法,可以应用于任何组成分布的混合分布,只要仿真可行。重点介绍了动态对数正态广义Pareto分布,并使用Cramér-von Mises距离来度量观测样本和仿真样本之间的差异。在推导出评估器的理论性质之后,我们开发了一种混合过程,首先采用标准最大似然方法确定AMLE所需的均匀先验的边界。仿真实验证明,这种方法相对于标准的最大似然估计方法显著提高了估计精度。