In Bayesian inference for mixture models with an unknown number of components, a finite mixture model is usually employed that assumes prior distributions for mixing weights and the number of components. This model is called a mixture of finite mixtures (MFM). As a prior distribution for the weights, a (symmetric) Dirichlet distribution is widely used for conjugacy and computational simplicity, while the selection of the concentration parameter influences the estimate of the number of components. In this paper, we focus on estimating the number of components. As a robust alternative Dirichlet weights, we present a method based on a mixture of finite mixtures with normalized inverse Gaussian weights. The motivation is similar to the use of normalized inverse Gaussian processes instead of Dirichlet processes for infinite mixture modeling. Introducing latent variables, the posterior computation is carried out using block Gibbs sampling without using the reversible jump algorithm. The performance of the proposed method is illustrated through some numerical experiments and real data examples, including clustering, density estimation, and community detection.
翻译:在具有未知组分数量的混合模型的贝叶斯推断中,通常采用有限混合模型,该模型对混合权重和组分数量设定先验分布。该模型被称为有限混合模型之混合。作为权重的先验分布,(对称)狄利克雷分布因其共轭性和计算简便性而被广泛使用,然而浓度参数的选择会影响组分数量的估计。本文聚焦于组分数量的估计问题。作为狄利克雷权重的一种稳健替代方案,我们提出了一种基于具有归一化逆高斯权重的有限混合模型之混合的方法。其动机类似于在无限混合建模中使用归一化逆高斯过程替代狄利克雷过程。通过引入潜变量,后验计算采用分块吉布斯采样执行,无需使用可逆跳转算法。所提方法的性能通过若干数值实验和实际数据示例(包括聚类、密度估计和社区检测)得以展示。