Gibbs sampling methods for mixture models are based on data augmentation schemes that account for the unobserved partition in the data. Conditional samplers rely on allocation variables that identify each observation with a mixture component. They are known to suffer from slow mixing in infinite mixtures, where some form of truncation, either deterministic or random, is required. In mixtures with random number of components, the exploration of parameter spaces of different dimensions can also be challenging. We tackle these issues by expressing the mixture components in the random order of appearance in an exchangeable sequence directed by the mixing distribution. We derive a sampler that is straightforward to implement for mixing distributions with tractable size-biased ordered weights. In infinite mixtures, no form of truncation is necessary. As for finite mixtures with random dimension, a simple updating of the number of components is obtained by a blocking argument, thus, easing challenges found in trans-dimensional moves via Metropolis-Hasting steps. Additionally, the latent clustering structure of the model is encrypted by means of an ordered partition with blocks labelled in the least element order, which mitigates the label-switching problem. We illustrate through a simulation study the good mixing performance of the new sampler.
翻译:混合物模型的Gibbs抽样方法基于数据增强计划,其中考虑到数据中未观测到的分区。有条件采样器依赖分配变量,该变量可以识别每种观测都含有混合物的混合物。已知它们会受到无限混合物中缓慢混合的影响,需要某种形式的脱轨,无论是确定性的还是随机的。在含有随机数量成分的混合物中,探索不同维度的参数空间也可能具有挑战性。我们通过在混合分布所指示的可互换序列中以随机的外观顺序表达混合物成分来解决这些问题。我们产生一个采样器,该采样器可以直接用于使用可移动大小偏移的定重重量混合分布。在无限混合物中,不需要任何形式的脱轨。对于具有随机尺寸的定型混合物,只需通过阻塞参数来简单更新部件的数量,从而缓解在通过Metopolis-Hasting步骤的跨维移动过程中发现的挑战。此外,模型的潜伏聚集结构是通过一个定型分区来加密的,以最小元素顺序标定的区块进行加密,从而减轻标签转换问题。我们通过模拟的样品来说明良好的混合。