Bayesian models based on the Dirichlet process and other stick-breaking priors have been proposed as core ingredients for clustering, topic modeling, and other unsupervised learning tasks. However, due to the flexibility of these models, the consequences of prior choices can be opaque. And so prior specification can be relatively difficult. At the same time, prior choice can have a substantial effect on posterior inferences. Thus, considerations of robustness need to go hand in hand with nonparametric modeling. In the current paper, we tackle this challenge by exploiting the fact that variational Bayesian methods, in addition to having computational advantages in fitting complex nonparametric models, also yield sensitivities with respect to parametric and nonparametric aspects of Bayesian models. In particular, we demonstrate how to assess the sensitivity of conclusions to the choice of concentration parameter and stick-breaking distribution for inferences under Dirichlet process mixtures and related mixture models. We provide both theoretical and empirical support for our variational approach to Bayesian sensitivity analysis.
翻译:根据Drichlet进程和其他破碎的前题,提出了基于Drichlet进程和其他前题的Bayesian模型,作为集群、专题建模和其他不受监督的学习任务的核心要素。然而,由于这些模型具有灵活性,先前选择的后果可能不透明。因此,先前的规格可能相对困难。与此同时,事先选择可能对后推推论产生重大影响。因此,需要结合非参数建模来考虑稳健性因素。在本文中,我们应对这一挑战时,除了利用不同的Bayesian方法之外,还利用在相适应的复杂非参数模型中具有计算优势,还产生对Bayesian模型的参数和非参数方面的敏感度。特别是,我们展示了如何评估结论对选择浓度参数的敏感性,以及如何对Drichlet混合物和相关混合物模型下的误判进行断分配。我们对Bayesian敏感度分析的变异方法提供理论和经验支持。