Bayesian model criticism is an important part of the practice of Bayesian statistics. Traditionally, model criticism methods have been based on the predictive check, an adaptation of goodness-of-fit testing to Bayesian modeling and an effective method to understand how well a model captures the distribution of the data. In modern practice, however, researchers iteratively build and develop many models, exploring a space of models to help solve the problem at hand. While classical predictive checks can help assess each one, they cannot help the researcher understand how the models relate to each other. This paper introduces the posterior predictive null check (PPN), a method for Bayesian model criticism that helps characterize the relationships between models. The idea behind the PPN is to check whether data from one model's predictive distribution can pass a predictive check designed for another model. This form of criticism complements the classical predictive check by providing a comparative tool. A collection of PPNs, which we call a PPN study, can help us understand which models are equivalent and which models provide different perspectives on the data. With mixture models, we demonstrate how a PPN study, along with traditional predictive checks, can help select the number of components by the principle of parsimony. With probabilistic factor models, we demonstrate how a PPN study can help understand relationships between different classes of models, such as linear models and models based on neural networks. Finally, we analyze data from the literature on predictive checks to show how a PPN study can improve the practice of Bayesian model criticism. Code to replicate the results in this paper is available at \url{https://github.com/gemoran/ppn-code}.
翻译:Bayesian 模型批评是Bayesian 统计做法的一个重要部分。 传统上, 模型批评方法基于预测性检查、 将最佳测试改制到Bayesian 模型模型, 以及一种有效方法来理解模型如何很好地捕捉数据分布。 然而, 在现代实践中, 研究人员反复地建立和开发许多模型, 探索模型空间来帮助解决手头问题。 虽然经典预测性检查可以帮助评估每个模型, 它们无法帮助研究者了解模型彼此之间的关系。 本文介绍了后表预测性无结果检查( PPNN), 这是Bayesian 模型批评的一种方法, 有助于描述模型之间的关系。 PPNPN的模型背后的想法是检查一个模型预测性检查, 从一个模型的预测性分布上看数据。 这种形式的批评形式通过提供比较工具来补充经典的预测性检查。 我们称之为PPPPM 研究的模型可以帮助我们了解哪些模型是对应的, 而哪个模型可以提供不同的数据观点。 通过混合模型, 我们展示了一种PPN 分析性分析性模型, 以及我们以传统分析性指数为基础的模型研究, 能够展示一个基于最终的 Pamim 。