Tests of goodness of fit are used in nearly every domain where statistics is applied. One powerful and flexible approach is to sample artificial data sets that are exchangeable with the real data under the null hypothesis (but not under the alternative), as this allows the analyst to conduct a valid test using any test statistic they desire. Such sampling is typically done by conditioning on either an exact or approximate sufficient statistic, but existing methods for doing so have significant limitations, which either preclude their use or substantially reduce their power or computational tractability for many important models. In this paper, we propose to condition on samples from a Bayesian posterior distribution, which constitute a very different type of approximate sufficient statistic than those considered in prior work. Our approach, approximately co-sufficient sampling via Bayes (aCSS-B), considerably expands the scope of this flexible type of goodness-of-fit testing. We prove the approximate validity of the resulting test, and demonstrate its utility on three common null models where no existing methods apply, as well as its outperformance on models where existing methods do apply.
翻译:拟合优度检验被应用于几乎所有统计学领域。一种强大而灵活的方法是在零假设下(而非备择假设下)采样与真实数据可交换的人工数据集,这使得分析者能够使用任意期望的检验统计量进行有效检验。此类采样通常通过条件化于精确或近似充分统计量来实现,但现有方法存在显著局限性,导致其在许多重要模型中无法使用,或严重降低其检验功效与计算可行性。本文提出条件化于贝叶斯后验分布的样本,这构成了一种与先前研究截然不同的近似充分统计量类型。我们的方法——基于贝叶斯的近似共充分采样(aCSS-B)——显著扩展了此类灵活拟合优度检验的适用范围。我们证明了所得检验的近似有效性,并通过三个现有方法无法适用的常见零模型验证其实用性,同时在现有方法可适用的模型上展示了其优越性能。