Assessing goodness-of-fit is challenging because theoretically there is no uniformly powerful test, whereas in practice the question `what would be a preferable default test?' is important to applied statisticians. To take a look at this so-called omnibus testing problem, this paper considers the class of reweighted Anderson-Darling tests and makes two fold contributions. The first contribution is to provide a geometric understanding of the problem via establishing an explicit one-to-one correspondence between the weights and their focal directions of deviations of the distributions under alternative hypothesis from those under the null. It is argued that the weights that produce the test statistic with minimum variance can serve as a general-purpose test. In addition, this default or optimal weights-based test is found to be practically equivalent to the Zhang test, which has been commonly perceived powerful. The second contribution is to establish new large-sample results. It is shown that like Anderson-Darling, the minimum variance test statistic under the null has the same distribution as that of a weighted sum of an infinite number of independent squared normal random variables. These theoretical results are shown to be useful for large sample-based approximations. Finally, the paper concludes with a few remarks, including how the present approach can be extended to create new multinomial goodness-of-fit tests.
翻译:评估福利是具有挑战性的,因为理论上没有统一有力的检验标准,而在实践中,“什么是更可取的默认检验标准?”问题对于应用统计人员来说很重要。为了研究这个所谓的总括测试问题,本文件考虑了重新加权的Anderson-Darling测试类别,并做了两个折叠贡献。第一个贡献是通过在各种重量及其在替代假设下分布偏差的焦点方向之间建立明确的一对一对一对一的对应关系来提供对问题的几何理解,而在实践中,“什么是更可取的默认检验标准?”问题对于应用统计者来说很重要。此外,这一默认或最佳加权测试被认为实际上等同于常被认为强大的Zhang测试。第二个贡献是通过建立新的大型抽样结果。事实表明,与Anderson-Darling一样,无效的最小差异检验统计的分布与无限数量独立的正常随机变量的加权分布相同。这些理论结果显示,对于大规模抽样测试(包括少数论文)的精确度,最终能够对大量抽样分析有用。