Chi-squared tests for lack of fit are traditionally employed to find evidence against a hypothesized model, with the model accepted if the Karl Pearson statistic comparing observed and expected numbers of observations falling within cells is not significantly large. However, if one really wants evidence for goodness of fit, it is better to adopt an equivalence testing approach in which small values of the chi-squared statistic are evidence for the desired model. This method requires one to define what is meant by equivalence to the desired model, and guidelines are proposed. Then a simple extension of the classical normalizing transformation for the non-central chi-squared distribution places these values on a simple to interpret calibration scale for evidence. It is shown that the evidence can distinguish between normal and nearby models, as well between the Poisson and over-dispersed models. Applications to evaluation of random number generators and to uniformity of the digits of pi are included. Sample sizes required to obtain a desired expected evidence for goodness of fit are also provided.
翻译:如果Karl Pearson的统计比较观察到的和预计的细胞内观测数量并不大,则该模型被接受。然而,如果一个人真正想得到符合要求的证据,则最好采用一种等效测试方法,将奇夸统计的微小值作为理想模型的证据。这种方法需要一种方法来界定与理想模型的等值意味着什么,并提出了指南。然后,对非中央奇夸分布的典型正常化转换的简单扩展将这些数值放在一个简单的用于解释校准尺度以作为证据的简单标准上。它表明,证据可以区分正常模型和附近模型,以及Poisson模型和超分散模型。还包含对随机数生成器进行评估和对pi数字统一的应用。还提供样本大小,以便获得预期的合适证据。