Benford's law is often used as a support to critical decisions related to data quality or the presence of data manipulations or even fraud. However, many authors argue that conventional statistical tests will reject the null of data "Benford-ness" if applied in samples of the typical size in this kind of applications, even in the presence of tiny and practically unimportant deviations from Benford's law. Therefore, they suggest using alternative criteria that, however, lack solid statistical foundations. This paper contributes to the debate on the "large $n$" (or "excess power") problem in the context of Benford's law testing. This issue is discussed in relation with the notion of severity testing for goodness of fit tests, with a specific focus on tests for conformity with Benford's law. To do so, we also derive the asymptotic distribution of the mean absolute deviation ($MAD$) statistic as well as an asymptotic standard normal test. Finally, the severity testing principle is applied to six controversial data sets to assess their "Benford-ness".
翻译:Benford的法律常常被用来支持与数据质量或存在数据操纵或甚至欺诈有关的关键决定。然而,许多作者认为,如果在这类应用的典型规模的样本中应用“Benford-ness”数据,常规统计测试将否定“Benford-ness”,即使存在与Benford法律的微小和实际上无关的重要偏离,因此,他们建议采用其他标准,但缺乏坚实的统计基础。本文有助于在Benford法律测试中就“大一美元”(或“超强能力”)问题展开辩论。这个问题将结合对适当测试的严格性测试概念来讨论,具体侧重于对符合Benford法律的检验。为了这样做,我们还得出了绝对偏差($MAD$)统计的无症状分布以及一个有争议的标准正常测试。最后,严重程度测试原则适用于六个有争议的数据集,以评估其“Benford-nity”。