The Wald test remains ubiquitous in statistical practice despite shortcomings such as its inaccuracy in small samples and lack of invariance under reparameterization. This paper develops on another but lesser-known shortcoming called the Hauck--Donner effect (HDE) whereby a Wald test statistic is not monotonely increasing as a function of increasing distance between the parameter estimate and the null value. Resulting in an upward biased $p$-value and loss of power, the aberration can lead to very damaging consequences such as in variable selection. The HDE afflicts many types of regression models and corresponds to estimates near the boundary of the parameter space. This article presents several new results, and its main contributions are to (i) propose a very general test for detecting the HDE, regardless of its underlying cause; (ii) fundamentally characterize the HDE by pairwise ratios of Wald and Rao score and likelihood ratio test statistics for 1-parameter distributions; (iii) show that the parameter space may be partitioned into an interior encased by 5 HDE severity measures (faint, weak, moderate, strong, extreme); (iv) prove that a necessary condition for the HDE in a 2 by 2 table is a log odds ratio of at least 2; (v) give some practical guidelines about HDE-free hypothesis testing. Overall, practical post-fit tests can now be conducted potentially to any model estimated by iteratively reweighted least squares, such as the generalized linear model (GLM) and Vector GLM (VGLM) classes, the latter which encompasses many popular regression models.
翻译:Wald 测试在统计实践中仍然无处不在,尽管存在一些缺陷,如在小样本中存在不准确性,在重新校准时缺乏差异性。本文在另一个不太为人所知的缺点上发展了另一个称为Hauck-Donner(HDE)的缺点,即Wald 测试统计数据不是单数增加,因为其作用是提高参数估计值和无效值之间的距离。由于偏差偏差,功率丧失,因此在统计实践中仍然无处不在。高比值价值和功率上升,结果可能导致非常有害的后果,如变量选择。HDE影响许多类型的回归模型,与参数空间边界附近的估计相匹配。这篇文章提出了若干新的结果,其主要贡献是:(一) 提议对检测GDE的非常一般性测试,而不管其根本原因为何;(二) 将Wald和Rao的评分比和1度分布的可能比率统计结果从根本上定性为HDE(三) 显示,许多参数空间可能会被分割成一个内部(以5 HDE最不精确的模型、中度、中度、强、强、最极端的比值模型),其主要的贡献是(HIV)一个必要的条件。