Randomization (a.k.a. permutation) inference is typically interpreted as testing Fisher's "sharp" null hypothesis that all effects are exactly zero. This hypothesis is often criticized as uninteresting and implausible. We show, however, that many randomization tests are also valid for a "bounded" null hypothesis under which effects are all negative (or positive) for all units but otherwise heterogeneous. The bounded null is closely related to important concepts such as monotonicity and Pareto efficiency. Inverting tests of this hypothesis yields confidence intervals for the maximum (or minimum) individual treatment effect. We then extend randomization tests to infer other quantiles of individual effects, which can be used to infer the proportion of units with effects larger (or smaller) than any threshold. The proposed confidence intervals for all quantiles of individual effects are simultaneously valid, in the sense that no correction due to multiple analyses is needed. In sum, we provide a broader justification for Fisher randomization tests, and develop exact nonparametric inference for quantiles of heterogeneous individual effects. We illustrate our methods with simulations and applications, where we find that Stephenson rank statistics often provide the most informative results.
翻译:随机化( a. k. a. modation) 推断( a. k. a. a. moltation) 通常被解释为测试Fisher的“ sharrp” 空虚假设, 所有效果都是零的, 所有效果都是零的( 或正的), 但是, 我们显示, 许多随机化测试对于“ 捆绑的” 空虚假设也有效, 对所有单位都具有负面( 或正) 效果, 但以其他方式混杂。 捆绑的空虚与单调和Pareto效率等重要概念密切相关。 反转这一假设的测试为最大( 或最低) 个人治疗效果带来信任间隔。 然后我们扩展随机化测试, 推断个别效应的其他量, 而这些量可以用来推断出大于任何阈值的单位的比例。 我们用模拟和应用程序常能提供我们发现的信息性统计的多数方法。