Power analyses are an important aspect of experimental design, because they help determine how experiments are implemented in practice. It is common to specify a desired level of power and compute the sample size necessary to obtain that power. Such calculations are well-known for completely randomized experiments, but there can be many benefits to using other experimental designs. For example, it has recently been established that rerandomization, where subjects are randomized until covariate balance is obtained, increases the precision of causal effect estimators. This work establishes the power of rerandomized treatment-control experiments, thereby allowing for sample size calculators. We find the surprising result that, while power is often greater under rerandomization than complete randomization, the opposite can occur for very small treatment effects. The reason is that inference under rerandomization can be relatively more conservative, in the sense that it can have a lower type-I error at the same nominal significance level, and this additional conservativeness adversely affects power. This surprising result is due to treatment effect heterogeneity, a quantity often ignored in power analyses. We find that heterogeneity increases power for large effect sizes but decreases power for small effect sizes.
翻译:电源分析是实验设计的一个重要方面, 因为它们有助于确定实验在实践中是如何实施的。 通常会指定一个理想的功率水平, 并计算获得该功率所需的样本大小。 这种计算在完全随机化的实验中是众所周知的, 但使用其他实验设计可能有许多好处。 例如, 最近已经确定, 重新调整, 在获得共差平衡之前, 实验对象是随机的, 提高因果测算器的精确度。 这项工作建立了重新调整治疗控制实验的能量, 从而允许样本大小的计算器。 我们发现, 令人惊讶的结果是, 电量在重新随机化后往往比完全随机化时要大, 而相反的结果可能是非常小的治疗效果。 原因是, 重新随机化下的推断可能比较保守, 也就是说, 其类型- I 误差在相同的名义意义上会降低, 额外保守性会影响电力。 这一惊人的结果是治疗效果的异质性, 并且数量常常在电力分析中被忽略。 我们发现, 变异性会增加大影响大小的功力。