Power is an important aspect of experimental design, because it allows researchers to understand the chance of detecting causal effects if they exist. It is common to specify a desired level of power, and then compute the sample size necessary to obtain that level of power; thus, power calculations help determine how experiments are conducted in practice. Power and sample size calculations are readily available for completely randomized experiments; however, there can be many benefits to using other experimental designs. For example, in recent years it has been established that rerandomized designs, where subjects are randomized until a prespecified level of covariate balance is obtained, increase the precision of causal effect estimators. This work establishes the statistical power of rerandomized treatment-control experiments, thereby allowing for sample size calculators. Our theoretical results also clarify how power and sample size are affected by treatment effect heterogeneity, a quantity that is often ignored in power analyses. Via simulation, we confirm our theoretical results and find that rerandomization can lead to substantial sample size reductions; e.g., in many realistic scenarios, rerandomization can lead to a 25% or even 50% reduction in sample size for a fixed level of power, compared to complete randomization. Power and sample size calculators based on our results are in the R package rerandPower on CRAN.
翻译:动力是实验设计的一个重要方面,因为它使研究人员能够理解在存在因果关系的情况下发现这种因果关系的可能性。 通常的做法是指定一个理想的功率水平,然后计算获得这种功率所需的样本规模; 因此, 电力计算有助于确定实际中如何进行实验。 电力和样本规模的计算很容易为完全随机化的实验提供; 但是,使用其他实验设计有许多好处。 例如,近年来已经确定,重新调整的设计,在获得预定的共变平衡水平之前,对实验对象进行随机调整,提高因果关系估测器的精确度。 这项工作建立了重新随机化治疗控制实验的统计能力,从而允许样品规模的计算。 我们的理论结果还澄清了电力和样本规模如何受到治疗效应的异质性影响,而这种数量在电力分析中常常被忽视。 虚拟,我们证实了我们的理论结果,并发现重新调整可以导致大量样本规模的缩小; 例如,在许多现实的假设中,重新调整后, 因果关系可以导致25%或甚至50%的试测器规模的样本规模的递减幅度。 在固定的RAN级的样品规模上, 我们的试算的样品级的机级的大小为25 %。