Two-stage randomized experiments are becoming an increasingly popular experimental design for causal inference when the outcome of one unit may be affected by the treatment assignments of other units in the same cluster. In this paper, we provide a methodological framework for general tools of statistical inference and power analysis for two-stage randomized experiments. Under the randomization-based framework, we consider the estimation of a new direct effect of interest as well as the average direct and spillover effects studied in the literature. We provide unbiased estimators of these causal quantities and their conservative variance estimators in a general setting. Using these results, we then develop hypothesis testing procedures and derive sample size formulas. We theoretically compare the two-stage randomized design with the completely randomized and cluster randomized designs, which represent two limiting designs. Finally, we conduct simulation studies to evaluate the empirical performance of our sample size formulas. For empirical illustration, the proposed methodology is applied to the randomized evaluation of the Indian national health insurance program. An open-source software package is available for implementing the proposed methodology.
翻译:当一个单位的结果可能受到同一组别中其他单位的处理任务的影响时,两阶段随机实验正在成为一种日益流行的因果关系推断实验设计。在本文件中,我们为两阶段随机实验的统计推断和权力分析一般工具提供了一个方法框架。在随机化框架内,我们考虑对一个新的直接利益影响以及文献研究的平均直接效应和溢出效应进行估计。我们在一般环境中对这些因果关系数量及其保守差异估计器进行公正的估计。然后,我们利用这些结果制定假设测试程序并得出样本尺寸公式。我们理论上将两阶段随机化设计与完全随机化和分组随机化的设计进行比较,这代表两个限制性的设计。最后,我们进行模拟研究,以评价我们样本规模公式的经验性表现。关于经验性说明,拟议方法应用于对印度国家健康保险方案的随机评估。有一个用于实施拟议方法的开放源软件包。