Classical randomized experiments, equipped with randomization-based inference, provide assumption-free inference for treatment effects. They have been the gold standard for drawing causal inference and provide excellent internal validity. However, they have also been criticized for questionable external validity, in the sense that the conclusion may not generalize well to a larger population. The randomized survey experiment is a design tool that can help mitigate this concern, by randomly selecting the experimental units from the target population of interest. However, as pointed out by Morgan and Rubin (2012), chance imbalances often exist in covariate distributions between different treatment groups even under completely randomized experiments. Not surprisingly, such covariate imbalances also occur in randomized survey experiments. Furthermore, the covariate imbalances happen not only between different treatment groups, but also between the sampled experimental units and the overall population of interest. In this paper, we propose a two-stage rerandomization design that can actively avoid undesirable covariate imbalances at both the sampling and treatment assignment stages. We further develop asymptotic theory for rerandomized survey experiments, demonstrating that rerandomization provides better covariate balance, more precise treatment effect estimators, and shorter large-sample confidence intervals. We also propose covariate adjustment to deal with remaining covariate imbalances after rerandomization, showing that it can further improve both the sampling and estimated precision. Our work allows general relationship among covariates at the sampling, treatment assignment and analysis stages, and generalizes both rerandomization in classical randomized experiments (Morgan and Rubin 2012) and rejective sampling in survey sampling (Fuller 2009).
翻译:常规随机随机实验,配有基于随机的推断,为治疗效果提供了无假设的推断。它们是计算因果推断的黄金标准,并提供了极好的内部有效性。然而,它们也被批评为有疑问的外部有效性,因为这一结论可能无法向更多的人口概括。随机调查实验是一个设计工具,可以随机从感兴趣的对象人群中选择实验单位,帮助减轻这一关切。然而,正如摩根和鲁宾(2012年)所指出的,概率失衡经常存在于不同治疗群体之间的共变分布中,即使在完全随机的实验中也是如此。奇怪的是,这种共变失衡也出现在随机调查实验中。此外,共变失衡不仅发生在不同的治疗群体之间,而且发生在抽样试验单位和总体利益人群之间。在本文件中,我们建议采用两阶段的重新调节设计,在取样和治疗分配阶段都能够避免不适当的反复波动不平衡。我们进一步拒绝重新调整调查实验的理论,表明,在一般精确的精确的分类中,进行更精确的汇率分析,我们更准确地估计,在深度的汇率分析后,我们还可以选择更精确的再平衡,在深度的计算中进行。