This paper proposes an adaptive randomization procedure for two-stage randomized controlled trials. The method uses data from a first-wave experiment in order to determine how to stratify in a second wave of the experiment, where the objective is to minimize the variance of an estimator for the average treatment effect (ATE). We consider selection from a class of stratified randomization procedures which we call stratification trees: these are procedures whose strata can be represented as decision trees, with differing treatment assignment probabilities across strata. By using the first wave to estimate a stratification tree, we simultaneously select which covariates to use for stratification, how to stratify over these covariates, as well as the assignment probabilities within these strata. Our main result shows that using this randomization procedure with an appropriate estimator results in an asymptotic variance which is minimal in the class of stratification trees. Moreover, the results we present are able to accommodate a large class of assignment mechanisms within strata, including stratified block randomization. In a simulation study, we find that our method, paired with an appropriate cross-validation procedure ,can improve on ad-hoc choices of stratification. We conclude by applying our method to the study in Karlan and Wood (2017), where we estimate stratification trees using the first wave of their experiment.
翻译:本文为两阶段随机控制试验建议一个适应性随机化程序。 方法使用第一波实验的数据, 以确定如何在实验第二波中分层, 目的是将平均治疗效果( ATE) 的估计值差异最小化。 我们考虑从一类分层随机化程序中选择一个我们称之为分层树的分层随机化程序: 这些是其层次可以代表为决定树的程序, 不同层次的治疗概率分配不同。 通过使用第一波来估计分层树, 我们同时选择哪些共变量用于分层, 如何分层, 以及这些层内的分配概率 。 我们的主要结果显示, 使用这种随机化程序, 适当的估计值使分层树的分层性差异最小。 此外, 我们所展示的结果能够适应层次内部的大型分配机制, 包括分层区块随机化。 在模拟研究中, 我们发现我们使用的方法, 分层分层的分层和分层的分层概率, 以及这些层的分层的分层概率。 我们的主要结果显示, 使用这个随机化程序