This paper considers the problem of inference in cluster randomized experiments when cluster sizes are non-ignorable. Here, by a cluster randomized experiment, we mean one in which treatment is assigned at the level of the cluster; by non-ignorable cluster sizes we mean that "large" clusters and "small" clusters may be heterogeneous, and, in particular, the effects of the treatment may vary across clusters of differing sizes. In order to permit this sort of flexibility, we consider a sampling framework in which cluster sizes themselves are random. In this way, our analysis departs from earlier analyses of cluster randomized experiments in which cluster sizes are treated as non-random. We distinguish between two different parameters of interest: the equally-weighted cluster-level average treatment effect, and the size-weighted cluster-level average treatment effect. For each parameter, we provide methods for inference in an asymptotic framework where the number of clusters tends to infinity and treatment is assigned using simple random sampling. We additionally permit the experimenter to sample only a subset of the units within each cluster rather than the entire cluster and demonstrate the implications of such sampling for some commonly used estimators. A small simulation study shows the practical relevance of our theoretical results.
翻译:本文考虑了在组群大小不显眼时群集随机实验中的推断问题。 这里, 通过群集随机实验, 我们指的是在组群级别上分配处理的群集随机实验; 不可忽略的群集大小意味着“ 大型” 群集和“ 小型” 群群群可能有差异, 特别是, 处理的效果可能因不同大小的组群而不同。 为了允许这种灵活性, 我们考虑一个抽样框架, 群集大小本身是随机的。 这样, 我们的分析脱离了先前对群集随机实验的群集随机实验的分析, 群集大小被作为非随机性对待。 我们区分了两种不同的利益参数: 同等加权的群集平均处理效果, 以及大小加权的群集平均处理效果。 对于每个参数, 我们提供一种在零星体框架中推断的方法, 在那里, 群集的数量往往与处理不同, 使用简单的随机取样方法。 我们允许实验者在每个组群集内只抽样一个组组组组组组组组分组组, 而不是整个组群集的实际相关性。 我们用模拟结果展示了我们使用过的一些模拟结果。