Neyman (1923/1990) introduced the randomization model, which contains the notation of potential outcomes to define causal effects and a framework for large-sample inference based on the design of the experiment. However, the existing theory for this framework is far from complete especially when the number of treatment levels diverges and the group sizes vary a lot across treatment levels. We provide a unified discussion of statistical inference under the randomization model with general group sizes across treatment levels. We formulate the estimator in terms of a linear permutational statistic and use results based on Stein's method to derive various Berry--Esseen bounds on the linear and quadratic functions of the estimator. These new Berry--Esseen bounds serve as basis for design-based causal inference with possibly diverging treatment levels and diverging dimension of causal effects. We also fill an important gap by proposing novel variance estimators for experiments with possibly many treatment levels without replications. Equipped with the newly developed results, design-based causal inference in general settings becomes more convenient with stronger theoretical guarantees.
翻译:内曼(1923/1990年)引入了随机化模型,其中载有根据实验设计确定因果关系的潜在结果说明以及大型抽样推断框架,然而,这一框架的现有理论远非完全,特别是当处理水平不同,组规模不同,不同处理水平差异很大时更是如此;我们对随机化模型下的统计推论进行了统一讨论,其范围跨处理水平的组群一般大小不同;我们根据Stein在估计者线性和二次函数上得出各种Berry-Esseen界限的方法,以线性调整统计和使用结果为基础,对估计结果进行了估计;这些新的Berry-Esseen界限作为基于设计、可能不同处理水平和不同因果影响层面的因果关系推论的基础;我们还填补了一个重要的差距,为可能许多治疗水平的试验提出新的差异估测,而无需复制;根据新开发的结果,一般环境中基于设计因素的因果关系推论更加方便,理论保证更加有力。