Randomized experiments are the gold standard for causal inference, and justify simple comparisons across treatment groups. Regression adjustment provides a convenient way to incorporate covariate information for additional efficiency. This article provides a unified account of its utility for improving estimation efficiency in multi-armed experiments. We start with the commonly used additive and fully interacted models for regression adjustment, and clarify the trade-offs between the resulting ordinary least-squares (OLS) estimators for estimating average treatment effects in terms of finite-sample performance and asymptotic efficiency. We then move on to regression adjustment based on restricted least squares (RLS), and establish for the first time its properties for inferring average treatment effects from the design-based perspective. The resulting inference has multiple guarantees. First, it is asymptotically efficient when the restriction is correctly specified. Second, it remains consistent as long as the restriction on the coefficients of the treatment indicators, if any, is correctly specified and separate from that on the coefficients of the treatment-covariates interactions. Third, it can have better finite-sample performance than its unrestricted counterpart even if the restriction is moderately misspecified. It is thus our recommendation for covariate adjustment in multi-armed experiments when the OLS fit of the fully interacted regression risks large finite-sample variability in case of many covariates, many treatments, yet a moderate sample size. In addition, the proposed theory of RLS also provides a powerful tool for studying OLS-based inference from general regression specifications. As an illustration, we demonstrate its unique value for studying OLS-based regression adjustment in factorial experiments via both theory and simulation.
翻译:随机化实验是因果推断的黄金标准, 并证明不同治疗组间进行简单比较是有道理的。 递减调整提供了一种方便的方式, 以纳入共变信息, 以提高效率。 本条统一说明其对于提高多控实验中估计效率的效用。 我们从常用的添加和充分互动模型开始, 以回归调整为起点, 并澄清由此产生的普通最小方( OLS) 估计值之间的权衡, 以从有限增量性能和饱和性效率的角度来估计平均治疗效果。 然后, 我们开始根据有限的最小方( RLS) 进行回归调整, 并首次建立其属性, 从设计角度推断平均治疗效果的效用。 由此得出的推论有多重保证。 首先, 当限制得到正确的说明时, 通常最小值( OLS) 估计值的系数( 如果有的话) 与治疗- 相容性反应的系数( ) 。 第三, 以有限的中度( RLS) 递增率( RLS) 推算法的精确度( ) ), 其精确性实验的精确性反应( ) 也通过无限制的数值推算法( ), 大幅变整数级变。