用于共变适应随机化的回归调整的一般理论:OLS、Lasso及以后 (A general theory of regression adjustment for covariate-adaptive randomization: OLS, Lasso, and beyond)

We consider the problem of estimating and inferring treatment effects in randomized experiments. In practice, stratified randomization, or more generally, covariate-adaptive randomization, is routinely used in the design stage to balance the treatment allocations with respect to a few variables that are most relevant to the outcomes. Then, regression is performed in the analysis stage to adjust the remaining imbalances to yield more efficient treatment effect estimators. Building upon and unifying the recent results obtained for ordinary least squares adjusted estimators under covariate-adaptive randomization, this paper presents a general theory of regression adjustment that allows for arbitrary model misspecification and the presence of a large number of baseline covariates. We exemplify the theory on two Lasso-adjusted treatment effect estimators, both of which are optimal in their respective classes. In addition, nonparametric consistent variance estimators are proposed to facilitate valid inferences, which work irrespective of the specific randomization methods used. The robustness and improved efficiency of the proposed estimators are demonstrated through a simulation study and a clinical trial example. This study sheds light on improving treatment effect estimation efficiency by implementing machine learning methods in covariate-adaptive randomized experiments.

翻译：我们考虑了随机实验中估计和推断治疗效果的问题。在实践中,在设计阶段经常使用分层随机化,或更一般的共变调整随机化,以平衡与结果最相关的几个变量的治疗分配。然后,在分析阶段进行回归,以调整其余的不平衡,从而产生更有效的治疗效果估计器。根据并统一在共变调整随机化中普通调整的平方调整的估算器的最新结果,本文件提出了回归调整的一般理论,允许任意的模型误差和大量基线变量的存在。我们举例说明了两个激光调整治疗效果估计器的理论,两者在各自类别中都是最佳的。此外,提出非对称一致差异估计器,以便利有效的推论,不管使用的具体随机化方法如何运作。通过模拟研究和临床试验示例展示了拟议估算器的稳健性和效率。本项研究展示了两个激光调整治疗效果估计器的理论,通过测试模型和临床试验模型,在改进治疗效果估计方法方面,通过随机实验方法进行改进。