Blocking, a special case of rerandomization, is routinely implemented in the design stage of randomized experiments to balance the baseline covariates. Regression adjustment is highly encouraged in the analysis stage to adjust for the remaining covariate imbalances. This study proposes methods that combine blocking, rerandomization, and regression adjustment techniques in randomized experiments with high-dimensional covariates and a general blocking scheme. In the design stage, we suggest the implementation of blocking and rerandomization to balance the fixed number of covariates most relevant to the outcomes. For the analysis stage, we propose a regression adjustment method based on least absolute shrinkage and selection operator (Lasso) to adjust for the remaining imbalances in additional high-dimensional covariates. We derive the asymptotic properties of the proposed estimator and outline the conditions under which this estimator is more efficient than the unadjusted one. Moreover, we provide a conservative variance estimator to facilitate valid inferences. Our design-based analysis allows for model misspecification and is applicable to heterogeneous block sizes, propensity scores, and treatment effects. Simulation studies and two real-data analyses demonstrate the advantages of the proposed method.
翻译:在随机试验的设计阶段,通常会采用封隔和重新调节的特殊情况,以平衡基准共变。在分析阶段,鼓励回退调整,以适应其余的共变不平衡。本研究提出了在高维共变和一般阻塞装置随机试验中结合封隔、重新调整和回归调整技术的方法。在设计阶段,我们建议采用封隔和重新排序,以平衡与结果最相关的固定共变数。在分析阶段,我们提议一种基于最不绝对缩缩和选择操作器(Lasso)的回归调整方法,以适应额外高维变异体中剩余的不平衡。我们从中得出拟议的估计值的无平衡性属性,并概述该估计值比未经调整的计算器更有效率的条件。此外,我们提供了一种保守的差异估计器,以利合理的推算。我们基于设计的分析允许模型的偏差,并适用于不均匀的块大小、偏移度计分数和处理效果。我们提出的估计法的优劣性和两个实际分析方法展示了模拟法的优劣性。