We study regression discontinuity designs in which many predetermined covariates, possibly much more than the number of observations, can be used to increase the precision of treatment effect estimates. We consider a two-step estimator which first selects a small number of "important" covariates through a localized Lasso-type procedure, and then, in a second step, estimates the treatment effect by including the selected covariates linearly into the usual local linear estimator. We provide an in-depth analysis of the algorithm's theoretical properties, showing that, under an approximate sparsity condition, the resulting estimator is asymptotically normal, with asymptotic bias and variance that are conceptually similar to those obtained in low-dimensional settings. Bandwidth selection and inference can be carried out using standard methods. We also provide simulations and an empirical application.
翻译:我们研究回归不连续性设计,其中许多预先确定的共变法,可能比观测次数多得多,可用来提高处理效果估计的精确度。我们考虑一个两步测算器,先通过局部拉索型程序选择少量“重要”共变法,然后在第二步,通过将选定的共变法线性纳入通常的本地线性测算器来估计处理效果。我们深入分析了算法的理论属性,表明在大致的宽度条件下,由此产生的测算器是暂时正常的,在概念上与在低维环境中获得的偏差和差异相似。可以使用标准方法进行宽幅选择和推论。我们还提供模拟和实验应用。