While difference-in-differences (DID) was originally developed with one pre- and one post-treatment periods, data from additional pre-treatment periods is often available. How can researchers improve the DID design with such multiple pre-treatment periods under what conditions? We first use potential outcomes to clarify three benefits of multiple pre-treatment periods: (1) assessing the parallel trends assumption, (2) improving estimation accuracy, and (3) allowing for a more flexible parallel trends assumption. We then propose a new estimator, double DID, which combines all the benefits through the generalized method of moments and contains the two-way fixed effects regression as a special case. In a wide range of applications where several pre-treatment periods are available, the double DID improves upon the standard DID both in terms of identification and estimation accuracy. We also generalize the double DID to the staggered adoption design where different units can receive the treatment in different time periods. We illustrate the proposed method with two empirical applications, covering both the basic DID and staggered adoption designs. We offer an open-source R package that implements the proposed methodologies.
翻译:虽然最初以一个治疗前和一个治疗后时期来发展差异(DID),但从其他治疗前时期得到的数据经常是存在的。研究人员如何在什么条件下改进这种多个治疗前时期的设计?我们首先利用潜在结果澄清多个治疗前时期的三种好处:(1)评估平行趋势假设,(2)提高估计准确性,(3)允许更灵活的平行趋势假设。我们然后提出一个新的估计数据,即双重确定数据,通过一般的瞬时法将所有好处结合起来,并包含双向固定影响回归,作为一个特殊案例。在多个治疗前时期的多种应用中,双重确定数据在识别和估计准确性两方面都改进了标准。我们还将双重数据归纳到交错的收养设计中,让不同单位在不同时期获得治疗。我们用两种经验性应用来说明拟议的方法,既包括基本的确定数据,也包括错开的收养设计。我们提出了一个执行拟议方法的开放源R包。