The regression discontinuity (RD) design offers identification of causal effects under weak assumptions, earning it the position as a standard method in modern political science research. But identification does not necessarily imply that the causal effects can be estimated accurately with limited data. In this paper, we highlight that estimation is particularly challenging with the RD design and investigate how these challenges manifest themselves in the empirical literature. We collect all RD-based findings published in top political science journals from 2009--2018. The findings exhibit pathological features; estimates tend to bunch just above the conventional level of statistical significance. A reanalysis of all studies with available data suggests that researcher's discretion is not a major driver of these pathological features, but researchers tend to use inappropriate methods for inference, rendering standard errors artificially small. A retrospective power analysis reveals that most of these studies were underpowered to detect all but large effects. The issues we uncover, combined with well-documented selection pressures in academic publishing, cause concern that many published findings using the RD design are exaggerated, if not entirely spurious.
翻译:回归不连续(RD)设计在薄弱的假设下可以确定因果关系,从而获得现代政治科学研究的标准方法。但识别并不必然意味着根据有限数据可以准确估计因果关系。在本文中,我们强调,对于RD设计而言,估算尤其具有挑战性,并调查在经验文献中如何反映这些挑战。我们收集了2009年至2018年最高政治科学期刊上发表的基于RD的所有结果。这些结果显示出病理特征;估计往往比常规的统计意义水平高出很多。用现有数据重新分析所有研究的结果表明,研究人员的酌处权并不是这些病理特征的主要驱动力,但研究人员往往使用不适当的方法进行推断,人为地缩小标准错误。追溯力分析表明,大多数这些研究没有足够能力来检测所有但巨大的影响。我们发现的问题,加上学术出版中记录良好的选择压力,使得我们发现的许多使用RD设计发表的研究结果被夸大,如果不是完全虚假的话。