In this paper I revisit the interpretation of the linear instrumental variables (IV) estimand as a weighted average of conditional local average treatment effects (LATEs). I focus on a practically relevant situation in which additional covariates are required for identification while the reduced-form and first-stage regressions implicitly restrict the effects of the instrument to be homogeneous, and are thus possibly misspecified. I show that the weights on some conditional LATEs are negative and the IV estimand is no longer interpretable as a causal effect under a weaker version of monotonicity, i.e. when there are compliers but no defiers at some covariate values and defiers but no compliers elsewhere. The problem of negative weights disappears in the overidentified specification of Angrist and Imbens (1995) and in an alternative method, termed "reordered IV," that I also develop. Even if all weights are positive, the IV estimand in the just identified specification is not interpretable as the unconditional LATE parameter unless the groups with different values of the instrument are roughly equal sized. I illustrate my findings in an application to causal effects of college education using the college proximity instrument. The benchmark estimates suggest that college attendance yields earnings gains of about 60 log points, which is well outside the range of estimates in the recent literature. I demonstrate that this result is driven by the existence of defiers and the presence of negative weights. Corrected estimates indicate that attending college causes earnings to be roughly 20% higher.
翻译:在本文中,我重审了对线性工具变量(四)估计值的解释,并把它解释为当地有条件平均治疗效果(LATEs)的加权平均值。我着重论述一种实际相关的情况,即需要增加共变数才能识别,而缩写和第一阶段的回归则隐含地限制工具的同一性效果,从而可能作错误的描述。我指出,某些有条件的LATE的权重是负的,而IV估计值不再可解释为单一性这一较弱版本下的因果关系,即,如果有些共同价值和藐视者有遵守者,但没有违反者,但在别处没有遵守者。在安格里斯特和伊姆本斯(1995年)的超标定规格和一种替代方法中,负加权问题已经消失。即使所有加权均为负数,但刚刚确定规格中的IV估计值不再可解释为无条件的LATET参数,除非具有不同价值的群体大致相同。我用60种较高价值来说明我对大学外的正值估算值研究结果。我用60种正值来说明,大学的正值估算结果显示大学收益的比值是大学的近标值。