Semiparametric inference on average causal effects from observational data is based on assumptions yielding identification of the effects. In practice, several distinct identifying assumptions may be plausible; an analyst has to make a delicate choice between these models. In this paper, we study three identifying assumptions based on the potential outcome framework: the back-door assumption, which uses pre-treatment covariates, the front-door assumption, which uses mediators, and the two-door assumption using pre-treatment covariates and mediators simultaneously. We provide the efficient influence functions and the corresponding semiparametric efficiency bounds that hold under these assumptions, and their combinations. We demonstrate that neither of the identification models provides uniformly the most efficient estimation and give conditions under which some bounds are lower than others. We show when semiparametric estimating equation estimators based on influence functions attain the bounds, and study the robustness of the estimators to misspecification of the nuisance models. The theory is complemented with simulation experiments on the finite sample behavior of the estimators. The results obtained are relevant for an analyst facing a choice between several plausible identifying assumptions and corresponding estimators. Our results show that this choice implies a trade-off between efficiency and robustness to misspecification of the nuisance models.
翻译:对观测数据平均因果关系的半成因推断依据的假设得出了观测数据的平均因果关系的半成因效应的确定。在实践中,若干不同的识别假设可能是合理的;分析师必须在这些模型之间作出微妙的选择。在本文中,我们根据潜在成果框架研究三种确定假设:使用预处理共变的后门假设,使用调解人的前门假设,以及同时使用预处理共变和调解人的双门假设。我们提供了在这些假设及其组合下保持的有效影响功能和相应的半成分效率界限。我们证明,这两个识别模型都没有提供统一的最有效估计,也没有提供某些界限低于其他界限的条件。我们在根据影响力功能得出半对等方程估计值时显示,并研究估算者对误差的干扰模型的准确性。该理论还辅之以关于这些假设及其组合下的有限抽样行为的模拟实验。我们获得的结果对于在几个合理确定假设和相应精确度模型之间作出选择的分析师来说是相关的。我们所显示的是,根据影响作用估算的方程估算师达到界限,我们所得出的结果表明,这种选择意味着对精确度与精确度之间的选择。