Proxy causal learning (PCL) is a method for estimating the causal effect of treatments on outcomes in the presence of unobserved confounding, using proxies (structured side information) for the confounder. This is achieved via two-stage regression: in the first stage, we model relations among the treatment and proxies; in the second stage, we use this model to learn the effect of treatment on the outcome, given the context provided by the proxies. PCL guarantees recovery of the true causal effect, subject to identifiability conditions. We propose a novel method for PCL, the deep feature proxy variable method (DFPV), to address the case where the proxies, treatments, and outcomes are high-dimensional and have nonlinear complex relationships, as represented by deep neural network features. We show that DFPV outperforms recent state-of-the-art PCL methods on challenging synthetic benchmarks, including settings involving high dimensional image data. Furthermore, we show that PCL can be applied to off-policy evaluation for the confounded bandit problem, in which DFPV also exhibits competitive performance.
翻译:代理因果学习(PCL)是一种方法,用于评估治疗在未观察到的混乱情况下对结果产生的因果影响,为困惑者使用代理人(结构侧信息),这是通过两阶段回归实现的:在第一阶段,我们模拟治疗和代理人之间的关系;在第二阶段,我们利用这一模型了解治疗对结果的影响,考虑到代理人提供的背景。PCL保证在可识别性条件下恢复真正的因果效应。我们提出了PCL的新颖方法,即深特征替代变量方法(DFPV),以解决代理人、治疗和结果为高维度且具有非线性复杂关系的情况,如深神经网络特征所示。我们表明,DFPV在挑战合成基准(包括高维图像数据环境)方面,超越了最新的最先进的PCL方法。此外,我们表明,PCL可以适用于对相近的波段问题的非政策评价,DFPV也展示了竞争性表现。