The instrumental variable (IV) approach is a widely used way to estimate the causal effects of a treatment on an outcome of interest from observational data with latent confounders. A standard IV is expected to be related to the treatment variable and independent of all other variables in the system. However, it is challenging to search for a standard IV from data directly due to the strict conditions. The conditional IV (CIV) method has been proposed to allow a variable to be an instrument conditioning on a set of variables, allowing a wider choice of possible IVs and enabling broader practical applications of the IV approach. Nevertheless, there is not a data-driven method to discover a CIV and its conditioning set directly from data. To fill this gap, in this paper, we propose to learn the representations of the information of a CIV and its conditioning set from data with latent confounders for average causal effect estimation. By taking advantage of deep generative models, we develop a novel data-driven approach for simultaneously learning the representation of a CIV from measured variables and generating the representation of its conditioning set given measured variables. Extensive experiments on synthetic and real-world datasets show that our method outperforms the existing IV methods.
翻译:工具变量(IV)方法是一种广泛使用的方法,用来估计处理方法对潜在混淆者对观测数据感兴趣的结果产生的因果关系。预计标准四四与处理变量有关,独立于系统中所有其他变量。然而,由于条件严格,直接从数据中寻找标准四具有挑战性。有条件的四(CIV)方法是为了使变量成为以一组变量为条件的工具,允许更广泛地选择可能的四,并能够更广泛地实际应用四方法。然而,没有一种数据驱动方法来发现CIV及其直接由数据构成的调节器。为填补这一空白,我们提议从与潜在混淆器一起的数据中了解CIV及其调节器的表述方式,以便进行平均因果关系估计。我们利用深层的归因模型,开发一种由数据驱动的新办法,以便同时从测量的变量中学习CIV的表示方式,并产生其调节器的表示。在合成和现实世界数据集上进行的广泛实验表明,我们的方法超越了现有的四种方法。