The foremost challenge to causal inference with real-world data is to handle the imbalance in the covariates with respect to different treatment options, caused by treatment selection bias. To address this issue, recent literature has explored domain-invariant representation learning based on different domain divergence metrics (e.g., Wasserstein distance, maximum mean discrepancy, position-dependent metric, and domain overlap). In this paper, we reveal the weaknesses of these strategies, i.e., they lead to the loss of predictive information when enforcing the domain invariance; and the treatment effect estimation performance is unstable, which heavily relies on the characteristics of the domain distributions and the choice of domain divergence metrics. Motivated by information theory, we propose to learn the Infomax and Domain-Independent Representations to solve the above puzzles. Our method utilizes the mutual information between the global feature representations and individual feature representations, and the mutual information between feature representations and treatment assignment predictions, in order to maximally capture the common predictive information for both treatment and control groups. Moreover, our method filters out the influence of instrumental and irrelevant variables, and thus it effectively increases the predictive ability of potential outcomes. Experimental results on both the synthetic and real-world datasets show that our method achieves state-of-the-art performance on causal effect inference. Moreover, our method exhibits reliable prediction performances when facing data with different characteristics of data distributions, complicated variable types, and severe covariate imbalance.
翻译:真实世界数据因果推断的最大挑战是处理因治疗选择偏差造成的不同治疗选项的共变性不平衡问题。为了解决这一问题,最近文献探讨了基于不同领域差异度量(如瓦塞斯坦距离、最大平均差异、位置依赖度量和域重叠)的域内差异性代表性学习。 在本文中,我们揭示了这些战略的弱点,即当实施域差异时,导致预测信息丢失;以及治疗效果估计性能不稳定,这严重依赖域分布的特性和域内差异度量度的选择。在信息理论的推动下,我们提议学习信息max和多曼独立度代表法,以解决上述难题。我们的方法利用全球特征表和个人特征表之间的相互信息,以及特征表和处理任务预测之间的相互信息,以便最大限度地为治疗和控制群体获取共同的复杂预测性信息。此外,我们的方法过滤出工具性和无关性变量的特性和域内差异性差异性差异性衡量标准。我们提议学习Informax和多曼因地代表制数据,从而有效地预测我们真实性数据结果。