This paper studies the confounding effects from the unmeasured confounders and the imbalance of observed confounders in IV regression and aims at unbiased causal effect estimation. Recently, nonlinear IV estimators were proposed to allow for nonlinear model in both stages. However, the observed confounders may be imbalanced in stage 2, which could still lead to biased treatment effect estimation in certain cases. To this end, we propose a Confounder Balanced IV Regression (CB-IV) algorithm to jointly remove the bias from the unmeasured confounders and the imbalance of observed confounders. Theoretically, by redefining and solving an inverse problem for potential outcome function, we show that our CB-IV algorithm can unbiasedly estimate treatment effects and achieve lower variance. The IV methods have a major disadvantage in that little prior or theory is currently available to pre-define a valid IV in real-world scenarios. Thus, we study two more challenging settings without pre-defined valid IVs: (1) indistinguishable IVs implicitly present in observations, i.e., mixed-variable challenge, and (2) latent IVs don't appear in observations, i.e., latent-variable challenge. To address these two challenges, we extend our CB-IV by a latent-variable module, namely CB-IV-L algorithm. Extensive experiments demonstrate that our CB-IV(-L) outperforms the existing approaches.
翻译:本文研究了四级回归中被观察到的困惑者和被观察者在四级回归中的不平衡的不测和不平衡的混和效应,目的是进行公正的因果关系估计。最近,提出了非线性四级估算,以允许在两个阶段都采用非线性模型。然而,在第二阶段中,观察到的混淆者可能出现不平衡,这在某些情况下仍然可能导致有偏向性的待遇估计。为此,我们建议采用一个混杂者四级平衡回归(CB-IV)算法,以联合消除被观察者在不测的混淆者和被观察者之间不平衡的偏见。理论上,通过重新定义和解决潜在结果功能的反问题,我们表明我们的CB-IV算法可以不带偏见地估计治疗效果,并实现更低的差异。第四级方法的主要缺点是,在现实世界情景中,目前很少或理论可以对有效的四级四级平衡(CB-V-IV)算法进行预定义有效的四级回归。因此,我们研究两个更具挑战性的背景环境:(1) 在观察中隐含存在的不易分的四级的四级方针,即可变易变的四级的四级的四级的四级模型显示我们可变式的变式的变式的变式模型中,即潜变式的变式的四级的变式的变式的变式的演化的变式的变式的变式的变式的变式的变式的变式。