Many real-world decision-making tasks require learning casual relationships between a set of variables. Typical causal discovery methods, however, require that all variables are observed, which might not be realistic in practice. Unfortunately, in the presence of latent confounding, recovering casual relationships from observational data without making additional assumptions is an ill-posed problem. Fortunately, in practice, additional structure among the confounders can be expected, one such example being pervasive confounding, which has been exploited for consistent causal estimation in the special case of linear causal models. In this paper, we provide a proof and method to estimate causal relationships in the non-linear, pervasive confounding setting. The heart of our procedure relies on the ability to estimate the pervasive confounding variation through a simple spectral decomposition of the observed data matrix. We derive a DAG score function based on this insight, and empirically compare our method to existing procedures. We show improved performance on both simulated and real datasets by explicitly accounting for both confounders and non-linear effects.
翻译:许多现实世界的决策任务要求学习一组变量之间的偶然关系。但是,典型的因果关系发现方法要求观察所有变量,这在实践中可能并不现实。 不幸的是,在潜伏混淆的情况下,从观察数据中恢复偶然关系而不做额外的假设是一个不恰当的问题。幸运的是,在实践中,可以预期混淆者之间会有更多的结构,其中一个例子是普遍存在的混杂,在线性因果关系模型的特殊案例中,这种混杂被用来进行一致的因果估计。在本文中,我们提供了一种证据和方法,用以估计非线性、普遍融和环境中的因果关系。我们程序的核心取决于能否通过观察的数据矩阵的简单光谱分解来估计普遍存在的混杂差异。我们根据这种洞察力得出DAG的得分函数,并从经验上将我们的方法与现有程序进行比较。我们通过明确计算混淆者和非线性效应,在模拟和真实数据集上的表现都有所改善。