Recovering causal structure in the presence of latent variables is an important but challenging task. While many methods have been proposed to handle it, most of them require strict and/or untestable assumptions on the causal structure. In real-world scenarios, observed variables may be affected by multiple latent variables simultaneously, which, generally speaking, cannot be handled by these methods. In this paper, we consider the linear, non-Gaussian case, and make use of the joint higher-order cumulant matrix of the observed variables constructed in a specific way. We show that, surprisingly, causal asymmetry between two observed variables can be directly seen from the rank deficiency properties of such higher-order cumulant matrices, even in the presence of an arbitrary number of latent confounders. Identifiability results are established, and the corresponding identification methods do not even involve iterative procedures. Experimental results demonstrate the effectiveness and asymptotic correctness of our proposed method.
翻译:在存在潜在变量的情况下恢复因果结构是一项重要但具有挑战性的任务。尽管已有许多方法被提出以处理此问题,但大多数方法对因果结构的要求严格且/或假设难以验证。在实际场景中,观测变量可能同时受到多个潜在变量的影响,而现有方法通常无法处理此类情况。本文在线性非高斯框架下,通过以特定方式构建观测变量的联合高阶累积量矩阵,证明了即使存在任意数量的潜在混杂变量,两个观测变量间的因果非对称性仍可直接从这类高阶累积量矩阵的秩亏缺性质中识别。我们建立了可识别性理论,相应的识别方法甚至无需迭代计算过程。实验结果验证了所提方法的有效性与渐近正确性。