This paper studies the problem of learning causal structures from observational data. We reformulate the Structural Equation Model (SEM) with additive noises in a form parameterized by binary graph adjacency matrix and show that, if the original SEM is identifiable, then the binary adjacency matrix can be identified up to super-graphs of the true causal graph under mild conditions. We then utilize the reformulated SEM to develop a causal structure learning method that can be efficiently trained using gradient-based optimization, by leveraging a smooth characterization on acyclicity and the Gumbel-Softmax approach to approximate the binary adjacency matrix. It is found that the obtained entries are typically near zero or one and can be easily thresholded to identify the edges. We conduct experiments on synthetic and real datasets to validate the effectiveness of the proposed method, and show that it readily includes different smooth model functions and achieves a much improved performance on most datasets considered.
翻译:本文研究从观测数据中学习因果结构的问题。 我们用二进制图对相邻关系矩阵的参数以形式将结构衡平模型(SEM)重新配置为添加噪音,并表明,如果最初的SEM可以识别,那么二进制相邻关系矩阵就可以在温和的条件下被识别到真实因果图的超强成像。 然后,我们利用重新拟订的SEM开发一种因果结构学习方法,通过利用对周期的平稳定性和Gumbel-Softmax方法来接近二进制相邻关系矩阵来进行高效培训。 我们发现,获得的条目一般接近零或一个,而且很容易被临界到边缘。 我们进行合成和真实的数据集实验,以验证拟议方法的有效性,并表明它很容易包含不同的光滑模型功能,并在所考虑的大多数数据集上取得更好的性能。