Many of the causal discovery methods rely on the faithfulness assumption to guarantee asymptotic correctness. However, the assumption can be approximately violated in many ways, leading to sub-optimal solutions. Although there is a line of research in Bayesian network structure learning that focuses on weakening the assumption, such as exact search methods with well-defined score functions, they do not scale well to large graphs. In this work, we introduce several strategies to improve the scalability of exact score-based methods in the linear Gaussian setting. In particular, we develop a super-structure estimation method based on the support of inverse covariance matrix which requires assumptions that are strictly weaker than faithfulness, and apply it to restrict the search space of exact search. We also propose a local search strategy that performs exact search on the local clusters formed by each variable and its neighbors within two hops in the super-structure. Numerical experiments validate the efficacy of the proposed procedure, and demonstrate that it scales up to hundreds of nodes with a high accuracy.
翻译:许多因果发现方法都依靠忠诚假设来保证无症状的正确性。 但是,这种假设可能在许多方面被大约违反,导致次优的解决方案。 虽然在巴伊西亚网络结构中存在一系列研究,其重点是削弱假设,例如精确的搜索方法,具有明确界定的得分函数,但它们的大小不及大图。 在这项工作中,我们引入了几项战略来提高线性高斯设置中精确的得分方法的可缩放性。特别是,我们开发了一种基于反常变量矩阵支持的超级结构估计方法,该模型要求的假设绝对弱于忠实性,并应用它来限制精确搜索的搜索空间。我们还提出了一个本地搜索战略,对每个变量及其邻居在超级结构中两个跳跃中形成的地方集群进行精确搜索。 数字实验验证了拟议程序的功效,并表明它以高精确度将数百个节点的节点缩成。