Constraint-based methods are one of the main approaches for causal structure learning that are particularly valued as they are asymptotically guaranteed to find a structure that is Markov equivalent to the causal graph of the system. On the other hand, they may require an exponentially large number of conditional independence (CI) tests in the number of variables of the system. In this paper, we propose a novel recursive constraint-based method for causal structure learning that significantly reduces the required number of CI tests compared to the existing literature. The idea of the proposed approach is to use Markov boundary information to identify a specific variable that can be removed from the set of variables without affecting the statistical dependencies among the other variables. Having identified such a variable, we discover its neighborhood, remove that variable from the set of variables, and recursively learn the causal structure over the remaining variables. We further provide a lower bound on the number of CI tests required by any constraint-based method. Comparing this lower bound to our achievable bound demonstrates the efficiency of the proposed approach. Our experimental results show that the proposed algorithm outperforms state-of-the-art both on synthetic and real-world structures.
翻译:基于约束性的方法是因果结构学习的主要方法之一,这些方法特别受到重视,因为它们得到保证,能够找到与系统因果图相等的马可夫结构。另一方面,它们可能需要在系统变量数量上进行数量惊人的有条件独立测试(CI)测试。在本文件中,我们提出了一个新的基于循环约束的因果结构学习方法,该方法与现有文献相比,大大减少了所需的CI测试数量。拟议方法的构想是利用Markov边界信息,在不影响其他变量之间统计依赖性的情况下,从一组变量中找出一个可以删除的具体变量。我们发现了这种变量,发现其周边,从一组变量中去除该变量,并反复了解剩余变量的因果结构。我们进一步为任何基于制约的方法所要求的CI测试数量提供了较低的约束。将这一较低约束与我们可实现的约束结合起来,表明了拟议方法的效率。我们的实验结果显示,拟议的算法在合成和现实世界结构上都符合最先进的状态。