Causal structure discovery from observational data is fundamental to the causal understanding of autonomous systems such as medical decision support systems, advertising campaigns and self-driving cars. This is essential to solve well-known causal decision making and prediction problems associated with those real-world applications. Recently, recursive causal discovery algorithms have gained particular attention among the research community due to their ability to provide good results by using Conditional Independent (CI) tests in smaller sub-problems. However, each of such algorithms needs a refinement function to remove undesired causal relations of the discovered graphs. Notably, with the increase of the problem size, the computation cost (i.e., the number of CI-tests) of the refinement function makes an algorithm expensive to deploy in practice. This paper proposes a generic causal structure refinement strategy that can locate the undesired relations with a small number of CI-tests, thus speeding up the algorithm for large and complex problems. We theoretically prove the correctness of our algorithm. We then empirically evaluate its performance against the state-of-the-art algorithms in terms of solution quality and completion time in synthetic and real datasets.
翻译:从观测数据中发现因果结构,对于理解医疗决策支持系统、广告运动和自行驾驶汽车等自主系统具有因果关系至关重要。这对于解决与这些现实应用有关的众所周知的因果决策和预测问题至关重要。最近,循环因果发现算法由于能够通过在较小的次级问题中使用有条件独立测试(CI)来提供良好结果而引起研究界的特别关注。然而,每种此类算法都需要一个完善功能来消除所发现图表中不理想的因果关系。值得注意的是,随着问题规模的增加,精细功能的计算成本(即光学测试的数量)使得实际部署的算法费用昂贵。本文提出了一个通用因果结构改进战略,可以将不理想的关系与少量的CI测试联系起来,从而加快大型和复杂问题的算法。我们理论上证明我们的算法是正确的。我们随后根据解决方案质量和合成和真实数据集的完成时间对它的业绩进行了实验性评估。