Causal discovery, the task of automatically constructing a causal model from data, is of major significance across the sciences. Evaluating the performance of causal discovery algorithms should ideally involve comparing the inferred models to ground-truth models available for benchmark datasets, which in turn requires a notion of distance between causal models. While such distances have been proposed previously, they are limited by focusing on graphical properties of the causal models being compared. Here, we overcome this limitation by defining distances derived from the causal distributions induced by the models, rather than exclusively from their graphical structure. Pearl and Mackenzie (2018) have arranged the properties of causal models in a hierarchy called the "ladder of causation" spanning three rungs: observational, interventional, and counterfactual. Following this organization, we introduce a hierarchy of three distances, one for each rung of the ladder. Our definitions are intuitively appealing as well as efficient to compute approximately. We put our causal distances to use by benchmarking standard causal discovery systems on both synthetic and real-world datasets for which ground-truth causal models are available. Finally, we highlight the usefulness of our causal distances by briefly discussing further applications beyond the evaluation of causal discovery techniques.
翻译:由数据自动构建因果模型的任务,即因果发现,在科学中具有重大意义。评估因果发现算法的性能,最理想的做法是将推算模型与基准数据集可用的地面真象模型进行比较,这反过来需要因果模型之间的距离概念。虽然以前曾提出过这种距离,但由于侧重于因果模型的图形属性比较而受到限制。在这里,我们通过界定由模型引起的因果分布产生的距离,而不是完全从其图形结构中得出的距离来克服这一限制。珍珠和麦肯锡(2018年)将因果模型的特性安排在一个等级中,称为“因果关系之大”的等级,涵盖三个层次:观察、干预和反事实。我们遵循这个结构,我们引入了三个距离的等级,每个阶梯子各一个。我们的定义直截了当地吸引人,而且有效地进行了大致的计算。我们通过将标准因果发现系统的基准用于合成和真实世界的因果发现系统,这些数据集都具备地面因果模型。最后,我们强调我们因果发现因果距离的效用,通过讨论进一步评估技术。