The investigation of the question "which treatment has a causal effect on a target variable?" is of particular relevance in a large number of scientific disciplines. This challenging task becomes even more difficult if not all treatment variables were or even cannot be observed jointly with the target variable. Another similarly important and challenging task is to quantify the causal influence of a treatment on a target in the presence of confounders. In this paper, we discuss how causal knowledge can be obtained without having observed all variables jointly, but by merging the statistical information from different datasets. We first show how the maximum entropy principle can be used to identify edges among random variables when assuming causal sufficiency and an extended version of faithfulness. Additionally, we derive bounds on the interventional distribution and the average causal effect of a treatment on a target variable in the presence of confounders. In both cases we assume that only subsets of the variables have been observed jointly.
翻译:调查“哪些治疗对目标变量具有因果关系”的问题,在许多科学学科中具有特别的相关性。这一具有挑战性的任务,如果不是所有治疗变量都与目标变量一起观察,甚至无法与目标变量一起观察,就变得更加困难。另一个同样重要和具有挑战性的任务,是量化治疗对目标目标的因果关系,同时有混淆者在场。在本文中,我们讨论如何在不共同观察所有变量的情况下获得因果关系知识,但将不同数据集的统计资料合并在一起。我们首先说明在假设因果关系充足和忠诚度的扩展版本时,如何使用最大通缩原则来确定随机变量的边际。此外,我们从干预分布和治疗对目标变量的平均因果关系影响中得出界限。在这两种情况下,我们假设只有变量的子项是共同观察的。