We consider causal models with two observed variables and one latent variables, each variable being discrete, with the goal of characterizing the possible distributions on outcomes that can result from controlling one of the observed variables. We optimize linear functions over the space of all possible interventional distributions, which allows us find properties of the interventional distribution even when we cannot uniquely identify what it is. We show that, under certain mild assumptions about the correlation between controlled variable and the latent variable, the resulting interventional distribution must be close to the observed conditional distribution in a quantitative sense. Specifically, we show that if the observed variables are sufficiently highly correlated, and the latent variable can only take on a small number of distinct values, then the variables will remain causally related after passing to the interventional distribution. Another result, possibly of more general interest, is a bound on the distance between the interventional distribution and the observed conditional distribution in terms of the mutual information between the controlled variable and the latent variable, which shows that the controlled variable and the latent variable must be tightly correlated for the interventional distribution to differ significantly from the observed distribution. We believe that this type of result may make it possible to rigorously consider 'weak' experiments, where the causal variable is not entirely independent from the environment, but only approximately so. More generally, we suggest a connection between the theory of causality to polynomial optimization, which give useful bounds on the space of interventional distributions.
翻译:我们用两个观察到的变量和一个潜在变量来考虑因果模型,每个变量是分开的,目的是说明从控制观察到的变量中可能产生的结果分布。我们优化了所有可能的干预分布空间的线性功能,这使我们能够找到干预分布的属性,即使我们无法独有地确定它是什么。我们显示,根据关于受控变量和潜在变量之间相互关系的某些温和假设,由此产生的干预分布必须从数量意义上接近观察到的有条件分布。具体地说,我们表明,如果观察到的变量具有高度关联性,而潜在变量只能吸收少量的不同值,那么在通过干预分布之后,变量将保持因果关联性。另一个可能具有更普遍兴趣的结果,是干预分布与所观察到的变量和潜在变量之间相互信息之间所观察到的有条件分布之间的距离。我们表明,受控变量和潜在变量必须密切关联性,以便与所观察到的分布有显著差异。我们认为,如果观测到的变量的结果可能使这种结果有可能产生少数不同值,那么在通过干预分布到干预分布之后,那么这些变量将仍然有因果联系。另一个结果,可能是,可能是更普遍的、可能与我们从“weak”的因果性环境之间的因果关系。