Structural causal models are the basic modelling unit in Pearl's causal theory; in principle they allow us to solve counterfactuals, which are at the top rung of the ladder of causation. But they often contain latent variables that limit their application to special settings. This appears to be a consequence of the fact, proven in this paper, that causal inference is NP-hard even in models characterised by polytree-shaped graphs. To deal with such a hardness, we introduce the causal EM algorithm. Its primary aim is to reconstruct the uncertainty about the latent variables from data about categorical manifest variables. Counterfactual inference is then addressed via standard algorithms for Bayesian networks. The result is a general method to approximately compute counterfactuals, be they identifiable or not (in which case we deliver bounds). We show empirically, as well as by deriving credible intervals, that the approximation we provide becomes accurate in a fair number of EM runs. These results lead us finally to argue that there appears to be an unnoticed limitation to the trending idea that counterfactual bounds can often be computed without knowledge of the structural equations.
翻译:结构因果模型是珍珠因果理论中的基本建模单位;原则上,它们允许我们解决反事实,这是因果关系阶梯的顶端。但它们往往包含将自身应用限制在特殊设置的潜伏变量。这似乎是本文所证明的以下事实的结果:即使以多树形图形为特征的模型中,因果关系推论也是坚硬的。为了处理这种硬性,我们引入了因果的EM算法。其主要目的是从绝对明显变量的数据中重建潜在变量的不确定性。随后,通过巴伊西亚网络的标准算法处理反事实推论。结果是一种大致计算反事实的一般方法,可以识别,也可以不识别。我们通过得出可靠的间隔,我们从经验上表明我们提供的近似在相当数量的EM运行中变得准确。这些结果最终导致我们说,在不了解结构方程式的情况下,往往可以计算反事实界限的趋势概念似乎存在不为人注意的限制。