We provide an explicit model of the causal mechanism in a structural causal model (SCM) with the goal of estimating counterfactual quantities of interest (CQIs). We propose some standard dependence structures, i.e. copulas, as base cases for the causal mechanism. While these base cases can be used to construct more interesting copulas, there are uncountably many copulas in general and so we formulate optimization problems for bounding the CQIs. As our ultimate goal is counterfactual reasoning in dynamic models which may have latent-states, we show by way of example that filtering / smoothing / sampling methods for these models can be integrated with our modeling of the causal mechanism. Specifically, we consider the "cheating-at-the-casino" application of a hidden Markov model and use linear programming (LP) to construct lower and upper bounds on the casino's winnings due to cheating. These bounds are considerably tighter when we constrain the copulas in the LPs to be time-independent. We can characterize the entire space of SCMs obeying counterfactual stability (CS), and we use it to negatively answer the open question of Oberst and Sontag [18] regarding the uniqueness of the Gumbel-max mechanism for modeling CS. Our work has applications in epidemiology and legal reasoning, and more generally in counterfactual off-policy evaluation, a topic of increasing interest in the reinforcement learning community.
翻译:我们在一个结构性因果模型(SCM)中提供一个明确的因果关系机制模型,目的是估计反事实利息数量。我们建议一些标准依赖结构,即合金,作为因果关系机制的基础案例。虽然这些基本案例可以用来构建更有趣的合金,但总而言之,有许多合金可以用来构建更有趣的合金,因此我们为约束合金制定了最优化问题。我们的最终目标是在可能具有潜伏状态的动态模型中进行反事实推理,我们举例来说,这些模型的过滤/平滑/取样方法可以与我们因果机制的建模结合起来。具体地说,我们考虑将隐藏的马可夫模型的“切换在彩虹”应用,并使用线性编程(LP)来构建赌场因欺骗而得的下限和上限问题。当我们限制LPs中具有潜在状态的合金模型时,这些界限就会更加紧密地紧密地紧密地紧密地联系在一起。我们可以描述这些模型的过滤/顺从反现实社会机制中汲取反事实稳定法律估价的整个空间,我们一般地用S-xxxxxximalalalal ablal laxal lax laus