The counterfactual distribution models the effect of the treatment in the untreated group. While most of the work focuses on the expected values of the treatment effect, one may be interested in the whole counterfactual distribution or other quantities associated to it. Building on the framework of Bayesian conditional mean embeddings, we propose a Bayesian approach for modeling the counterfactual distribution, which leads to quantifying the epistemic uncertainty about the distribution. The framework naturally extends to the setting where one observes multiple treatment effects (e.g. an intermediate effect after an interim period, and an ultimate treatment effect which is of main interest) and allows for additionally modelling uncertainty about the relationship of these effects. For such goal, we present three novel Bayesian methods to estimate the expectation of the ultimate treatment effect, when only noisy samples of the dependence between intermediate and ultimate effects are provided. These methods differ on the source of uncertainty considered and allow for combining two sources of data. Moreover, we generalize these ideas to the off-policy evaluation framework, which can be seen as an extension of the counterfactual estimation problem. We empirically explore the calibration of the algorithms in two different experimental settings which require data fusion, and illustrate the value of considering the uncertainty stemming from the two sources of data.
翻译:反事实分布模型是未经治疗群体治疗的效果。 虽然大部分工作侧重于治疗效果的预期值, 但人们可能对整个反事实分布或与之相关的其他数量感兴趣。 基于巴伊西亚有条件的嵌入体框架, 我们提议采用巴伊西亚方法来模拟反事实分布模式, 从而量化有关分布的隐含不确定性; 框架自然延伸到人们观察多种治疗效果( 例如, 过渡期之后的中间效应, 以及主要感兴趣的最终治疗效应) 的环境, 并允许对这些效应的关系进行更多的建模不确定性。 为此, 我们提出三种新颖的巴伊西亚方法来估计最终治疗效果的预期值, 当只提供中间和最终效应之间依赖性的杂乱样时。 这些方法与所考虑的不确定性来源不同, 并允许将两种数据来源结合起来。 此外, 我们将这些想法概括到非政策评价框架, 它可以被视为反事实估计问题的延伸。 我们从两个不同的实验来源, 实验性地探索了算法的校准值, 以说明两种实验性数据所需的不确定性。