How should we intervene on an unknown structural causal model to maximize a downstream variable of interest? This optimization of the output of a system of interconnected variables, also known as causal Bayesian optimization (CBO), has important applications in medicine, ecology, and manufacturing. Standard Bayesian optimization algorithms fail to effectively leverage the underlying causal structure. Existing CBO approaches assume noiseless measurements and do not come with guarantees. We propose model-based causal Bayesian optimization (MCBO), an algorithm that learns a full system model instead of only modeling intervention-reward pairs. MCBO propagates epistemic uncertainty about the causal mechanisms through the graph and trades off exploration and exploitation via the optimism principle. We bound its cumulative regret, and obtain the first non-asymptotic bounds for CBO. Unlike in standard Bayesian optimization, our acquisition function cannot be evaluated in closed form, so we show how the reparameterization trick can be used to apply gradient-based optimizers. Empirically we find that MCBO compares favorably with existing state-of-the-art approaches.
翻译:我们应如何干预一个未知的结构性因果模型,以最大限度地增加下游利益变量?这种优化一个相互关联的变量系统(又称因果巴耶斯优化(CBO))的产出,在医学、生态学和制造业方面有着重要的应用。标准的巴伊西亚优化算法未能有效地利用潜在的因果结构。现有的CBO方法假定无噪音测量,而没有担保。我们提议基于模型的因果巴耶斯优化(MCBO)算法,这种算法可以学习一个完整的系统模型,而不是只模拟干预-奖励对子。MCBO通过图表传播因果关系机制的隐含不确定性,并通过乐观主义原则进行勘探和开发交易。我们将其累积的遗憾捆绑起来,并获得CBO的第一个非因果界限。与标准的巴伊西亚优化不同,我们的购置功能不能以封闭的形式评价,因此我们展示如何使用再分计量技巧来应用基于梯度的优化。我们乐观地发现MCBO比现有的状态方法更适合。