How should we intervene on an unknown structural equation model to maximize a downstream variable of interest? This setting, also known as causal Bayesian optimization (CBO), has important applications in medicine, ecology, and manufacturing. Standard Bayesian optimization algorithms fail to effectively leverage the underlying causal structure. Existing CBO approaches assume noiseless measurements and do not come with guarantees. We propose the model-based causal Bayesian optimization algorithm (MCBO) that learns a full system model instead of only modeling intervention-reward pairs. MCBO propagates epistemic uncertainty about the causal mechanisms through the graph and trades off exploration and exploitation via the optimism principle. We bound its cumulative regret, and obtain the first non-asymptotic bounds for CBO. Unlike in standard Bayesian optimization, our acquisition function cannot be evaluated in closed form, so we show how the reparameterization trick can be used to apply gradient-based optimizers. The resulting practical implementation of MCBO compares favorably with state-of-the-art approaches empirically.
翻译:我们应如何干预一个未知的结构方程式模型,以最大限度地增加下游利益变量?这种环境,又称因果贝叶西亚优化(CBO),在医学、生态和制造业中有着重要的应用。标准贝叶西亚优化算法未能有效地利用根本因果结构。现有的CBO方法假定无噪音测量,而不是有保证。我们提议基于模型的因果贝叶西亚优化算法(MCBO),该算法可以学习一个完整的系统模型,而不是仅仅模拟干预-奖励配对。MCBO通过图表传播因果关系机制的隐含不确定性,并通过乐观原则进行勘探和开发交易。我们将其累积的遗憾捆绑起来,并获得CBO的第一个非补救界限。与标准的Beesian优化不同,我们的获取功能不能以封闭的形式评价,因此我们展示如何使用重新计法来应用基于梯度的优化。因此,MBO的实际实施比得上最先进的方法。</s>