Bayesian optimization (BO) is a popular method for black-box optimization that relies on uncertainty estimates when deciding which experiment to perform next. However, little work has addressed how the quality of these uncertainty estimates affects the performance of the BO algorithm, or to what extent calibrated uncertainties improve the ability to find the global optimum. In this work, we provide an extensive study of the relationship between BO performance (regret) and uncertainty calibration for popular surrogate models, comparing them across both synthetic and real-world experiments. Our results confirm that Gaussian Processes are strong surrogate models that tend to outperform other popular models. Our results further show a positive association between calibration error and regret; interestingly, this association disappears when we control for the type of model in the analysis. We also study the effect of re-calibration and demonstrate that it generally does not lead to improved regret. Finally, we provide theoretical justification for why uncertainty calibration may be difficult to combine with BO due to the small sample sizes commonly used.
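To make the two central quantities concrete, the sketch below runs a minimal GP-based BO loop on a synthetic 1-D objective and reports simple regret alongside a coverage-based calibration error for the surrogate's predictive intervals. This is an illustrative sketch, not the paper's experimental code: the objective function, the lower-confidence-bound acquisition rule, and the particular calibration metric are assumptions chosen for the example.

```python
# Minimal sketch (assumptions noted above): GP surrogate, LCB acquisition,
# simple regret, and a coverage-based calibration error on a 1-D problem.
import numpy as np
from scipy.stats import norm
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import Matern

rng = np.random.default_rng(0)

def f(x):
    """Synthetic 1-D objective to minimize (illustrative choice)."""
    return np.sin(3 * x) + 0.1 * x ** 2

X_grid = np.linspace(-3, 3, 400).reshape(-1, 1)
f_opt = f(X_grid).min()  # known optimum of the synthetic function

def calibration_error(mu, sigma, y, levels=np.linspace(0.1, 0.9, 9)):
    """Average gap between nominal and empirical coverage of central
    predictive intervals (one common calibration metric for regression)."""
    z = norm.ppf(0.5 + levels / 2)  # half-width multipliers per level
    covered = np.abs(y - mu)[:, None] <= sigma[:, None] * z[None, :]
    return np.mean(np.abs(covered.mean(axis=0) - levels))

# Start from a few random evaluations.
X = rng.uniform(-3, 3, size=(5, 1))
y = f(X).ravel()

for t in range(20):
    gp = GaussianProcessRegressor(
        kernel=Matern(nu=2.5), alpha=1e-6, normalize_y=True
    )
    gp.fit(X, y)
    mu, sigma = gp.predict(X_grid, return_std=True)

    # Lower confidence bound acquisition (minimization).
    lcb = mu - 2.0 * sigma
    x_next = X_grid[np.argmin(lcb)].reshape(1, -1)

    X = np.vstack([X, x_next])
    y = np.append(y, f(x_next).ravel())

    regret = y.min() - f_opt  # simple regret at iteration t
    ece = calibration_error(mu, sigma, f(X_grid).ravel())
    print(f"t={t:02d}  regret={regret:.4f}  calibration error={ece:.4f}")
```

Tracking both curves over many runs and surrogate models is the kind of comparison the study performs; here a single run merely illustrates that regret and calibration error can be measured side by side within one BO loop.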