The expected improvement (EI) is one of the most popular acquisition functions for Bayesian optimization (BO) and has demonstrated good empirical performances in many applications for the minimization of simple regret. However, under the evaluation metric of cumulative regret, the performance of EI may not be competitive, and its existing theoretical regret upper bound still has room for improvement. To adapt the EI for better performance under cumulative regret, we introduce a novel quantity called the evaluation cost which is compared against the acquisition function, and with this, develop the expected improvement-cost (EIC) algorithm. In each iteration of EIC, a new point with the largest acquisition function value is sampled, only if that value exceeds its evaluation cost. If none meets this criteria, the current best point is resampled. This evaluation cost quantifies the potential downside of sampling a point, which is important under the cumulative regret metric as the objective function value in every iteration affects the performance measure. We further establish in theory a tight regret upper bound of EIC for the squared-exponential covariance kernel under mild regularity conditions, and perform experiments to illustrate the improvement of EIC over several popular BO algorithms.
翻译:预期的改进(EI)是巴耶斯优化(BO)最流行的获取功能之一,并且在许多应用中表现出良好的经验性表现,以尽量减少简单的遗憾。然而,根据累积遗憾的评价标准,EI的表现可能不具竞争力,而其现有的理论遗憾上层仍有改进的余地。为了使EI适应在累积遗憾下更好的业绩,我们引入了一个新的数量,称为评价成本,与获取功能相比,并据此开发了预期的改进成本算法。在EIC的每一次循环中,都取样了一个具有最大获取功能值的新点,只有该值超过其评价成本时,才进行该值的抽样。如果没有达到这一标准,则目前的最佳点将重新标出。这一评估成本将取样的潜在下行量量化一个点,在累积遗憾指标下限中,这一点很重要,因为每项获取功能的客观功能值影响绩效衡量。我们从理论上进一步确定EIC在适度的扩展内置值超过其评价成本的新点,并在适度的正常条件下进行实验,以说明EIC的若干大众对BA的改进。