Contextual Bayesian optimization (CBO) is a powerful framework for sequential decision-making given side information, with important applications, e.g., in wind energy systems. In this setting, the learner receives context (e.g., weather conditions) at each round, and has to choose an action (e.g., turbine parameters). Standard algorithms assume no cost for switching their decisions at every round. However, in many practical applications, there is a cost associated with such changes, which should be minimized. We introduce the episodic CBO with movement costs problem and, based on the online learning approach for metrical task systems of Coester and Lee (2019), propose a novel randomized mirror descent algorithm that makes use of Gaussian Process confidence bounds. We compare its performance with the offline optimal sequence for each episode and provide rigorous regret guarantees. We further demonstrate our approach on the important real-world application of altitude optimization for Airborne Wind Energy Systems. In the presence of substantial movement costs, our algorithm consistently outperforms standard CBO algorithms.
翻译:Bayesian环境优化(CBO)是按顺序决策的有力框架,提供侧面信息,包括重要应用,例如风能系统。在这一背景下,学习者在每个回合都接受上下文(例如天气条件),并必须选择行动(例如涡轮参数)。标准算法假定在每一回合中改变其决策不需花费任何费用。然而,在许多实际应用中,这种变化涉及成本,应当尽量减少。我们引入带有移动成本问题的分型 CBO。我们引入了带有移动成本问题的CBO。我们根据Coester和Lee(2019年)等量级任务系统的在线学习方法,提出了利用Gaussian进程信任界限的新型随机镜底算法。我们将其表现与每一回合的离线最佳序列进行了比较,并提供严格的遗憾保证。我们进一步展示了我们在空气风能源系统高度优化的重要现实应用中所采用的方法。在出现大量移动成本时,我们的算法始终超越了CBO标准算法。