It is commonly believed that Bayesian optimization (BO) algorithms are highly efficient for optimizing numerically costly functions. However, BO is not often compared to widely different alternatives, and is mostly tested on narrow sets of problems (multimodal, low-dimensional functions), which makes it difficult to assess where (or if) they actually achieve state-of-the-art performance. Moreover, several aspects in the design of these algorithms vary across implementations without a clear recommendation emerging from current practices, and many of these design choices are not substantiated by authoritative test campaigns. This article reports a large investigation about the effects on the performance of (Gaussian process based) BO of common and less common design choices. The experiments are carried out with the established COCO (COmparing Continuous Optimizers) software. It is found that a small initial budget, a quadratic trend, high-quality optimization of the acquisition criterion bring consistent progress. Using the GP mean as an occasional acquisition contributes to a negligible additional improvement. Warping degrades performance. The Mat\'ern 5/2 kernel is a good default but it may be surpassed by the exponential kernel on irregular functions. Overall, the best EGO variants are competitive or improve over state-of-the-art algorithms in dimensions less or equal to 5 for multimodal functions. The code developed for this study makes the new version (v2.1.1) of the R package DiceOptim available on CRAN. The structure of the experiments by function groups allows to define priorities for future research on Bayesian optimization.
翻译:人们普遍认为,巴耶斯优化(BO)算法对于优化数字成本高的功能非常有效。然而,BO并不经常被与大不相同的替代方法进行比较,而且大多在狭窄的一组问题(多式、低维功能)上进行测试,这使得难以评估它们实际达到最先进性能的(或是否)在哪里。此外,这些算法的设计在不同的执行中各不相同,而目前的做法没有提出明确的建议,而且许多这些设计选择没有权威的测试运动来证实。本篇文章报告了对共同和不太常见的设计选择对(基于Gausian进程的)BO性能的影响的大规模调查。实验是在已经建立的COCO(CO)(Capparing Coptimer Applicers)软件软件中进行的,这使得很难评估它们实际上在哪里取得最先进的预算、四重趋势、高质量地优化采购标准带来一致的进展。使用GPOP作为偶尔获得的代用词有助于微量的额外改进。WERSD2.1系统变现性性性能。 Mat\'n 5/2内尔(B) 允许未来设计选择一个良好的默认,但是它可能由指数式的Eng-ROKNOL 的Sload 函数比GO值更小的升级为更小。