Bayesian optimization is a popular method for optimizing expensive black-box functions. Yet it oftentimes struggles in high dimensions where the computation could be prohibitively heavy. To alleviate this problem, we introduce Coordinate backoff Bayesian Optimization (CobBO) with two-stage kernels. During each round, the first stage uses a simple coarse kernel that sacrifices the approximation accuracy for computational efficiency. It captures the global landscape by purposely smoothing away local fluctuations. Then, in the second stage of the same round, past observed points in the full space are projected to the selected subspace to form virtual points. These virtual points, along with the means and variances of their unknown function values estimated using the simple kernel of the first stage, are fitted to a more sophisticated kernel model in the second stage. Within the selected low dimensional subspace, the computational cost of conducting Bayesian optimization therein becomes affordable. To further enhance the performance, a sequence of consecutive observations in the same subspace are collected, which can effectively refine the approximation of the function. This refinement lasts until a stopping rule is met determining when to back off from a certain subspace and switch to another. This decoupling significantly reduces the computational burden in high dimensions, which fully leverages the observations in the whole space rather than only relying on observations in each coordinate subspace. Extensive evaluations show that CobBO finds solutions comparable to or better than other state-of-the-art methods for dimensions ranging from tens to hundreds, while reducing both the trial complexity and computational costs.
翻译:Bayesian 优化是优化昂贵黑盒功能的流行方法。 然而, 它往往在高维度上挣扎, 计算可能令人望而却步。 为了缓解这一问题, 我们引入了使用两阶段内核的 Bayesian Optimo化( CobbO) 后端协调。 在每一回合中, 第一阶段使用简单的粗粗内核, 以牺牲计算效率的近似精确度。 它通过有意平息本地波动来捕捉全球景色。 然后在同一回合的第二阶段, 全部空间的过去观察点被预测到选定的子空间形成虚拟点。 这些虚拟点, 连同它们使用第一阶段简单内核估计的未知功能值的手段和差异一起, 安装在第二阶段, 安装更复杂的内核内核模型。 在选定的低维次空间中, 进行Bayesian 优化的计算成本可以负担得起。 为了进一步提高业绩, 在同一次空间的连续观测顺序可以有效地改进功能的近比值。 这种改进将持续到停止的观察, 规则在使用第一个阶段, 以数百个空间内测算方法来, 完全地平整一个空间的计算, 。