For Bayesian optimization (BO) on high-dimensional data with complex structure, neural network-based kernels for Gaussian processes (GPs) have been used to learn flexible surrogate functions by the high representation power of deep learning. However, existing methods train neural networks by maximizing the marginal likelihood, which do not directly improve the BO performance. In this paper, we propose a meta-learning method for BO with neural network-based kernels that minimizes the expected gap between the true optimum value and the best value found by BO. We model a policy, which takes the current evaluated data points as input and outputs the next data point to be evaluated, by a neural network, where neural network-based kernels, GPs, and mutual information-based acquisition functions are used as its layers. With our model, the neural network-based kernel is trained to be appropriate for the acquisition function by backpropagating the gap through the acquisition function and GP. Our model is trained by a reinforcement learning framework from multiple tasks. Since the neural network is shared across different tasks, we can gather knowledge on BO from multiple training tasks, and use the knowledge for unseen test tasks. In experiments using three text document datasets, we demonstrate that the proposed method achieves better BO performance than the existing methods.
翻译:对于具有复杂结构的高维数据的Bayesian优化(BO)而言,Gausian进程以神经网络为基础的内核被用于通过深层学习的高代表力学习灵活的替代功能。然而,现有方法通过最大限度地增加边缘可能性来培养神经网络,这并没有直接改善BO的性能。在本文中,我们为BO提出了一个基于神经网络的内核的元学习方法,该方法将真正的最佳价值与BO发现的最佳价值之间的预期差距降到最低。我们模拟了一种政策,将当前评价的数据点作为投入和产出,用神经网络来评估下一个数据点,用神经网络的内核、GPs和相互的信息获取功能作为其分层。我们用模型来培训以神经网络为基础的内核,通过获取功能和GPO发现的最大值来减少预期的差距。我们的模式通过一个强化学习框架从多个任务中培训。由于神经网络在不同的任务中共享神经网络作为投入和产出,将神经网络作为评估的下一个数据点,用神经网络、基于神经网络的内核、GP和基于信息的相互获取功能作为它的分层作为其分层。我们提出的三个试验方法,我们可以更好地使用现有工具,我们用现有学习方法来学习。