Active learning provides a framework to adaptively sample the most informative experiments towards learning an unknown black-box function. Various approaches of active learning have been proposed in the literature, however, they either focus on exploration or exploitation in the design space. Methods that do consider exploration-exploitation simultaneously employ fixed or ad-hoc measures to control the trade-off that may not be optimal. In this paper, we develop a Bayesian hierarchical approach to dynamically balance the exploration-exploitation trade-off as more data points are queried. We subsequently formulate an approximate Bayesian computation approach based on the linear dependence of data samples in the feature space to sample from the posterior distribution of the trade-off parameter obtained from the Bayesian hierarchical model. Simulated and real-world examples show the proposed approach achieves at least 6% and 11% average improvement when compared to pure exploration and exploitation strategies respectively. More importantly, we note that by optimally balancing the trade-off between exploration and exploitation, our approach performs better or at least as well as either pure exploration or pure exploitation.
翻译:主动学习提供了一种框架,可以自适应地对未知的黑盒函数进行抽样,以获取最具信息的实验。文献中已经提出了各种主动学习的方法,但它们要么关注于探索,要么关注于开发设计空间。同时考虑探索和开发设计空间的方法,采用固定或临时措施来控制权衡,这可能不是最优的。本文提出了一种基于贝叶斯分层模型的方法,以在查询更多数据点时动态平衡探索-利用权衡。随后,我们基于特征空间中数据样本的线性相关性提出了一种近似贝叶斯计算方法,以从Bayesian hierarchical model中的权衡参数的后验分布中进行采样。模拟和现实世界的例子显示,与纯探索和开发策略相比,所提出的方法平均表现至少提高了6%和11%。更重要的是,我们注意到通过平衡探索和利用之间的权衡,我们的方法表现更好或至少与纯探索或纯开发相同。