Gausian 工序强盗优化使用少几只蝙蝠 (Gaussian Process Bandit Optimization with Few Batches) - 专知论文

会员服务 ·

0

赌博机/老虎机 · 优化器 · 核化 · Processing（编程语言） · 平方指数 ·

2021 年 10 月 15 日

Gaussian Process Bandit Optimization with Few Batches

翻译：Gausian 工序强盗优化使用少几只蝙蝠

Zihan Li,Jonathan Scarlett

In this paper, we consider the problem of black-box optimization using Gaussian Process (GP) bandit optimization with a small number of batches. Assuming the unknown function has a low norm in the Reproducing Kernel Hilbert Space (RKHS), we introduce a batch algorithm inspired by batched finite-arm bandit algorithms, and show that it achieves the cumulative regret upper bound $O^\ast(\sqrt{T\gamma_T})$ using $O(\log\log T)$ batches within time horizon $T$, where the $O^\ast(\cdot)$ notation hides dimension-independent logarithmic factors and $\gamma_T$ is the maximum information gain associated with the kernel. This bound is near-optimal for several kernels of interest and improves on the typical $O^\ast(\sqrt{T}\gamma_T)$ bound, and our approach is arguably the simplest among algorithms attaining this improvement. In addition, in the case of a constant number of batches (not depending on $T$), we propose a modified version of our algorithm, and characterize how the regret is impacted by the number of batches, focusing on the squared exponential and Mat\'ern kernels. The algorithmic upper bounds are shown to be nearly minimax optimal via analogous algorithm-independent lower bounds.

翻译：在本文中, 我们考虑使用 Gaussian 进程( GP) 土匪优化来优化黑盒问题。如果在复制 Kernel Hilbert 空间( RKHS) 中, 未知函数的常态值较低, 我们引入了由批量有限武器土匪算法启发的批量算法, 并展示它利用美元( log\log\logT) 在时间范围内直系交易的批次在一定范围内用美元T$( log\log T) 来实现累积的遗憾 $O ast( log\log\log T) 优化。美元( codt) $( count (\ codt) $( $\ gamma_ T) 在时间范围内, 隐藏维度独立的对调调调调调调调调调调调调调调调调调时, $\ gammamama_ T$( legle) legal comendal rubes) 的算算出一个不变的版本。

0

相关内容

赌博机/老虎机

赌博机/老虎机

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日

复杂网络能控性鲁棒性研究进展

专知会员服务

26+阅读 · 2021年6月9日

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

哥伦比亚大学最新《机器学习》课程，Fall-B 2020 (Machine Learning)

专知会员服务

39+阅读 · 2020年11月3日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

专知会员服务

246+阅读 · 2019年10月21日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

已删除

将门创投

8+阅读 · 2019年6月13日

Policy Optimization with Stochastic Mirror Descent

Arxiv

0+阅读 · 2021年12月9日

A Survey on Parameterized Inapproximability: $k$-Clique, $k$-SetCover, and More

Arxiv

0+阅读 · 2021年12月9日

An improper estimator with optimal excess risk in misspecified density estimation and logistic regression

Arxiv

0+阅读 · 2021年12月8日

Improved Distributed Fractional Coloring Algorithms

Arxiv

0+阅读 · 2021年12月8日

A hybrid projection algorithm for stochastic differential equations on manifolds

Arxiv

0+阅读 · 2021年12月6日

Minimax properties of Dirichlet kernel density estimators

Arxiv

0+阅读 · 2021年12月6日

On the complexity of the optimal transport problem with graph-structured cost

Arxiv

0+阅读 · 2021年12月5日

Minimal Variance Sampling with Provable Guarantees for Fast Training of Graph Neural Networks

Minimal Variance Sampling with Provable Guarantees for Fast Training of Graph Neural Networks

Arxiv

13+阅读 · 2020年6月24日

Optimal Algorithms for Non-Smooth Distributed Optimization in Networks

Arxiv

7+阅读 · 2018年6月1日

Variance-based regularization with convex objectives

Arxiv

5+阅读 · 2017年12月14日

VIP会员

文章信息

相关主题

赌博机/老虎机

Processing（编程语言）

相关VIP内容

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日

复杂网络能控性鲁棒性研究进展

专知会员服务

26+阅读 · 2021年6月9日

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

哥伦比亚大学最新《机器学习》课程，Fall-B 2020 (Machine Learning)

专知会员服务

39+阅读 · 2020年11月3日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

专知会员服务

246+阅读 · 2019年10月21日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

热门VIP内容

开通专知VIP会员享更多权益服务

大语言模型智能体强化学习：全景综述

《城市滨海地区：理解复杂多变环境下的指挥控制框架》50页报告

【伯克利博士论文】从推理服务到训练：面向大规模 LLM 智能体的高效系统

美空军“顶点2025”实验：推进AI在C2、动态目标锁定与联盟集成中的应用

相关资讯

已删除

将门创投

8+阅读 · 2019年6月13日

相关论文

Policy Optimization with Stochastic Mirror Descent

Arxiv

0+阅读 · 2021年12月9日

A Survey on Parameterized Inapproximability: $k$-Clique, $k$-SetCover, and More

Arxiv

0+阅读 · 2021年12月9日

An improper estimator with optimal excess risk in misspecified density estimation and logistic regression

Arxiv

0+阅读 · 2021年12月8日

Improved Distributed Fractional Coloring Algorithms

Arxiv

0+阅读 · 2021年12月8日

A hybrid projection algorithm for stochastic differential equations on manifolds

Arxiv

0+阅读 · 2021年12月6日

Minimax properties of Dirichlet kernel density estimators

Arxiv

0+阅读 · 2021年12月6日

On the complexity of the optimal transport problem with graph-structured cost

Arxiv

0+阅读 · 2021年12月5日

Minimal Variance Sampling with Provable Guarantees for Fast Training of Graph Neural Networks

Minimal Variance Sampling with Provable Guarantees for Fast Training of Graph Neural Networks

Arxiv

13+阅读 · 2020年6月24日

Optimal Algorithms for Non-Smooth Distributed Optimization in Networks

Arxiv

7+阅读 · 2018年6月1日

Variance-based regularization with convex objectives

Arxiv

5+阅读 · 2017年12月14日

微信扫码咨询专知VIP会员