Lazy OCO: 在线电汇优化转换预算 (Lazy OCO: Online Convex Optimization on a Switching Budget) - 专知论文

会员服务 ·

0

优化器 · 情景 · Continuity · Performer · CASES ·

2021 年 8 月 13 日

Lazy OCO: Online Convex Optimization on a Switching Budget

翻译：Lazy OCO: 在线电汇优化转换预算

Uri Sherman,Tomer Koren

We study a variant of online convex optimization where the player is permitted to switch decisions at most $S$ times in expectation throughout $T$ rounds. Similar problems have been addressed in prior work for the discrete decision set setting, and more recently in the continuous setting but only with an adaptive adversary. In this work, we aim to fill the gap and present computationally efficient algorithms in the more prevalent oblivious setting, establishing a regret bound of $O(T/S)$ for general convex losses and $\widetilde O(T/S^2)$ for strongly convex losses. In addition, for stochastic i.i.d.~losses, we present a simple algorithm that performs $\log T$ switches with only a multiplicative $\log T$ factor overhead in its regret in both the general and strongly convex settings. Finally, we complement our algorithms with lower bounds that match our upper bounds in some of the cases we consider.

翻译：我们研究的是在线convex优化的变式,即允许玩家在整个T回合中按预期最多S美元的时间转换决定。类似的问题已经在离散决定设置的前期工作中解决,最近也在连续设置中解决,但只是与适应性对手一起解决。在这项工作中,我们的目标是填补空白,在更普遍的模糊环境中提出计算效率高的算法,为一般 convex损失确定O(T/S)$的遗憾,为强烈的 convex损失设定美元(T/S2)的遗憾结合值。此外,对于Stochaticic i.d.~loss,我们提出了一个简单的算法,在一般和强烈的 convex环境中只使用多复制的$\log T$的系数管理费来执行$(log T$)开关。最后,我们用更低的界限来补充我们的算法,与我们所考虑的一些案例的上限相匹配。

0

相关内容

优化器

ICLR 2021杰出论文奖出炉，8篇论文上榜！

专知会员服务

26+阅读 · 2021年4月2日

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【AAAI2021】Lipschitz终身强化学习

专知会员服务

31+阅读 · 2020年12月14日

哥伦比亚大学最新《机器学习》课程，Fall-B 2020 (Machine Learning)

专知会员服务

39+阅读 · 2020年11月3日

【2020新书】自然语言处理Python与spaCy实践，216页pdf，NLP with Python

【2020新书】自然语言处理Python与spaCy实践，216页pdf，NLP with Python

专知会员服务

108+阅读 · 2020年5月1日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【电子书推荐】Data Science with Python and Dask

【电子书推荐】Data Science with Python and Dask

专知会员服务

44+阅读 · 2019年6月1日

LibRec 精选：AutoML for Contextual Bandits

LibRec 精选：AutoML for Contextual Bandits

LibRec智能推荐

7+阅读 · 2019年9月19日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

动物脑的好奇心和强化学习的好奇心

动物脑的好奇心和强化学习的好奇心

CreateAMind

10+阅读 · 2019年1月26日

RL 真经

CreateAMind

5+阅读 · 2018年12月28日

Ray RLlib: Scalable 降龙十八掌

Ray RLlib: Scalable 降龙十八掌

CreateAMind

9+阅读 · 2018年12月28日

spinningup.openai 强化学习资源完整

spinningup.openai 强化学习资源完整

CreateAMind

6+阅读 · 2018年12月17日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

Optimizing Ranking Systems Online as Bandits

Arxiv

0+阅读 · 2021年10月12日

Private Federated Learning Without a Trusted Server: Optimal Algorithms for Convex Losses

Arxiv

0+阅读 · 2021年10月12日

Randomized Exploration for Non-Stationary Stochastic Linear Bandits

Arxiv

0+阅读 · 2021年10月11日

Efficient Methods for Online Multiclass Logistic Regression

Arxiv

0+阅读 · 2021年10月10日

Exponential Upper Bounds for the Runtime of Randomized Search Heuristics

Arxiv

0+阅读 · 2021年10月9日

Distributed Bandits: Probabilistic Communication on $d$-regular Graphs

Arxiv

0+阅读 · 2021年10月9日

When Can We Learn General-Sum Markov Games with a Large Number of Players Sample-Efficiently?

When Can We Learn General-Sum Markov Games with a Large Number of Players Sample-Efficiently?

Arxiv

0+阅读 · 2021年10月8日

Deep Upper Confidence Bound Algorithm for Contextual Bandit Ranking of Information Selection

Deep Upper Confidence Bound Algorithm for Contextual Bandit Ranking of Information Selection

Arxiv

0+阅读 · 2021年10月8日

Efficient Local Planning with Linear Function Approximation

Arxiv

0+阅读 · 2021年10月7日

Scaling Bayesian Optimization With Game Theory

Arxiv

0+阅读 · 2021年10月7日

VIP会员

文章信息

相关主题

相关VIP内容

ICLR 2021杰出论文奖出炉，8篇论文上榜！

专知会员服务

26+阅读 · 2021年4月2日

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【AAAI2021】Lipschitz终身强化学习

专知会员服务

31+阅读 · 2020年12月14日

哥伦比亚大学最新《机器学习》课程，Fall-B 2020 (Machine Learning)

专知会员服务

39+阅读 · 2020年11月3日

【2020新书】自然语言处理Python与spaCy实践，216页pdf，NLP with Python

【2020新书】自然语言处理Python与spaCy实践，216页pdf，NLP with Python

专知会员服务

108+阅读 · 2020年5月1日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【电子书推荐】Data Science with Python and Dask

【电子书推荐】Data Science with Python and Dask

专知会员服务

44+阅读 · 2019年6月1日

热门VIP内容

开通专知VIP会员享更多权益服务

《俄乌战争背景下俄罗斯的战略性海军分析（2022-2025年）》最新100页报告

【斯坦福博士论文】数据、决策与依赖：构建可信人工智能的挑战

人工智能时代背景下的未来海战

接触战中的无人机优势：美军旅级部队面临的小型无人机系统挑战与调整

相关资讯

LibRec 精选：AutoML for Contextual Bandits

LibRec 精选：AutoML for Contextual Bandits

LibRec智能推荐

7+阅读 · 2019年9月19日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

动物脑的好奇心和强化学习的好奇心

动物脑的好奇心和强化学习的好奇心

CreateAMind

10+阅读 · 2019年1月26日

RL 真经

CreateAMind

5+阅读 · 2018年12月28日

Ray RLlib: Scalable 降龙十八掌

Ray RLlib: Scalable 降龙十八掌

CreateAMind

9+阅读 · 2018年12月28日

spinningup.openai 强化学习资源完整

spinningup.openai 强化学习资源完整

CreateAMind

6+阅读 · 2018年12月17日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

相关论文

Optimizing Ranking Systems Online as Bandits

Arxiv

0+阅读 · 2021年10月12日

Private Federated Learning Without a Trusted Server: Optimal Algorithms for Convex Losses

Arxiv

0+阅读 · 2021年10月12日

Randomized Exploration for Non-Stationary Stochastic Linear Bandits

Arxiv

0+阅读 · 2021年10月11日

Efficient Methods for Online Multiclass Logistic Regression

Arxiv

0+阅读 · 2021年10月10日

Exponential Upper Bounds for the Runtime of Randomized Search Heuristics

Arxiv

0+阅读 · 2021年10月9日

Distributed Bandits: Probabilistic Communication on $d$-regular Graphs

Arxiv

0+阅读 · 2021年10月9日

When Can We Learn General-Sum Markov Games with a Large Number of Players Sample-Efficiently?

When Can We Learn General-Sum Markov Games with a Large Number of Players Sample-Efficiently?

Arxiv

0+阅读 · 2021年10月8日

Deep Upper Confidence Bound Algorithm for Contextual Bandit Ranking of Information Selection

Deep Upper Confidence Bound Algorithm for Contextual Bandit Ranking of Information Selection

Arxiv

0+阅读 · 2021年10月8日

Efficient Local Planning with Linear Function Approximation

Arxiv

0+阅读 · 2021年10月7日

Scaling Bayesian Optimization With Game Theory

Arxiv

0+阅读 · 2021年10月7日

微信扫码咨询专知VIP会员