Online mirror descent (OMD) and dual averaging (DA) -- two fundamental algorithms for online convex optimization -- are known to have very similar (and sometimes identical) performance guarantees when used with a fixed learning rate. Under dynamic learning rates, however, OMD is provably inferior to DA and suffers linear regret, even in common settings such as prediction with expert advice. We modify the OMD algorithm through a simple technique that we call stabilization. We give essentially the same abstract regret bound for OMD with stabilization and for DA by modifying the classical OMD convergence analysis in a careful and modular way that allows for straightforward and flexible proofs. Simple corollaries of these bounds show that OMD with stabilization and DA enjoy the same performance guarantees in many applications -- even under dynamic learning rates. We also shed light on the similarities between OMD and DA and show simple conditions under which stabilized OMD and DA generate the same iterates.
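To make the setting concrete, below is a minimal sketch of OMD for prediction with expert advice (entropic mirror map on the probability simplex, i.e. exponentiated gradient) with an optional stabilization step. The specific stabilization rule shown -- mixing the current iterate back toward the initial point with weight eta_{t+1}/eta_t before the mirror step -- is one natural choice for decreasing learning rates and is an illustrative assumption, not necessarily the exact scheme analyzed in the paper.

```python
import numpy as np

def omd_expert_advice(losses, etas, stabilize=False):
    """Run exponentiated-gradient OMD on the simplex and return total loss.

    losses: (T, d) array of per-round expert losses in [0, 1].
    etas:   length-T array of (typically decreasing) learning rates.
    stabilize: if True, mix the iterate with the initial point x1
               before each mirror step (a hypothetical stabilization
               rule for illustration).
    """
    T, d = losses.shape
    x1 = np.full(d, 1.0 / d)               # initial (uniform) iterate
    x = x1.copy()
    total = 0.0
    for t in range(T):
        total += x @ losses[t]             # incur linear loss <x_t, l_t>
        eta = etas[t]
        eta_next = etas[t + 1] if t + 1 < T else etas[t]
        if stabilize:
            gamma = eta_next / eta         # shrinks toward x1 as eta decays
            x = gamma * x + (1 - gamma) * x1
        x = x * np.exp(-eta * losses[t])   # entropic mirror step
        x /= x.sum()                       # normalize back onto the simplex
    return total

rng = np.random.default_rng(0)
T, d = 1000, 10
losses = rng.random((T, d))
etas = np.sqrt(np.log(d) / np.arange(1, T + 1))   # dynamic rate ~ 1/sqrt(t)
print(omd_expert_advice(losses, etas, stabilize=True))
```

The unstabilized variant is recovered with `stabilize=False`; comparing the two under the same dynamic learning-rate schedule is the regime where the abstract's separation between plain OMD and DA arises.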