Projection operations are a typical computational bottleneck in online learning. In this paper, we enable projection-free online learning within the framework of Online Convex Optimization with Memory (OCO-M) -- OCO-M captures how the history of decisions affects the current outcome by allowing the online learning loss functions to depend on both current and past decisions. In particular, we introduce the first projection-free meta-base learning algorithm with memory that minimizes dynamic regret, i.e., the suboptimality against any sequence of time-varying decisions. We are motivated by artificial intelligence applications where autonomous agents need to adapt to time-varying environments in real time, accounting for how past decisions affect the present. Examples of such applications include online control of dynamical systems, statistical arbitrage, and time series prediction. Our algorithm builds on the Online Frank-Wolfe (OFW) and Hedge algorithms. We demonstrate how the algorithm can be applied to the online control of linear time-varying systems in the presence of unpredictable process noise. To this end, we develop the first controller with memory and bounded dynamic regret against any optimal time-varying linear feedback control policy. We validate our algorithm in simulated scenarios of online control of linear time-invariant systems.
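As one illustration of the projection-free idea underlying OFW (not the paper's full meta-base algorithm with memory), the sketch below shows a single Online Frank-Wolfe update in which a linear minimization oracle over the feasible set replaces the projection step. The function names (ofw_step, ball_lmo), the unit-ball feasible set, and the decaying step-size schedule are illustrative assumptions, not details taken from the paper.

```python
import numpy as np

def ofw_step(x, grad, lmo, t):
    """One illustrative projection-free Online Frank-Wolfe update.

    Instead of projecting a gradient step back onto the feasible set,
    the update calls a linear minimization oracle (lmo) over the set
    and moves toward its output with a decaying step size, so the
    iterate stays feasible as a convex combination of feasible points.
    """
    v = lmo(grad)                  # v = argmin_{v in K} <grad, v>
    gamma = 1.0 / np.sqrt(t + 1)   # assumed step-size schedule (illustrative)
    return x + gamma * (v - x)

def ball_lmo(g):
    """Linear minimization oracle for the Euclidean unit ball: v = -g / ||g||."""
    norm = np.linalg.norm(g)
    return -g / norm if norm > 0 else np.zeros_like(g)

# Toy usage: online quadratic losses centered at a fixed point.
x = np.zeros(3)
target = np.array([0.5, -0.2, 0.1])
for t in range(10):
    g = 2.0 * (x - target)         # gradient of the round-t loss ||x - target||^2
    x = ofw_step(x, g, ball_lmo, t)
```

The key design point is that each round costs only one call to a linear minimization oracle, which is typically much cheaper than a projection onto the same feasible set.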