最佳快速方法的异差衍生法 (A Discrete Variational Derivation of Accelerated Methods in Optimization) - 专知论文

会员服务 ·

0

变分导数 · 优化器 · 离散化 · Integration · Nesterov加速梯度 ·

2021 年 11 月 5 日

A Discrete Variational Derivation of Accelerated Methods in Optimization

翻译：最佳快速方法的异差衍生法

Cédric M. Campos,Alejandro Mahillo,David Martín de Diego

from arxiv, 28 pages, 11 figures

Many of the new developments in machine learning are connected with gradient-based optimization methods. Recently, these methods have been studied using a variational perspective. This has opened up the possibility of introducing variational and symplectic methods using geometric integration. In particular, in this paper, we introduce variational integrators which allow us to derive different methods for optimization. Using both, Hamilton's and Lagrange-d'Alembert's principle, we derive two families of respective optimization methods in one-to-one correspondence that generalize Polyak's heavy ball and the well known Nesterov accelerated gradient method, the second of which mimics the behavior of the first reducing the oscillations of classical momentum methods. However, since the systems considered are explicitly time-dependent, the preservation of symplecticity of autonomous systems occurs here solely on the fibers. Several experiments exemplify the result.

翻译：机器学习的许多新发展都与基于梯度的优化方法有关。最近,这些方法已经用变异角度进行了研究。这打开了采用几何集成采用变异和间隙方法的可能性。特别是, 在本文中, 我们引入了变异集成器, 使我们能够得出不同的优化方法。使用汉密尔顿和拉格兰格- 德阿伦伯特原则, 我们从一对一的通信中得出了两个关于各自优化方法的组合, 这些通信将波拉克的重球和众所周知的内斯特罗夫加速梯度方法普遍化, 其中第二个方法模仿了第一次减少古典动力方法振动的动作。然而, 由于所考虑的系统明确取决于时间, 维护自主系统的随机性只发生在这里的纤维上。几个实验将结果举例化。

0

相关内容

变分导数

【ICML2021】REPAINT:深度强化学习中的知识迁移

专知会员服务

23+阅读 · 2021年9月5日

【ICML2021】异质风险最小化，Heterogeneous Risk Minimization

专知会员服务

16+阅读 · 2021年5月21日

【KDD2020-Tutorial】因果推理与稳定学习，Causal Inference and Stable Learning

【KDD2020-Tutorial】因果推理与稳定学习，Causal Inference and Stable Learning

专知会员服务

87+阅读 · 2020年8月28日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

253+阅读 · 2020年4月19日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

专知会员服务

245+阅读 · 2019年10月21日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Call for Participation: Shared Tasks in NLPCC 2019

Call for Participation: Shared Tasks in NLPCC 2019

中国计算机学会

5+阅读 · 2019年3月22日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

分布式TensorFlow入门指南

分布式TensorFlow入门指南

机器学习研究会

4+阅读 · 2017年11月28日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

Sketching Methods for Dynamic Mode Decomposition in Spherical Shallow Water Equations

Arxiv

0+阅读 · 2022年1月11日

TiltedBERT: Resource Adjustable Version of BERT

Arxiv

0+阅读 · 2022年1月10日

On the convergence of Broyden's method and some accelerated schemes for singular problems

Arxiv

0+阅读 · 2022年1月8日

A Divergence-Conforming Hybridized Discontinuous Galerkin Method for the Incompressible Magnetohydrodynamics Equations

Arxiv

0+阅读 · 2022年1月6日

Observability-Aware Trajectory Optimization: Theory, Viability, and State of the Art

Arxiv

0+阅读 · 2022年1月6日

AdaGDA: Faster Adaptive Gradient Descent Ascent Methods for Minimax Optimization

Arxiv

0+阅读 · 2022年1月5日

Improved bounds for the many-user MAC

Arxiv

0+阅读 · 2022年1月3日

Accelerated Reinforcement Learning

Arxiv

6+阅读 · 2018年4月24日

Parameter Space Noise for Exploration

Arxiv

3+阅读 · 2018年1月31日

Optimal Algorithms for Distributed Optimization

Arxiv

3+阅读 · 2017年12月1日

VIP会员

文章信息

相关主题

Nesterov加速梯度

相关VIP内容

【ICML2021】REPAINT:深度强化学习中的知识迁移

专知会员服务

23+阅读 · 2021年9月5日

【ICML2021】异质风险最小化，Heterogeneous Risk Minimization

专知会员服务

16+阅读 · 2021年5月21日

【KDD2020-Tutorial】因果推理与稳定学习，Causal Inference and Stable Learning

【KDD2020-Tutorial】因果推理与稳定学习，Causal Inference and Stable Learning

专知会员服务

87+阅读 · 2020年8月28日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

253+阅读 · 2020年4月19日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

专知会员服务

245+阅读 · 2019年10月21日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

热门VIP内容

开通专知VIP会员享更多权益服务

《2024年度美国防部作战测试与评估报告》500页

《面相未来作战空中系统中有人-无人编组的AI驱动协作模式选择》含slides

无人机编队飞行：复杂环境中作战的策略、挑战与应用

《探索军事背景下共享大语言模型：AI助手与智能体部署中可扩展性与效率的早期洞察》（含44页slides）

相关资讯

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Call for Participation: Shared Tasks in NLPCC 2019

Call for Participation: Shared Tasks in NLPCC 2019

中国计算机学会

5+阅读 · 2019年3月22日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

分布式TensorFlow入门指南

分布式TensorFlow入门指南

机器学习研究会

4+阅读 · 2017年11月28日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

相关论文

Sketching Methods for Dynamic Mode Decomposition in Spherical Shallow Water Equations

Arxiv

0+阅读 · 2022年1月11日

TiltedBERT: Resource Adjustable Version of BERT

Arxiv

0+阅读 · 2022年1月10日

On the convergence of Broyden's method and some accelerated schemes for singular problems

Arxiv

0+阅读 · 2022年1月8日

A Divergence-Conforming Hybridized Discontinuous Galerkin Method for the Incompressible Magnetohydrodynamics Equations

Arxiv

0+阅读 · 2022年1月6日

Observability-Aware Trajectory Optimization: Theory, Viability, and State of the Art

Arxiv

0+阅读 · 2022年1月6日

AdaGDA: Faster Adaptive Gradient Descent Ascent Methods for Minimax Optimization

Arxiv

0+阅读 · 2022年1月5日

Improved bounds for the many-user MAC

Arxiv

0+阅读 · 2022年1月3日

Accelerated Reinforcement Learning

Arxiv

6+阅读 · 2018年4月24日

Parameter Space Noise for Exploration

Arxiv

3+阅读 · 2018年1月31日

Optimal Algorithms for Distributed Optimization

Arxiv

3+阅读 · 2017年12月1日

微信扫码咨询专知VIP会员