Recently, continuous-time dynamical systems have proved useful in providing conceptual and quantitative insights into gradient-based optimization, which is widely used in modern machine learning and statistics. An important question that arises in this line of work is how to discretize the system in such a way that its stability and rates of convergence are preserved. In this paper we propose a geometric framework in which such discretizations can be realized systematically, enabling the derivation of "rate-matching" algorithms without the need for a discrete convergence analysis. More specifically, we show that a generalization of symplectic integrators to non-conservative and in particular dissipative Hamiltonian systems is able to preserve rates of convergence up to a controlled error. Moreover, such methods preserve a shadow Hamiltonian despite the absence of a conservation law, extending key results of symplectic integrators to non-conservative cases. Our arguments rely on a combination of backward error analysis with fundamental results from symplectic geometry. We stress that although the original motivation for this work was the application to optimization, where dissipative systems play a natural role, our results are fully general: they not only provide a differential geometric framework for dissipative Hamiltonian systems but also substantially extend the theory of structure-preserving integration.
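To make the idea concrete, the following is a minimal illustrative sketch (not the paper's specific construction) of one way a symplectic integrator can be generalized to a dissipative Hamiltonian system: the exact flow of a linear friction term is composed with a standard leapfrog step for the conservative part of H(q, p) = |p|^2/2 + f(q). The objective f, friction coefficient gamma, and step size h below are assumptions chosen purely for demonstration.

```python
# Illustrative sketch: a conformal-symplectic-style integrator for the
# dissipative Hamiltonian system
#     dq/dt = p,    dp/dt = -grad f(q) - gamma * p,
# built by composing the exact flow of the friction part with a leapfrog
# (kick-drift-kick) step for the conservative part H(q, p) = 0.5*|p|^2 + f(q).
import numpy as np

def dissipative_leapfrog_step(q, p, grad_f, h, gamma):
    """One step: exact damping half-step, leapfrog, exact damping half-step."""
    p = np.exp(-gamma * h / 2) * p      # exact flow of dp/dt = -gamma * p
    p = p - (h / 2) * grad_f(q)         # leapfrog: half kick
    q = q + h * p                       # leapfrog: drift
    p = p - (h / 2) * grad_f(q)         # leapfrog: half kick
    p = np.exp(-gamma * h / 2) * p      # exact damping half-step
    return q, p

# Example with an assumed quadratic objective f(q) = 0.5 * q^T A q,
# whose minimizer is q = 0.
A = np.diag([1.0, 10.0])
grad_f = lambda q: A @ q

q, p = np.array([2.0, -1.0]), np.zeros(2)
for _ in range(500):
    q, p = dissipative_leapfrog_step(q, p, grad_f, h=0.05, gamma=1.0)
print("final q:", q)  # q decays toward the minimizer at the origin
```

Because the damping sub-flow rescales momenta by a constant factor and the leapfrog sub-step is symplectic, their composition deforms the symplectic structure in a controlled way at each step; this is the kind of structure preservation the abstract refers to when it speaks of retaining stability and convergence rates under discretization.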