Sparsity-Inducing Optimal Control via Differential Dynamic Programming - 专知 (Zhuanzhi) Papers


March 22, 2021


Traiko Dinev, Wolfgang Merkt, Vladimir Ivan, Ioannis Havoutis, Sethu Vijayakumar

From arXiv. 7 pages, 11 figures; accepted at IEEE ICRA 2021. The first two authors contributed equally. Supplementary video: https://www.youtube.com/watch?v=YMXRZjFsqhc. Code: https://github.com/ipab-slmc/sparse_ddp

Optimal control is a popular approach to synthesize highly dynamic motion. Commonly, $L_2$ regularization is used on the control inputs in order to minimize energy used and to ensure smoothness of the control inputs. However, for some systems, such as satellites, the control needs to be applied in sparse bursts due to how the propulsion system operates. In this paper, we study approaches to induce sparsity in optimal control solutions -- namely via smooth $L_1$ and Huber regularization penalties. We apply these loss terms to state-of-the-art DDP-based solvers to create a family of sparsity-inducing optimal control methods. We analyze and compare the effect of the different losses on inducing sparsity, their numerical conditioning, their impact on convergence, and discuss hyperparameter settings. We demonstrate our method in simulation and hardware experiments on canonical dynamics systems, control of satellites, and the NASA Valkyrie humanoid robot. We provide an implementation of our method and all examples for reproducibility on GitHub.
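To make the comparison of penalties in the abstract concrete, here is a minimal sketch of the three per-input cost terms it mentions. It is not the authors' implementation. The smooth $L_1$ term is assumed to be the common pseudo-Huber form $\sqrt{u^2 + \alpha^2} - \alpha$, and the Huber term uses the standard textbook definition; the exact forms used in the paper may differ. The function names and the parameters `alpha`, `delta`, and `weight` are hypothetical. The derivative helper is included because DDP-style solvers typically need the gradient and Hessian of the cost with respect to the controls.

```python
import math


def l2_penalty(u, weight=1.0):
    """Quadratic (L2) penalty on a scalar control input."""
    return 0.5 * weight * u * u


def smooth_l1_penalty(u, alpha=0.1, weight=1.0):
    """Smooth approximation of weight * |u| (assumed pseudo-Huber form).

    alpha (hypothetical name) sets the rounding near zero: smaller values
    track |u| more closely but sharpen the curvature at u = 0.
    """
    return weight * (math.sqrt(u * u + alpha * alpha) - alpha)


def huber_penalty(u, delta=0.1, weight=1.0):
    """Huber penalty: quadratic for |u| <= delta, linear beyond it."""
    a = abs(u)
    if a <= delta:
        return 0.5 * weight * a * a
    return weight * delta * (a - 0.5 * delta)


def smooth_l1_derivatives(u, alpha=0.1, weight=1.0):
    """Gradient and Hessian of smooth_l1_penalty with respect to u."""
    r = math.sqrt(u * u + alpha * alpha)
    grad = weight * u / r
    hess = weight * alpha * alpha / (r ** 3)
    return grad, hess


if __name__ == "__main__":
    # Compare how each penalty grows from small to large control inputs.
    for u in (0.0, 0.01, 0.1, 1.0, 2.0):
        print(u, l2_penalty(u), smooth_l1_penalty(u), huber_penalty(u))
```

Under these assumed forms, the smooth $L_1$ Hessian equals `weight / alpha` at zero and decays toward zero for large inputs, which is one way to see why the choice of smoothing parameter interacts with numerical conditioning.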


