Planning with Learned Dynamics: Probabilistic Guarantees on Safety and Reachability via Lipschitz Constants - 专知论文 (Zhuanzhi Papers)


Lipschitz constant · Lipschitz · estimation/estimator · learned · approximation

February 28, 2021

Planning with Learned Dynamics: Probabilistic Guarantees on Safety and Reachability via Lipschitz Constants


Craig Knuth, Glen Chou, Necmiye Ozay, Dmitry Berenson

From arXiv. Accepted at RA-L and ICRA 2021. Craig Knuth and Glen Chou contributed equally to this work.

We present a method for feedback motion planning of systems with unknown dynamics which provides probabilistic guarantees on safety, reachability, and goal stability. To find a domain in which a learned control-affine approximation of the true dynamics can be trusted, we estimate the Lipschitz constant of the difference between the true and learned dynamics, and ensure the estimate is valid with a given probability. Provided the system has at least as many controls as states, we also derive existence conditions for a one-step feedback law which can keep the real system within a small bound of a nominal trajectory planned with the learned dynamics. Our method imposes the feedback law existence as a constraint in a sampling-based planner, which returns a feedback policy around a nominal plan ensuring that, if the Lipschitz constant estimate is valid, the true system is safe during plan execution, reaches the goal, and is ultimately invariant in a small set about the goal. We demonstrate our approach by planning using learned models of a 6D quadrotor and a 7DOF Kuka arm. We show that a baseline which plans using the same learned dynamics without considering the error bound or the existence of the feedback law can fail to stabilize around the plan and become unsafe.
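The role of the Lipschitz constant in the abstract can be illustrated with a minimal sketch. It assumes the model error e(z) = f_true(z) - f_learned(z) has been sampled at points z_i = (x_i, u_i). The function names and the toy 1D system are hypothetical, and the largest pairwise slope used here is only an empirical lower bound on the true constant. It carries none of the probabilistic validity the abstract describes and is not presented as the authors' estimator. Given a constant L, the triangle inequality bounds the error at a new point by ||e(z)|| <= ||e(z_i)|| + L ||z - z_i||, which is what makes a "trusted domain" around the data possible.

```python
import numpy as np


def estimate_lipschitz(inputs, errors):
    """Largest pairwise slope ||e_i - e_j|| / ||z_i - z_j|| over the samples.

    This is a lower bound on the true Lipschitz constant of the error,
    not a statistically validated estimate.
    """
    inputs = np.asarray(inputs, dtype=float)   # shape (N, d_in)
    errors = np.asarray(errors, dtype=float)   # shape (N, d_out)
    diff_in = np.linalg.norm(inputs[:, None, :] - inputs[None, :, :], axis=-1)
    diff_err = np.linalg.norm(errors[:, None, :] - errors[None, :, :], axis=-1)
    mask = diff_in > 1e-12                     # skip identical points
    return float(np.max(diff_err[mask] / diff_in[mask]))


def error_bound(query, inputs, errors, lipschitz):
    """Bound on ||e(query)||: min over samples of ||e(z_i)|| + L * ||query - z_i||.

    The bound holds only if `lipschitz` is a valid Lipschitz constant of e.
    """
    inputs = np.asarray(inputs, dtype=float)
    errors = np.asarray(errors, dtype=float)
    dists = np.linalg.norm(inputs - np.asarray(query, dtype=float), axis=1)
    return float(np.min(np.linalg.norm(errors, axis=1) + lipschitz * dists))


# Hypothetical toy system: "true" dynamics sin(z), "learned" model z.
z = np.linspace(-1.0, 1.0, 21)[:, None]
e = np.sin(z) - z
L_hat = estimate_lipschitz(z, e)
print("estimated Lipschitz constant:", L_hat)
print("error bound at z = 0.55:", error_bound([0.55], z, e, L_hat))
```

Because the pairwise-slope estimate can undershoot the true constant, a planner relying on it would need an additional validity argument, which is the part the abstract attributes to the probabilistic guarantee.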


