规划与学术动态:通过Lipschitz常量保证安全和可达性 (Planning with Learned Dynamics: Guaranteed Safety and Reachability via Lipschitz Constants) - 专知论文

会员服务 ·

0

Lipschitz常数 · Lipschitz · 学成 · 估计/估计量 · MoDELS ·

2021 年 2 月 17 日

Planning with Learned Dynamics: Guaranteed Safety and Reachability via Lipschitz Constants

翻译：规划与学术动态:通过Lipschitz常量保证安全和可达性

Craig Knuth,Glen Chou,Necmiye Ozay,Dmitry Berenson

from arxiv, Accepted at RA-L and submitted to ICRA 2021. Craig Knuth and Glen Chou contributed equally to this work

We present an approach for feedback motion planning of systems with unknown dynamics which provides guarantees on safety, reachability, and stability about the goal. Given a learned control-affine approximation of the true dynamics, we estimate the Lipschitz constant of the difference between the true and learned dynamics to determine a trusted domain for our learned model. Provided the system has at least as many controls as states, we further derive the conditions under which a one-step feedback law exists. This allows fora small bound on the tracking error when the trajectory is executed on the real system. Our method imposes a check for the existence of the feedback law as constraints in a sampling-based planner, which returns a feedback policy ensuring that under the true dynamics, the goal is reachable, the path is safe in execution, and the closed-loop system is invariant in a small set about the goal. We demonstrate our approach by planning using learned models of a 6D quadrotor and a 7DOF Kuka arm.We show that a baseline which plans using the same learned dynamics without considering the error bound or the existence of the feedback law can fail to stabilize around the plan and become unsafe.

翻译：我们提出一种方法,用于对具有未知动态的系统进行反馈运动规划,这些系统对安全、可达性和目标稳定性提供保障。根据对真实动态的精明控制-节奏近似,我们估计利普西茨对真实动态和学习动态之间的差异的常数,以确定我们所学模型的可信任域。如果系统至少拥有与州相同的控制,我们进一步得出存在一步骤反馈法的条件。这样,当轨迹在实际系统上执行时,就可以对追踪错误有小限制。我们的方法是将反馈法的存在作为基于抽样的规划师的制约因素进行检查,这种方法将反馈政策带来反馈政策,确保在真实动态下,目标是可以达到的,路径是安全的,而闭环系统对目标来说是一小套不变的。我们通过使用6D quadrotoror和7DOF Kuka arm的已知模型来规划我们的方法展示了我们的方法。我们显示,在不考虑错误或反馈法的存在的情况下,使用同一已知动态规划的基线可能无法在计划周围稳定并变得不安全。

0

相关内容

Lipschitz常数

Lipschitz常数

不可错过！「强化学习导论」多伦多大学2021课程，附SLIDES与140页pdf

不可错过！「强化学习导论」多伦多大学2021课程，附SLIDES与140页pdf

专知会员服务

67+阅读 · 2021年3月24日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【经典书】自主机器人导论:运动学，感知，定位和规划，241页pdf

【经典书】自主机器人导论:运动学，感知，定位和规划，241页pdf

专知会员服务

45+阅读 · 2020年11月18日

【硬核课】机器人学习课程，UT Austin朱玉可博士讲述自主机器人的人工智能与机器学习机器学习算法

【硬核课】机器人学习课程，UT Austin朱玉可博士讲述自主机器人的人工智能与机器学习机器学习算法

专知会员服务

40+阅读 · 2020年9月21日

【Manning新书】现代Java实战，592页pdf

【Manning新书】现代Java实战，592页pdf

专知会员服务

101+阅读 · 2020年5月22日

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

专知会员服务

111+阅读 · 2020年5月15日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【新书】Python编程基础，669页pdf

【新书】Python编程基础，669页pdf

专知会员服务

196+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【泡泡图灵智库】PL-VIO：使用点和线特征的紧耦合单目视觉惯性里程计

【泡泡图灵智库】PL-VIO：使用点和线特征的紧耦合单目视觉惯性里程计

泡泡机器人SLAM

53+阅读 · 2019年7月9日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

CCF A类 | 顶级会议RTSS 2019诚邀稿件

CCF A类 | 顶级会议RTSS 2019诚邀稿件

Call4Papers

10+阅读 · 2019年4月17日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

RL 真经

CreateAMind

5+阅读 · 2018年12月28日

【NIPS2018】接收论文列表

【NIPS2018】接收论文列表

专知

5+阅读 · 2018年9月10日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

Revisiting the Approximate Carathéodory Problem via the Frank-Wolfe Algorithm

Arxiv

0+阅读 · 2021年4月12日

Probabilistic Radio-Visual Active Sensing for Search and Tracking

Arxiv

0+阅读 · 2021年4月11日

Efficient Path Planning in Narrow Passages via Closed-Form Minkowski Operations

Arxiv

0+阅读 · 2021年4月10日

The Menu-Size Complexity of Revenue Approximation

Arxiv

0+阅读 · 2021年4月9日

Characteristic Logics for Behavioural Hemimetrics via Fuzzy Lax Extensions

Arxiv

0+阅读 · 2021年4月9日

A Bayesian Approach to Reinforcement Learning of Vision-Based Vehicular Control

A Bayesian Approach to Reinforcement Learning of Vision-Based Vehicular Control

Arxiv

1+阅读 · 2021年4月8日

Exploration-RRT: A multi-objective Path Planning and Exploration Framework for Unknown and Unstructured Environments

Arxiv

0+阅读 · 2021年4月8日

Lipschitz Lifelong Reinforcement Learning

Arxiv

4+阅读 · 2020年1月17日

CIRL: Controllable Imitative Reinforcement Learning for Vision-based Self-driving

CIRL: Controllable Imitative Reinforcement Learning for Vision-based Self-driving

Arxiv

8+阅读 · 2018年7月10日

Safety-aware Adaptive Reinforcement Learning with Applications to Brushbot Navigation

Arxiv

4+阅读 · 2018年1月29日

VIP会员

文章信息

相关主题

Lipschitz常数

估计/估计量

相关VIP内容

不可错过！「强化学习导论」多伦多大学2021课程，附SLIDES与140页pdf

不可错过！「强化学习导论」多伦多大学2021课程，附SLIDES与140页pdf

专知会员服务

67+阅读 · 2021年3月24日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【经典书】自主机器人导论:运动学，感知，定位和规划，241页pdf

【经典书】自主机器人导论:运动学，感知，定位和规划，241页pdf

专知会员服务

45+阅读 · 2020年11月18日

【硬核课】机器人学习课程，UT Austin朱玉可博士讲述自主机器人的人工智能与机器学习机器学习算法

【硬核课】机器人学习课程，UT Austin朱玉可博士讲述自主机器人的人工智能与机器学习机器学习算法

专知会员服务

40+阅读 · 2020年9月21日

【Manning新书】现代Java实战，592页pdf

【Manning新书】现代Java实战，592页pdf

专知会员服务

101+阅读 · 2020年5月22日

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

专知会员服务

111+阅读 · 2020年5月15日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【新书】Python编程基础，669页pdf

【新书】Python编程基础，669页pdf

专知会员服务

196+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

操作系统智能体：基于多模态大模型（MLLM）的通用计算设备智能体综述

《美国太空军系统全生命周期建模、仿真与分析效能提升方案》最新84页报告

【博士论文】推进数据高效的深度学习：非参数 Transformer、主动测试与上下文学习

自主人工智能：未来战争是否将是自主化的？

相关资讯

【泡泡图灵智库】PL-VIO：使用点和线特征的紧耦合单目视觉惯性里程计

【泡泡图灵智库】PL-VIO：使用点和线特征的紧耦合单目视觉惯性里程计

泡泡机器人SLAM

53+阅读 · 2019年7月9日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

CCF A类 | 顶级会议RTSS 2019诚邀稿件

CCF A类 | 顶级会议RTSS 2019诚邀稿件

Call4Papers

10+阅读 · 2019年4月17日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

RL 真经

CreateAMind

5+阅读 · 2018年12月28日

【NIPS2018】接收论文列表

【NIPS2018】接收论文列表

专知

5+阅读 · 2018年9月10日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

相关论文

Revisiting the Approximate Carathéodory Problem via the Frank-Wolfe Algorithm

Arxiv

0+阅读 · 2021年4月12日

Probabilistic Radio-Visual Active Sensing for Search and Tracking

Arxiv

0+阅读 · 2021年4月11日

Efficient Path Planning in Narrow Passages via Closed-Form Minkowski Operations

Arxiv

0+阅读 · 2021年4月10日

The Menu-Size Complexity of Revenue Approximation

Arxiv

0+阅读 · 2021年4月9日

Characteristic Logics for Behavioural Hemimetrics via Fuzzy Lax Extensions

Arxiv

0+阅读 · 2021年4月9日

A Bayesian Approach to Reinforcement Learning of Vision-Based Vehicular Control

A Bayesian Approach to Reinforcement Learning of Vision-Based Vehicular Control

Arxiv

1+阅读 · 2021年4月8日

Exploration-RRT: A multi-objective Path Planning and Exploration Framework for Unknown and Unstructured Environments

Arxiv

0+阅读 · 2021年4月8日

Lipschitz Lifelong Reinforcement Learning

Arxiv

4+阅读 · 2020年1月17日

CIRL: Controllable Imitative Reinforcement Learning for Vision-based Self-driving

CIRL: Controllable Imitative Reinforcement Learning for Vision-based Self-driving

Arxiv

8+阅读 · 2018年7月10日

Safety-aware Adaptive Reinforcement Learning with Applications to Brushbot Navigation

Arxiv

4+阅读 · 2018年1月29日

微信扫码咨询专知VIP会员