Reinforcement Learning (RL) algorithms have achieved remarkable performance in decision-making and control tasks because they can reason about long-term, cumulative reward through trial and error. However, during RL training, applying this trial-and-error approach to real-world robots operating in safety-critical environments may lead to collisions. To address this challenge, this paper proposes a Reachability-based Trajectory Safeguard (RTS), which leverages reachability analysis to ensure safety during training and operation. Given a known (but uncertain) model of a robot, RTS precomputes a Forward Reachable Set (FRS) of the robot tracking a continuum of parameterized trajectories. At runtime, the RL agent selects from this continuum in a receding-horizon fashion to control the robot; the FRS is used to identify whether the agent's choice is safe, and to adjust unsafe choices. The efficacy of this method is demonstrated on three nonlinear robot models, including a 12-D quadrotor drone, in simulation and in comparison with state-of-the-art safe motion planning methods.
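The runtime behavior described above amounts to a safety filter on the agent's trajectory-parameter choice. Below is a minimal sketch of that idea, assuming a discretized sample of the parameter continuum and a boolean FRS-vs-obstacle intersection test; the names `rts_safety_layer`, `frs_check`, and `candidate_params` are hypothetical illustrations, not the paper's implementation.

```python
import numpy as np

def rts_safety_layer(p_rl, frs_check, candidate_params):
    """Receding-horizon safety filter: keep the RL agent's trajectory
    parameter p_rl if its precomputed FRS is collision-free; otherwise
    substitute the nearest parameter that is verified safe."""
    if frs_check(p_rl):
        return p_rl  # agent's choice is certified safe; execute it as-is
    # Adjust the unsafe choice: search a discretized sample of the
    # continuum for the closest parameter whose FRS avoids all obstacles.
    safe = [p for p in candidate_params if frs_check(p)]
    if not safe:
        return None  # no safe parameter this step; caller runs a fail-safe maneuver
    return min(safe, key=lambda p: np.linalg.norm(p - p_rl))

# Toy usage: 1-D trajectory parameter, "unsafe" whenever |p| > 0.5
# (a stand-in for the FRS/obstacle intersection test).
frs_check = lambda p: np.abs(p).max() <= 0.5
candidate_params = np.linspace(-1.0, 1.0, 9).reshape(-1, 1)
print(rts_safety_layer(np.array([0.8]), frs_check, candidate_params))  # -> [0.5]
```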