控制变换器: 通过 PRM 辅助返回条件序列模型在未知环境中的机器人导航 (Control Transformer: Robot Navigation in Unknown Environments through PRM-Guided Return-Conditioned Sequence Modeling) - 专知论文

会员服务 ·

0

回合 · 控制器 · MoDELS · Learning · 变换 ·

2022 年 11 月 11 日

Control Transformer: Robot Navigation in Unknown Environments through PRM-Guided Return-Conditioned Sequence Modeling

翻译：控制变换器: 通过 PRM 辅助返回条件序列模型在未知环境中的机器人导航

Daniel Lawson,Ahmed H. Qureshi

Learning long-horizon tasks such as navigation has presented difficult challenges for successfully applying reinforcement learning. However, from another perspective, under a known environment model, methods such as sampling-based planning can robustly find collision-free paths in environments without learning. In this work, we propose Control Transformer which models return-conditioned sequences from low-level policies guided by a sampling-based Probabilistic Roadmap (PRM) planner. Once trained, we demonstrate that our framework can solve long-horizon navigation tasks using only local information. We evaluate our approach on partially-observed maze navigation with MuJoCo robots, including Ant, Point, and Humanoid, and show that Control Transformer can successfully navigate large mazes and generalize to new, unknown environments. Additionally, we apply our method to a differential drive robot (Turtlebot3) and show zero-shot sim2real transfer under noisy observations.

翻译：诸如导航等学习长视线任务对成功应用强化学习提出了困难的挑战。然而,从另一个角度看,根据已知的环境模型,抽样规划等方法可以在不学习的情况下在环境中强有力地找到无碰撞路径。在这项工作中,我们提议控制变异器,该变异器在基于取样的概率性路线图(PRM)规划师的指导下,从低层次的政策中模拟有回归条件的序列。经过培训后,我们证明我们的框架仅使用当地信息就能解决长视线导航任务。我们评估了我们与包括Ant、Point和人类类人在内的穆乔科机器人进行部分观测的迷宫导航的方法,并表明控制变异器能够成功导航大型迷宫,并概括到新的、未知的环境。此外,我们将我们的方法应用到一个有差异的驱动器(Turtetlebot3),并显示在噪音观测下零发的Sim2真实传输。

0

相关内容

【决策Transformers 导论】Introducing Decision Transformers on Hugging Face 🤗

【决策Transformers 导论】Introducing Decision Transformers on Hugging Face 🤗

专知会员服务

68+阅读 · 2022年3月29日

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

54+阅读 · 2020年9月7日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

专知

13+阅读 · 2018年6月24日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

阵列式BiOCl/C60单晶面暴露自组装薄膜的制备与产氢性能研究

国家自然科学基金

0+阅读 · 2015年12月31日

基于连续时间PWA模型的混杂系统预测控制研究

国家自然科学基金

0+阅读 · 2015年12月31日

Trop2对CBSCs移植修复梗死心肌的影响及机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

控制有机半导体材料分子按照face-on 方式排列的高性能薄膜晶体管的研究

国家自然科学基金

0+阅读 · 2012年12月31日

可压缩Navier-Stokes方程及相关流体动力学模型的研究

国家自然科学基金

0+阅读 · 2011年12月31日

云制造环境下基于SOOA的动态服务资源集成与协调管理的研究

国家自然科学基金

0+阅读 · 2011年12月31日

南海深海沉积物来源真菌活性次级代谢产物研究

国家自然科学基金

0+阅读 · 2009年12月31日

ROS和电子转移诱导的DNA损伤机理的理论研究

国家自然科学基金

0+阅读 · 2009年12月31日

新型高性能传动机构变形协调设计方法研究

国家自然科学基金

0+阅读 · 2009年12月31日

穿膜肽Penetratin及其衍生物的解离动力学研究

国家自然科学基金

0+阅读 · 2008年12月31日

Locomotion-Action-Manipulation: Synthesizing Human-Scene Interactions in Complex 3D Environments

Arxiv

0+阅读 · 2023年1月9日

Verifying Learning-Based Robotic Navigation Systems

Arxiv

0+阅读 · 2023年1月9日

A soft robot that adapts to environments through shape change

Arxiv

0+阅读 · 2023年1月9日

Bidirectional Learning for Offline Model-based Biological Sequence Design

Arxiv

0+阅读 · 2023年1月7日

Optimization-Based Reference Generator for Nonlinear Model Predictive Control of Legged Robots

Arxiv

0+阅读 · 2023年1月6日

Centralized Cooperative Exploration Policy for Continuous Control Tasks

Arxiv

0+阅读 · 2023年1月6日

Learning Goal-Conditioned Policies Offline with Self-Supervised Reward Shaping

Learning Goal-Conditioned Policies Offline with Self-Supervised Reward Shaping

Arxiv

0+阅读 · 2023年1月5日

Zero-shot object goal visual navigation

Arxiv

0+阅读 · 2023年1月5日

Decentralized and Communication-Free Multi-Robot Navigation through Distributed Games

Arxiv

40+阅读 · 2021年9月15日

Sequential Scenario-Specific Meta Learner for Online Recommendation

Sequential Scenario-Specific Meta Learner for Online Recommendation

Arxiv

16+阅读 · 2019年6月2日

VIP会员

文章信息

相关主题

相关VIP内容

【决策Transformers 导论】Introducing Decision Transformers on Hugging Face 🤗

【决策Transformers 导论】Introducing Decision Transformers on Hugging Face 🤗

专知会员服务

68+阅读 · 2022年3月29日

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

54+阅读 · 2020年9月7日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《俄乌战争中的无人系统：新的战争方式与新兴趋势——来自前线的印象》报告

《海上自主水面船舶远程操作中心：安全可持续运行的多维度分析》

多模态大语言模型下游调优中“保持自我”的重要性

隐身自主无人水下航行器技术如何变革水下作战并重塑海军竞争

相关资讯

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

专知

13+阅读 · 2018年6月24日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

Locomotion-Action-Manipulation: Synthesizing Human-Scene Interactions in Complex 3D Environments

Arxiv

0+阅读 · 2023年1月9日

Verifying Learning-Based Robotic Navigation Systems

Arxiv

0+阅读 · 2023年1月9日

A soft robot that adapts to environments through shape change

Arxiv

0+阅读 · 2023年1月9日

Bidirectional Learning for Offline Model-based Biological Sequence Design

Arxiv

0+阅读 · 2023年1月7日

Optimization-Based Reference Generator for Nonlinear Model Predictive Control of Legged Robots

Arxiv

0+阅读 · 2023年1月6日

Centralized Cooperative Exploration Policy for Continuous Control Tasks

Arxiv

0+阅读 · 2023年1月6日

Learning Goal-Conditioned Policies Offline with Self-Supervised Reward Shaping

Learning Goal-Conditioned Policies Offline with Self-Supervised Reward Shaping

Arxiv

0+阅读 · 2023年1月5日

Zero-shot object goal visual navigation

Arxiv

0+阅读 · 2023年1月5日

Decentralized and Communication-Free Multi-Robot Navigation through Distributed Games

Arxiv

40+阅读 · 2021年9月15日

Sequential Scenario-Specific Meta Learner for Online Recommendation

Sequential Scenario-Specific Meta Learner for Online Recommendation

Arxiv

16+阅读 · 2019年6月2日

相关基金

阵列式BiOCl/C60单晶面暴露自组装薄膜的制备与产氢性能研究

国家自然科学基金

0+阅读 · 2015年12月31日

基于连续时间PWA模型的混杂系统预测控制研究

国家自然科学基金

0+阅读 · 2015年12月31日

Trop2对CBSCs移植修复梗死心肌的影响及机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

控制有机半导体材料分子按照face-on 方式排列的高性能薄膜晶体管的研究

国家自然科学基金

0+阅读 · 2012年12月31日

可压缩Navier-Stokes方程及相关流体动力学模型的研究

国家自然科学基金

0+阅读 · 2011年12月31日

云制造环境下基于SOOA的动态服务资源集成与协调管理的研究

国家自然科学基金

0+阅读 · 2011年12月31日

南海深海沉积物来源真菌活性次级代谢产物研究

国家自然科学基金

0+阅读 · 2009年12月31日

ROS和电子转移诱导的DNA损伤机理的理论研究

国家自然科学基金

0+阅读 · 2009年12月31日

新型高性能传动机构变形协调设计方法研究

国家自然科学基金

0+阅读 · 2009年12月31日

穿膜肽Penetratin及其衍生物的解离动力学研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员