December 2, 2021

Differentiable Spatial Planning using Transformers

Devendra Singh Chaplot,Deepak Pathak,Jitendra Malik

From arXiv. Published at ICML 2021. See the project webpage at https://devendrachaplot.github.io/projects/spatial-planning-transformers

We consider the problem of spatial path planning. In contrast to the classical solutions which optimize a new plan from scratch and assume access to the full map with ground truth obstacle locations, we learn a planner from the data in a differentiable manner that allows us to leverage statistical regularities from past data. We propose Spatial Planning Transformers (SPT), which given an obstacle map learns to generate actions by planning over long-range spatial dependencies, unlike prior data-driven planners that propagate information locally via convolutional structure in an iterative manner. In the setting where the ground truth map is not known to the agent, we leverage pre-trained SPTs in an end-to-end framework that has the structure of mapper and planner built into it which allows seamless generalization to out-of-distribution maps and goals. SPTs outperform prior state-of-the-art differentiable planners across all the setups for both manipulation and navigation tasks, leading to an absolute improvement of 7-19%.
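To make the contrast in the abstract concrete, here is a minimal sketch of the classical setting it describes: planning from scratch on a fully known obstacle map. It is not the authors' method or code. The function names (`distance_to_goal`, `local_propagation_steps`), the grid encoding (0 = free, 1 = obstacle), and the 4-connected movement model are all assumptions made for illustration. The second function counts how many one-cell-per-step propagation sweeps would be needed for goal information to reach every reachable cell, which hints at why purely local, iterative propagation can struggle with long-range dependencies.

```python
from collections import deque


def distance_to_goal(grid, goal):
    """Breadth-first search from the goal over 4-connected free cells.

    grid: list of lists, 0 = free cell, 1 = obstacle (assumed encoding).
    goal: (row, col) tuple.
    Returns a grid of shortest-path distances; None marks obstacles
    and cells that cannot reach the goal.
    """
    rows, cols = len(grid), len(grid[0])
    dist = [[None] * cols for _ in range(rows)]
    goal_r, goal_c = goal
    if grid[goal_r][goal_c] == 1:
        return dist  # goal sits inside an obstacle: nothing is reachable
    dist[goal_r][goal_c] = 0
    queue = deque([(goal_r, goal_c)])
    while queue:
        r, c = queue.popleft()
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nr, nc = r + dr, c + dc
            inside = 0 <= nr < rows and 0 <= nc < cols
            if inside and grid[nr][nc] == 0 and dist[nr][nc] is None:
                dist[nr][nc] = dist[r][c] + 1
                queue.append((nr, nc))
    return dist


def local_propagation_steps(grid, goal):
    """Sweeps needed if information spreads one cell per sweep.

    This equals the largest shortest-path distance to the goal.
    """
    dist = distance_to_goal(grid, goal)
    reachable = [d for row in dist for d in row if d is not None]
    return max(reachable, default=0)


if __name__ == "__main__":
    # A wall forces a detour: the top-left cell is 2 cells from the
    # goal in a straight line, but 8 steps away along a valid path.
    demo_grid = [
        [0, 0, 0, 0],
        [1, 1, 1, 0],
        [0, 0, 0, 0],
    ]
    demo_goal = (2, 0)
    print(distance_to_goal(demo_grid, demo_goal)[0][0])   # 8
    print(local_propagation_steps(demo_grid, demo_goal))  # 8
```

In this toy map the number of local sweeps grows with path length rather than straight-line distance, which is the kind of long-range dependency the abstract says SPTs are designed to capture directly.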
