ENTL: 嵌入式导航轨迹学习器 (ENTL: Embodied Navigation Trajectory Learner) - 专知论文

会员服务 ·

0

轨迹学习 · 学习器 · 嵌入式 · 嵌入 · 序列 ·

2023 年 4 月 5 日

ENTL: Embodied Navigation Trajectory Learner

翻译：ENTL: 嵌入式导航轨迹学习器

Klemen Kotar,Aaron Walsman,Roozbeh Mottaghi

We propose Embodied Navigation Trajectory Learner (ENTL), a method for extracting long sequence representations for embodied navigation. Our approach unifies world modeling, localization and imitation learning into a single sequence prediction task. We train our model using vector-quantized predictions of future states conditioned on current states and actions. ENTL's generic architecture enables the sharing of the the spatio-temporal sequence encoder for multiple challenging embodied tasks. We achieve competitive performance on navigation tasks using significantly less data than strong baselines while performing auxiliary tasks such as localization and future frame prediction (a proxy for world modeling). A key property of our approach is that the model is pre-trained without any explicit reward signal, which makes the resulting model generalizable to multiple tasks and environments.

翻译：我们提出了一种名为嵌入式导航轨迹学习器（ENTL）的方法，用于提取体现式导航的长序列表示。我们的方法将世界建模，定位和模仿学习统一为单一的序列预测任务。我们使用基于当前状态和动作条件的未来状态的矢量量化预测来训练模型。ENTL的通用架构使其在多个具有挑战性的体现式任务中可以共享空间时间序列编码器。尽管执行辅助任务，如本地化和未来帧预测（世界建模的代理），但我们使用比强大基线更少的数据来执行导航任务时实现了具有竞争力的性能。我们方法的一个关键属性是在没有任何明确的奖励信号的情况下预训练模型，这使得得到的模型可推广到多个任务和环境中。

1

相关内容

轨迹学习

【决策Transformers 导论】Introducing Decision Transformers on Hugging Face 🤗

【决策Transformers 导论】Introducing Decision Transformers on Hugging Face 🤗

专知会员服务

67+阅读 · 2022年3月29日

【视觉和语言导航:任务、方法和未来方向的综述】Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions

【视觉和语言导航:任务、方法和未来方向的综述】Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions

专知会员服务

36+阅读 · 2022年3月25日

如何使用TensorFlow 排序构建推荐系统? How to build a recommendation system using TensorFlow Ranking?

如何使用TensorFlow 排序构建推荐系统? How to build a recommendation system using TensorFlow Ranking?

专知会员服务

19+阅读 · 2022年3月13日

【CMU博士论文】机器人深度强化学习，128页pdf

【CMU博士论文】机器人深度强化学习，128页pdf

专知会员服务

133+阅读 · 2020年8月27日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【华盛顿大学】用于视觉和语言导航的多视图学习，Multi-View Learning for Vision-and-Language Navigation

【华盛顿大学】用于视觉和语言导航的多视图学习，Multi-View Learning for Vision-and-Language Navigation

专知会员服务

31+阅读 · 2020年3月11日

【CVPR 2019 | tutorial】自主汽车的感知、预测和大规模数据采集：Perception, Prediction, and Large Scale Data Collection for Autonomous Cars

【CVPR 2019 | tutorial】自主汽车的感知、预测和大规模数据采集：Perception, Prediction, and Large Scale Data Collection for Autonomous Cars

专知会员服务

33+阅读 · 2019年11月28日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

SSL-Lanes: 用于自动驾驶中运动预测的自监督学习

SSL-Lanes: 用于自动驾驶中运动预测的自监督学习

极市平台

6+阅读 · 2022年7月24日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【泡泡一分钟】SSD6D：基于RGB的三维检测和6自由度位姿估计(ICCV2017-159)

【泡泡一分钟】SSD6D：基于RGB的三维检测和6自由度位姿估计(ICCV2017-159)

泡泡机器人SLAM

17+阅读 · 2018年10月12日

【泡泡一分钟】PathTrack：使用路径监督的快速轨迹标注方法（ICCV2017-28）

【泡泡一分钟】PathTrack：使用路径监督的快速轨迹标注方法（ICCV2017-28）

泡泡机器人SLAM

10+阅读 · 2018年5月26日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

无人驾驶中4D场景实时解析算法研究

国家自然科学基金

12+阅读 · 2017年12月31日

基于压缩感知的高精度实时视觉跟踪方法研究

国家自然科学基金

3+阅读 · 2015年12月31日

重力/重力梯度补偿水下惯性导航系统理论和算法研究

国家自然科学基金

1+阅读 · 2014年12月31日

高精度实时水汽Raman激光雷达自标定方法研究

国家自然科学基金

1+阅读 · 2013年12月31日

大数据环境下基于耦合交通运行状态特征的公交客流OD预测方法

国家自然科学基金

0+阅读 · 2013年12月31日

面向人与Agent混合的多团队协作仿真训练方法研究

国家自然科学基金

19+阅读 · 2012年12月31日

非凸Hamilton系统的Aubry-Mather理论

国家自然科学基金

0+阅读 · 2012年12月31日

基于基准图解析重构的重力匹配辅助导航方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

新型顺磁性量子点的原位肝癌双模态靶向探测及术中实时导航精准切除研究

国家自然科学基金

0+阅读 · 2012年12月31日

面向室内复杂环境的RFID定位方法研究

国家自然科学基金

0+阅读 · 2011年12月31日

Masked Path Modeling for Vision-and-Language Navigation

Arxiv

0+阅读 · 2023年5月23日

Multi-object Video Generation from Single Frame Layouts

Arxiv

0+阅读 · 2023年5月23日

Expanding the Deployment Envelope of Behavior Prediction via Adaptive Meta-Learning

Arxiv

0+阅读 · 2023年5月23日

Adaptive Gradient Prediction for DNN Training

Arxiv

0+阅读 · 2023年5月22日

An Enhanced Res2Net with Local and Global Feature Fusion for Speaker Verification

Arxiv

0+阅读 · 2023年5月22日

GNM: A General Navigation Model to Drive Any Robot

Arxiv

0+阅读 · 2023年5月22日

Easy-to-Hard Learning for Information Extraction

Arxiv

0+阅读 · 2023年5月19日

Remembering What Is Important: A Factorised Multi-Head Retrieval and Auxiliary Memory Stabilisation Scheme for Human Motion Prediction

Arxiv

0+阅读 · 2023年5月19日

Cold-start Sequential Recommendation via Meta Learner

Cold-start Sequential Recommendation via Meta Learner

Arxiv

15+阅读 · 2020年12月10日

Total3DUnderstanding: Joint Layout, Object Pose and Mesh Reconstruction for Indoor Scenes from a Single Image

Total3DUnderstanding: Joint Layout, Object Pose and Mesh Reconstruction for Indoor Scenes from a Single Image

Arxiv

12+阅读 · 2020年2月27日

VIP会员

文章信息

相关主题

相关VIP内容

【决策Transformers 导论】Introducing Decision Transformers on Hugging Face 🤗

【决策Transformers 导论】Introducing Decision Transformers on Hugging Face 🤗

专知会员服务

67+阅读 · 2022年3月29日

【视觉和语言导航:任务、方法和未来方向的综述】Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions

【视觉和语言导航:任务、方法和未来方向的综述】Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions

专知会员服务

36+阅读 · 2022年3月25日

如何使用TensorFlow 排序构建推荐系统? How to build a recommendation system using TensorFlow Ranking?

如何使用TensorFlow 排序构建推荐系统? How to build a recommendation system using TensorFlow Ranking?

专知会员服务

19+阅读 · 2022年3月13日

【CMU博士论文】机器人深度强化学习，128页pdf

【CMU博士论文】机器人深度强化学习，128页pdf

专知会员服务

133+阅读 · 2020年8月27日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【华盛顿大学】用于视觉和语言导航的多视图学习，Multi-View Learning for Vision-and-Language Navigation

【华盛顿大学】用于视觉和语言导航的多视图学习，Multi-View Learning for Vision-and-Language Navigation

专知会员服务

31+阅读 · 2020年3月11日

【CVPR 2019 | tutorial】自主汽车的感知、预测和大规模数据采集：Perception, Prediction, and Large Scale Data Collection for Autonomous Cars

【CVPR 2019 | tutorial】自主汽车的感知、预测和大规模数据采集：Perception, Prediction, and Large Scale Data Collection for Autonomous Cars

专知会员服务

33+阅读 · 2019年11月28日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

人工智能驱动弹药制造现代化：美国陆军转型之路

《敏捷作战部署中枢纽-辐条基地选址优化研究》80页

构建军事人工智能信任体系始于破除黑盒机制

《乌克兰无人机产业：志愿者与政策在构建新兴无人机产业中的协同作用》最新报告

相关资讯

SSL-Lanes: 用于自动驾驶中运动预测的自监督学习

SSL-Lanes: 用于自动驾驶中运动预测的自监督学习

极市平台

6+阅读 · 2022年7月24日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【泡泡一分钟】SSD6D：基于RGB的三维检测和6自由度位姿估计(ICCV2017-159)

【泡泡一分钟】SSD6D：基于RGB的三维检测和6自由度位姿估计(ICCV2017-159)

泡泡机器人SLAM

17+阅读 · 2018年10月12日

【泡泡一分钟】PathTrack：使用路径监督的快速轨迹标注方法（ICCV2017-28）

【泡泡一分钟】PathTrack：使用路径监督的快速轨迹标注方法（ICCV2017-28）

泡泡机器人SLAM

10+阅读 · 2018年5月26日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

Masked Path Modeling for Vision-and-Language Navigation

Arxiv

0+阅读 · 2023年5月23日

Multi-object Video Generation from Single Frame Layouts

Arxiv

0+阅读 · 2023年5月23日

Expanding the Deployment Envelope of Behavior Prediction via Adaptive Meta-Learning

Arxiv

0+阅读 · 2023年5月23日

Adaptive Gradient Prediction for DNN Training

Arxiv

0+阅读 · 2023年5月22日

An Enhanced Res2Net with Local and Global Feature Fusion for Speaker Verification

Arxiv

0+阅读 · 2023年5月22日

GNM: A General Navigation Model to Drive Any Robot

Arxiv

0+阅读 · 2023年5月22日

Easy-to-Hard Learning for Information Extraction

Arxiv

0+阅读 · 2023年5月19日

Remembering What Is Important: A Factorised Multi-Head Retrieval and Auxiliary Memory Stabilisation Scheme for Human Motion Prediction

Arxiv

0+阅读 · 2023年5月19日

Cold-start Sequential Recommendation via Meta Learner

Cold-start Sequential Recommendation via Meta Learner

Arxiv

15+阅读 · 2020年12月10日

Total3DUnderstanding: Joint Layout, Object Pose and Mesh Reconstruction for Indoor Scenes from a Single Image

Total3DUnderstanding: Joint Layout, Object Pose and Mesh Reconstruction for Indoor Scenes from a Single Image

Arxiv

12+阅读 · 2020年2月27日

相关基金

无人驾驶中4D场景实时解析算法研究

国家自然科学基金

12+阅读 · 2017年12月31日

基于压缩感知的高精度实时视觉跟踪方法研究

国家自然科学基金

3+阅读 · 2015年12月31日

重力/重力梯度补偿水下惯性导航系统理论和算法研究

国家自然科学基金

1+阅读 · 2014年12月31日

高精度实时水汽Raman激光雷达自标定方法研究

国家自然科学基金

1+阅读 · 2013年12月31日

大数据环境下基于耦合交通运行状态特征的公交客流OD预测方法

国家自然科学基金

0+阅读 · 2013年12月31日

面向人与Agent混合的多团队协作仿真训练方法研究

国家自然科学基金

19+阅读 · 2012年12月31日

非凸Hamilton系统的Aubry-Mather理论

国家自然科学基金

0+阅读 · 2012年12月31日

基于基准图解析重构的重力匹配辅助导航方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

新型顺磁性量子点的原位肝癌双模态靶向探测及术中实时导航精准切除研究

国家自然科学基金

0+阅读 · 2012年12月31日

面向室内复杂环境的RFID定位方法研究

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员