持续环境中愿景和语文导航监督 (Language-Aligned Waypoint (LAW) Supervision for Vision-and-Language Navigation in Continuous Environments)

In the Vision-and-Language Navigation (VLN) task an embodied agent navigates a 3D environment, following natural language instructions. A challenge in this task is how to handle 'off the path' scenarios where an agent veers from a reference path. Prior work supervises the agent with actions based on the shortest path from the agent's location to the goal, but such goal-oriented supervision is often not in alignment with the instruction. Furthermore, the evaluation metrics employed by prior work do not measure how much of a language instruction the agent is able to follow. In this work, we propose a simple and effective language-aligned supervision scheme, and a new metric that measures the number of sub-instructions the agent has completed during navigation.

翻译：在视野和语言导航(VLN)任务中,一个体现的代理人按照自然语言指示在3D环境中航行。这一任务中的一项挑战是如何处理“脱离路径”的情景,即代理人的车辆从参照路径离开路径。先前的工作以从代理人所在地到目标的最短路径监督代理人的行动,但这种面向目标的监督往往与指示不一致。此外,以前的工作所采用的评价指标并不衡量代理人能够遵循的语言指示的多少。在这项工作中,我们提出了一个简单而有效的与语文一致的监督计划,以及衡量代理人在航行期间完成的次级指令数量的新指标。

相关内容

Continuity

关注 4

让 iOS 8 和 OS X Yosemite 无缝切换的一个新特性。 > Apple products have always been designed to work together beautifully. But now they may really surprise you. With iOS 8 and OS X Yosemite, you’ll be able to do more wonderful things than ever before.

Source: Apple - iOS 8

【CVPR2020】视觉导航的神经拓扑SLAM，56页ppt，Neural Topological SLAM for Visual Navigation

专知会员服务

14+阅读 · 2020年6月18日

【CVPR2020】视觉导航的神经拓扑SLAM，Neural Topological SLAM for Visual Navigation

专知会员服务

52+阅读 · 2020年5月26日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【华盛顿大学】用于视觉和语言导航的多视图学习，Multi-View Learning for Vision-and-Language Navigation

专知会员服务

31+阅读 · 2020年3月11日