PointGoal navigation in indoor environments is a fundamental task in which a personal robot must navigate to a specified point. Recent studies have solved this PointGoal navigation task with a near-perfect success rate in photo-realistically simulated environments, under the assumptions of noiseless actuation and, most importantly, perfect localization from GPS and compass sensors. However, an accurate GPS signal is difficult to obtain in real indoor environments. To improve PointGoal navigation accuracy without a GPS signal, we use visual odometry (VO) and propose a novel action integration module (AIM) trained in an unsupervised manner. Specifically, the unsupervised VO computes the relative pose of the agent from the re-projection error of two adjacent frames, and path integration of these estimates then replaces the accurate GPS signal. The pseudo position estimated by VO is used to train the action integration module, which helps the agent update its internal perception of location and improves the navigation success rate. Training and inference use only RGB, depth, collision, and self-action information. Experiments show that the proposed system achieves satisfactory results and outperforms partially supervised learning algorithms on the popular Gibson dataset.
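To make the core idea concrete, the following is a minimal sketch of path integration: accumulating VO-estimated relative poses into a global pose that stands in for the GPS+compass reading. This is an illustration under assumed conventions, not the paper's implementation; it assumes a planar agent state (x, z, yaw) and egocentric per-step VO deltas (dx, dz, dyaw), and all names are hypothetical.

```python
import numpy as np

def integrate_pose(pose, delta):
    """Fold one VO-estimated egocentric motion (dx, dz, dyaw) into the
    agent's global (x, z, yaw) estimate. The accumulated pose serves as
    a pseudo GPS+compass signal for the navigation policy."""
    x, z, yaw = pose
    dx, dz, dyaw = delta
    # Rotate the egocentric displacement into the world frame.
    x += dx * np.cos(yaw) - dz * np.sin(yaw)
    z += dx * np.sin(yaw) + dz * np.cos(yaw)
    # Update heading and wrap it to [-pi, pi).
    yaw = (yaw + dyaw + np.pi) % (2 * np.pi) - np.pi
    return (x, z, yaw)

# Example: replay a short trajectory of per-step VO estimates
# (forward 0.25 m, turn 10 degrees, forward 0.25 m).
pose = (0.0, 0.0, 0.0)
for delta in [(0.25, 0.0, 0.0),
              (0.0, 0.0, np.deg2rad(10.0)),
              (0.25, 0.0, 0.0)]:
    pose = integrate_pose(pose, delta)
print(pose)  # pseudo position/heading replacing the GPS+compass sensor
```

Because each per-step delta carries some error, the integrated pose drifts over long trajectories; this is the noise the action integration module is trained to cope with.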