APS: 大型多式多式室内照相机定位系统 (APS: A Large-Scale Multi-Modal Indoor Camera Positioning System) - 专知论文

会员服务 ·

0

INFORMS · 估计/估计量 · Integration · Pair · 端到端 ·

2021 年 2 月 8 日

APS: A Large-Scale Multi-Modal Indoor Camera Positioning System

翻译：APS: 大型多式多式室内照相机定位系统

Ali Ghofrani,Rahil Mahdian Toroghi,Seyed Mojtaba Tabatabaie

from arxiv, 15 pages, 11 figures, MedPRAI 2020

Navigation inside a closed area with no GPS-signal accessibility is a highly challenging task. In order to tackle this problem, recently the imaging-based methods have grabbed the attention of many researchers. These methods either extract the features (e.g. using SIFT, or SOSNet) and map the descriptive ones to the camera position and rotation information, or deploy an end-to-end system that directly estimates this information out of RGB images, similar to PoseNet. While the former methods suffer from heavy computational burden during the test process, the latter suffers from lack of accuracy and robustness against environmental changes and object movements. However, end-to-end systems are quite fast during the test and inference and are pretty qualified for real-world applications, even though their training phase could be longer than the former ones. In this paper, a novel multi-modal end-to-end system for large-scale indoor positioning has been proposed, namely APS (Alpha Positioning System), which integrates a Pix2Pix GAN network to reconstruct the point cloud pair of the input query image, with a deep CNN network in order to robustly estimate the position and rotation information of the camera. For this integration, the existing datasets have the shortcoming of paired RGB/point cloud images for indoor environments. Therefore, we created a new dataset to handle this situation. By implementing the proposed APS system, we could achieve a highly accurate camera positioning with a precision level of less than a centimeter.

翻译：在一个没有GPS- 信号无障碍的封闭区域内的导航系统是一项极具挑战性的任务。为了解决这一问题,最近基于成像的方法已经吸引了许多研究人员的注意。这些方法要么提取特征(例如使用SIFT,或SOSNet),将描述性系统映射到相机的位置和旋转信息,要么将描述性系统映射到相机的位置和旋转信息,或者部署一个端对端系统,直接从RGB图像(类似于PoseNet)中估算这些信息,类似于PoseNet。虽然以前的方法在测试过程中有沉重的计算负担,但前者在环境变化和物体移动方面缺乏准确性和稳健性。然而,在测试和推断期间,端对端系统相当快,而且非常适合现实世界应用。在本文件中,提出了一个新的大型室内定位多模式端对端系统,即APS(Apha 定位系统),它可以结合一个 Pix2Pix GAN 网络来重建输入查询图像的点对焦云。一个更深的CNN网络在测试和深度的网络中非常快速的精确度上非常适合现实应用应用,尽管它们的训练阶段可能比新的数据定位环境更精确地估计,但现在的 RRC- 正在建立一个新的数据环境,从而实现新的图像,从而实现新的定位。

0

相关内容

INFORMS

《计算机信息》杂志发表高质量的论文，扩大了运筹学和计算的范围，寻求有关理论、方法、实验、系统和应用方面的原创研究论文、新颖的调查和教程论文，以及描述新的和有用的软件工具的论文。官网链接：https://pubsonline.informs.org/journal/ijoc

【NeurIPS2020】针对弱监督目标检测的综合注意力自蒸馏

【NeurIPS2020】针对弱监督目标检测的综合注意力自蒸馏

专知会员服务

32+阅读 · 2020年11月12日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

CVPR 2020 | MetaFuse：用于人体姿态估计的预训练信息融合模型

CVPR 2020 | MetaFuse：用于人体姿态估计的预训练信息融合模型

专知会员服务

25+阅读 · 2020年4月2日

CVPR 2020 论文开源项目合集

专知会员服务

110+阅读 · 2020年3月12日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

92+阅读 · 2020年2月12日

【2020新书】Python大数据处理，Mastering Large Datasets with Python

【2020新书】Python大数据处理，Mastering Large Datasets with Python

专知会员服务

54+阅读 · 2020年2月2日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

无人机视觉挑战赛 | ICCV 2019 Workshop—VisDrone2019

无人机视觉挑战赛 | ICCV 2019 Workshop—VisDrone2019

PaperWeekly

7+阅读 · 2019年5月5日

【泡泡一分钟】LIMO：激光和单目相机融合的视觉里程计

【泡泡一分钟】LIMO：激光和单目相机融合的视觉里程计

泡泡机器人SLAM

12+阅读 · 2019年1月16日

【泡泡一分钟】用于评估视觉惯性里程计的TUM VI数据集

【泡泡一分钟】用于评估视觉惯性里程计的TUM VI数据集

泡泡机器人SLAM

11+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【泡泡前沿追踪】跟踪SLAM前沿动态系列之IROS2018

【泡泡前沿追踪】跟踪SLAM前沿动态系列之IROS2018

泡泡机器人SLAM

29+阅读 · 2018年10月28日

【泡泡图灵智库】Flowdometry：基于光流和深度学习的视觉里程计（IWCACV-1）

【泡泡图灵智库】Flowdometry：基于光流和深度学习的视觉里程计（IWCACV-1）

泡泡机器人SLAM

5+阅读 · 2018年9月7日

已删除

雪球

6+阅读 · 2018年8月19日

2017-最全手势识别/跟踪相关资源大列表分享（论文、数据集、比赛等）

2017-最全手势识别/跟踪相关资源大列表分享（论文、数据集、比赛等）

深度学习与NLP

64+阅读 · 2017年10月29日

Cooperative UWB-Based Localization for Outdoors Positioning and Navigation of UAVs aided by Ground Robots

Arxiv

0+阅读 · 2021年4月1日

Learning Precise 3D Manipulation from Multiple Uncalibrated Cameras

Arxiv

0+阅读 · 2021年3月31日

Human POSEitioning System (HPS): 3D Human Pose Estimation and Self-localization in Large Scenes from Body-Mounted Sensors

Arxiv

0+阅读 · 2021年3月31日

SCFusion: Real-time Incremental Scene Reconstruction with Semantic Completion

Arxiv

0+阅读 · 2021年3月31日

Graph-Based Topological Exploration Planning in Large-Scale 3D Environments

Arxiv

0+阅读 · 2021年3月31日

Total3DUnderstanding: Joint Layout, Object Pose and Mesh Reconstruction for Indoor Scenes from a Single Image

Total3DUnderstanding: Joint Layout, Object Pose and Mesh Reconstruction for Indoor Scenes from a Single Image

Arxiv

12+阅读 · 2020年2月27日

Real-time Scalable Dense Surfel Mapping

Real-time Scalable Dense Surfel Mapping

Arxiv

5+阅读 · 2019年9月10日

Clustered Object Detection in Aerial Images

Clustered Object Detection in Aerial Images

Arxiv

5+阅读 · 2019年8月27日

Geometry-Based Multiple Camera Head Detection in Dense Crowds

Geometry-Based Multiple Camera Head Detection in Dense Crowds

Arxiv

3+阅读 · 2018年8月2日

A Framework for Evaluating 6-DOF Object Trackers

Arxiv

6+阅读 · 2018年3月28日

VIP会员

文章信息

相关主题

估计/估计量

相关VIP内容

【NeurIPS2020】针对弱监督目标检测的综合注意力自蒸馏

【NeurIPS2020】针对弱监督目标检测的综合注意力自蒸馏

专知会员服务

32+阅读 · 2020年11月12日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

CVPR 2020 | MetaFuse：用于人体姿态估计的预训练信息融合模型

CVPR 2020 | MetaFuse：用于人体姿态估计的预训练信息融合模型

专知会员服务

25+阅读 · 2020年4月2日

CVPR 2020 论文开源项目合集

专知会员服务

110+阅读 · 2020年3月12日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

92+阅读 · 2020年2月12日

【2020新书】Python大数据处理，Mastering Large Datasets with Python

【2020新书】Python大数据处理，Mastering Large Datasets with Python

专知会员服务

54+阅读 · 2020年2月2日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

扩散模型中的 Transformer：图像生成及其延展应用询问 ChatGPT

281页pdf《神经网络设计入门》

【普林斯顿博士论文】以奖励推动生成式人工智能的发展：奖励引导生成的理论与方法

中文版 | 火力支援与巡飞弹药的未来（附原文）

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

无人机视觉挑战赛 | ICCV 2019 Workshop—VisDrone2019

无人机视觉挑战赛 | ICCV 2019 Workshop—VisDrone2019

PaperWeekly

7+阅读 · 2019年5月5日

【泡泡一分钟】LIMO：激光和单目相机融合的视觉里程计

【泡泡一分钟】LIMO：激光和单目相机融合的视觉里程计

泡泡机器人SLAM

12+阅读 · 2019年1月16日

【泡泡一分钟】用于评估视觉惯性里程计的TUM VI数据集

【泡泡一分钟】用于评估视觉惯性里程计的TUM VI数据集

泡泡机器人SLAM

11+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【泡泡前沿追踪】跟踪SLAM前沿动态系列之IROS2018

【泡泡前沿追踪】跟踪SLAM前沿动态系列之IROS2018

泡泡机器人SLAM

29+阅读 · 2018年10月28日

【泡泡图灵智库】Flowdometry：基于光流和深度学习的视觉里程计（IWCACV-1）

【泡泡图灵智库】Flowdometry：基于光流和深度学习的视觉里程计（IWCACV-1）

泡泡机器人SLAM

5+阅读 · 2018年9月7日

已删除

雪球

6+阅读 · 2018年8月19日

2017-最全手势识别/跟踪相关资源大列表分享（论文、数据集、比赛等）

2017-最全手势识别/跟踪相关资源大列表分享（论文、数据集、比赛等）

深度学习与NLP

64+阅读 · 2017年10月29日

相关论文

Cooperative UWB-Based Localization for Outdoors Positioning and Navigation of UAVs aided by Ground Robots

Arxiv

0+阅读 · 2021年4月1日

Learning Precise 3D Manipulation from Multiple Uncalibrated Cameras

Arxiv

0+阅读 · 2021年3月31日

Human POSEitioning System (HPS): 3D Human Pose Estimation and Self-localization in Large Scenes from Body-Mounted Sensors

Arxiv

0+阅读 · 2021年3月31日

SCFusion: Real-time Incremental Scene Reconstruction with Semantic Completion

Arxiv

0+阅读 · 2021年3月31日

Graph-Based Topological Exploration Planning in Large-Scale 3D Environments

Arxiv

0+阅读 · 2021年3月31日

Total3DUnderstanding: Joint Layout, Object Pose and Mesh Reconstruction for Indoor Scenes from a Single Image

Total3DUnderstanding: Joint Layout, Object Pose and Mesh Reconstruction for Indoor Scenes from a Single Image

Arxiv

12+阅读 · 2020年2月27日

Real-time Scalable Dense Surfel Mapping

Real-time Scalable Dense Surfel Mapping

Arxiv

5+阅读 · 2019年9月10日

Clustered Object Detection in Aerial Images

Clustered Object Detection in Aerial Images

Arxiv

5+阅读 · 2019年8月27日

Geometry-Based Multiple Camera Head Detection in Dense Crowds

Geometry-Based Multiple Camera Head Detection in Dense Crowds

Arxiv

3+阅读 · 2018年8月2日

A Framework for Evaluating 6-DOF Object Trackers

Arxiv

6+阅读 · 2018年3月28日

微信扫码咨询专知VIP会员