复杂动态环境的多式语义解语 SLM (Multi-modal Semantic SLAM for Complex Dynamic Environments) - 专知论文

会员服务 ·

0

SLAM · 可约的 · 回合 · INFORMS · Performer ·

2022 年 5 月 14 日

Multi-modal Semantic SLAM for Complex Dynamic Environments

翻译：复杂动态环境的多式语义解语 SLM

Han Wang,Jing Ying Ko,Lihua Xie

Simultaneous Localization and Mapping (SLAM) is one of the most essential techniques in many real-world robotic applications. The assumption of static environments is common in most SLAM algorithms, which however, is not the case for most applications. Recent work on semantic SLAM aims to understand the objects in an environment and distinguish dynamic information from a scene context by performing image-based segmentation. However, the segmentation results are often imperfect or incomplete, which can subsequently reduce the quality of mapping and the accuracy of localization. In this paper, we present a robust multi-modal semantic framework to solve the SLAM problem in complex and highly dynamic environments. We propose to learn a more powerful object feature representation and deploy the mechanism of looking and thinking twice to the backbone network, which leads to a better recognition result to our baseline instance segmentation model. Moreover, both geometric-only clustering and visual semantic information are combined to reduce the effect of segmentation error due to small-scale objects, occlusion and motion blur. Thorough experiments have been conducted to evaluate the performance of the proposed method. The results show that our method can precisely identify dynamic objects under recognition imperfection and motion blur. Moreover, the proposed SLAM framework is able to efficiently build a static dense map at a processing rate of more than 10 Hz, which can be implemented in many practical applications. Both training data and the proposed method is open sourced at https://github.com/wh200720041/MMS_SLAM.

翻译：同步本地化和绘图(SLAM)是许多真实世界机器人应用中最重要的技术之一。静态环境的假设在大多数SLM算法中是常见的, 然而,大多数应用都不是这样。最近关于语义 SLAM 的工作旨在了解环境中的物体,通过进行基于图像的分化,将动态信息与场景环境区分开来。然而, 分解结果往往不完善或不完整, 从而可以随后降低绘图质量和本地化的准确性。在本文中, 我们提出了一个强有力的多模式语义框架, 以便在复杂和高度动态的环境中解决SLAM问题。我们提议学习一个更强大的对象特征代表, 并且两次向主干网部署寻找和思考的机制, 从而使我们的基线实例分解模型获得更好的识别结果。此外, 仅使用几何组合和视觉语义信息, 以降低因小物体、开源/ 封闭度 / 运动模糊性。索罗夫实验是为了评估拟议方法的性能表现。我们提出的方法可以精确地在动态模型中建立一个不完善的模型。

0

相关内容

SLAM

即时定位与地图构建（SLAM或Simultaneouslocalizationandmapping）是这样一种技术：使得机器人和自动驾驶汽车等设备能在未知环境（没有先验知识的前提下）建立地图,或者在已知环境（已给出该地图的先验知识）中能更新地图,并保证这些设备能在同时追踪它们的当前位置。

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【泡泡汇总】CVPR2019 SLAM Paperlist

【泡泡汇总】CVPR2019 SLAM Paperlist

泡泡机器人SLAM

14+阅读 · 2019年6月12日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

【泡泡一分钟】DS-SLAM: 动态环境下的语义视觉SLAM

【泡泡一分钟】DS-SLAM: 动态环境下的语义视觉SLAM

泡泡机器人SLAM

23+阅读 · 2019年1月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【泡泡机器人】ECCV2018之SLAM最新前沿动态（附文章链接和代码链接）

【泡泡机器人】ECCV2018之SLAM最新前沿动态（附文章链接和代码链接）

泡泡机器人SLAM

38+阅读 · 2018年9月23日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【推荐】SLAM相关资源大列表

【推荐】SLAM相关资源大列表

机器学习研究会

10+阅读 · 2017年8月18日

高阶微分方程的周期解及多重性

国家自然科学基金

0+阅读 · 2015年12月31日

带跳非耦合正倒向随机微分方程的Crank-Nicolson数值解法研究

国家自然科学基金

0+阅读 · 2014年12月31日

金属氧化物界面的自旋极化电子输运研究

国家自然科学基金

0+阅读 · 2014年12月31日

EBSD原位观察法研究无铅焊点热-机械疲劳再结晶机制

国家自然科学基金

0+阅读 · 2013年12月31日

区域战略性新兴产业技术创新联盟构建及治理机制研究：基于动态演化视角

国家自然科学基金

0+阅读 · 2012年12月31日

四跨膜蛋白CD151与Co-029对TNFα/TNFαR1系统介导的肝细胞癌侵袭与转移的调控研究

国家自然科学基金

0+阅读 · 2011年12月31日

DLC-1信号通路系统介导TRAIL诱导人非小细胞肺癌细胞凋亡的研究

国家自然科学基金

0+阅读 · 2011年12月31日

癌细胞分泌exosome改变CTL细胞功能导致鼻咽癌免疫逃逸的机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

石墨烯中自旋和类自旋自由度的调控

国家自然科学基金

1+阅读 · 2011年12月31日

面向复杂区域和高维问题的谱方法研究

国家自然科学基金

0+阅读 · 2009年12月31日

VIP-SLAM: An Efficient Tightly-Coupled RGB-D Visual Inertial Planar SLAM

Arxiv

0+阅读 · 2022年7月4日

ReCoAt: A Deep Learning-based Framework for Multi-Modal Motion Prediction in Autonomous Driving Application

Arxiv

0+阅读 · 2022年7月2日

Masked Autoencoders for Self-Supervised Learning on Automotive Point Clouds

Arxiv

0+阅读 · 2022年7月1日

Towards Two-view 6D Object Pose Estimation: A Comparative Study on Fusion Strategy

Arxiv

0+阅读 · 2022年7月1日

Point Cloud Change Detection With Stereo V-SLAM:Dataset, Metrics and Baseline

Arxiv

0+阅读 · 2022年7月1日

Keeping Less is More: Point Sparsification for Visual SLAM

Arxiv

0+阅读 · 2022年7月1日

MotionSC: Data Set and Network for Real-Time Semantic Mapping in Dynamic Environments

MotionSC: Data Set and Network for Real-Time Semantic Mapping in Dynamic Environments

Arxiv

0+阅读 · 2022年6月30日

Neural Surface Reconstruction of Dynamic Scenes with Monocular RGB-D Camera

Arxiv

0+阅读 · 2022年6月30日

Exploring Temporally Dynamic Data Augmentation for Video Recognition

Arxiv

0+阅读 · 2022年6月30日

Point Cloud Semantic Segmentation using Multi Scale Sparse Convolution Neural Network

Arxiv

0+阅读 · 2022年6月30日

VIP会员

文章信息

相关主题

相关VIP内容

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

最新《扩散模型原理》新书，470页pdf

无人机作战：演进、创新与未来战场

AI 智能体简史

多模态空间推理在大模型时代：综述与基准测试

相关资讯

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【泡泡汇总】CVPR2019 SLAM Paperlist

【泡泡汇总】CVPR2019 SLAM Paperlist

泡泡机器人SLAM

14+阅读 · 2019年6月12日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

【泡泡一分钟】DS-SLAM: 动态环境下的语义视觉SLAM

【泡泡一分钟】DS-SLAM: 动态环境下的语义视觉SLAM

泡泡机器人SLAM

23+阅读 · 2019年1月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【泡泡机器人】ECCV2018之SLAM最新前沿动态（附文章链接和代码链接）

【泡泡机器人】ECCV2018之SLAM最新前沿动态（附文章链接和代码链接）

泡泡机器人SLAM

38+阅读 · 2018年9月23日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【推荐】SLAM相关资源大列表

【推荐】SLAM相关资源大列表

机器学习研究会

10+阅读 · 2017年8月18日

相关论文

VIP-SLAM: An Efficient Tightly-Coupled RGB-D Visual Inertial Planar SLAM

Arxiv

0+阅读 · 2022年7月4日

ReCoAt: A Deep Learning-based Framework for Multi-Modal Motion Prediction in Autonomous Driving Application

Arxiv

0+阅读 · 2022年7月2日

Masked Autoencoders for Self-Supervised Learning on Automotive Point Clouds

Arxiv

0+阅读 · 2022年7月1日

Towards Two-view 6D Object Pose Estimation: A Comparative Study on Fusion Strategy

Arxiv

0+阅读 · 2022年7月1日

Point Cloud Change Detection With Stereo V-SLAM:Dataset, Metrics and Baseline

Arxiv

0+阅读 · 2022年7月1日

Keeping Less is More: Point Sparsification for Visual SLAM

Arxiv

0+阅读 · 2022年7月1日

MotionSC: Data Set and Network for Real-Time Semantic Mapping in Dynamic Environments

MotionSC: Data Set and Network for Real-Time Semantic Mapping in Dynamic Environments

Arxiv

0+阅读 · 2022年6月30日

Neural Surface Reconstruction of Dynamic Scenes with Monocular RGB-D Camera

Arxiv

0+阅读 · 2022年6月30日

Exploring Temporally Dynamic Data Augmentation for Video Recognition

Arxiv

0+阅读 · 2022年6月30日

Point Cloud Semantic Segmentation using Multi Scale Sparse Convolution Neural Network

Arxiv

0+阅读 · 2022年6月30日

相关基金

高阶微分方程的周期解及多重性

国家自然科学基金

0+阅读 · 2015年12月31日

带跳非耦合正倒向随机微分方程的Crank-Nicolson数值解法研究

国家自然科学基金

0+阅读 · 2014年12月31日

金属氧化物界面的自旋极化电子输运研究

国家自然科学基金

0+阅读 · 2014年12月31日

EBSD原位观察法研究无铅焊点热-机械疲劳再结晶机制

国家自然科学基金

0+阅读 · 2013年12月31日

区域战略性新兴产业技术创新联盟构建及治理机制研究：基于动态演化视角

国家自然科学基金

0+阅读 · 2012年12月31日

四跨膜蛋白CD151与Co-029对TNFα/TNFαR1系统介导的肝细胞癌侵袭与转移的调控研究

国家自然科学基金

0+阅读 · 2011年12月31日

DLC-1信号通路系统介导TRAIL诱导人非小细胞肺癌细胞凋亡的研究

国家自然科学基金

0+阅读 · 2011年12月31日

癌细胞分泌exosome改变CTL细胞功能导致鼻咽癌免疫逃逸的机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

石墨烯中自旋和类自旋自由度的调控

国家自然科学基金

1+阅读 · 2011年12月31日

面向复杂区域和高维问题的谱方法研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员