Reliable and accurate localization and mapping are key components of most autonomous systems. Besides geometric information about the mapped environment, the semantics plays an important role to enable intelligent navigation behaviors. In most realistic environments, this task is particularly complicated due to dynamics caused by moving objects, which can corrupt the mapping step or derail localization. In this paper, we propose an extension of a recently published surfel-based mapping approach exploiting three-dimensional laser range scans by integrating semantic information to facilitate the mapping process. The semantic information is efficiently extracted by a fully convolutional neural network and rendered on a spherical projection of the laser range data. This computed semantic segmentation results in point-wise labels for the whole scan, allowing us to build a semantically-enriched map with labeled surfels. This semantic map enables us to reliably filter moving objects, but also improve the projective scan matching via semantic constraints. Our experimental evaluation on challenging highways sequences from KITTI dataset with very few static structures and a large amount of moving cars shows the advantage of our semantic SLAM approach in comparison to a purely geometric, state-of-the-art approach.
翻译:可靠和准确的本地化和绘图是大多数自主系统的关键组成部分。除了关于已映射环境的几何信息外,语义学对于智能导航行为起着重要作用。在最现实的环境中,由于移动物体造成的动态,这项任务特别复杂,因为移动物体会腐蚀映射步骤或破坏本地化。在本文中,我们提议扩大最近出版的以冲绳为基础的绘图方法,利用三维激光范围的扫描,通过整合语义信息促进映射过程。语义学信息通过完全相近的神经神经系统网络有效提取,并用于激光射程数据的球状投影。这种计算出语义分割的结果是对整个扫描的点定位标签产生结果,使我们能够用贴有标签的表面设计建立一个精密的地图。这一语义图使我们能够可靠地筛选移动物体,同时通过语义限制改进投影扫描匹配。我们对由KITTI数据集组成的具有挑战性的高速公路序列的实验性评估,其静态结构非常少,移动的汽车数量很大,显示了我们用纯度的地理测量方法的优势。