While 2D occupancy maps commonly used in mobile robotics enable safe navigation in indoor environments, in order for robots to understand and interact with their environment and its inhabitants representing 3D geometry and semantic environment information is required. Semantic information is crucial in effective interpretation of the meanings humans attribute to different parts of a space, while 3D geometry is important for safety and high-level understanding. We propose a pipeline that can generate a multi-layer representation of indoor environments for robotic applications. The proposed representation includes 3D metric-semantic layers, a 2D occupancy layer, and an object instance layer where known objects are replaced with an approximate model obtained through a novel model-matching approach. The metric-semantic layer and the object instance layer are combined to form an augmented representation of the environment. Experiments show that the proposed shape matching method outperforms a state-of-the-art deep learning method when tasked to complete unseen parts of objects in the scene. The pipeline performance translates well from simulation to real world as shown by F1-score analysis, with semantic segmentation accuracy using Mask R-CNN acting as the major bottleneck. Finally, we also demonstrate on a real robotic platform how the multi-layer map can be used to improve navigation safety.
翻译:虽然在移动机器人中常用的2D占用图有助于室内环境的安全导航,但为了让机器人能够了解环境并与其环境和代表3D几何和语义环境信息的居住者进行互动,需要语义信息对于有效解释空间不同部分人类属性的含义至关重要,而3D几何则对于安全和高层理解非常重要。我们建议了一条管道,可为机器人应用生成室内环境的多层代表。拟议的演示包括3D 度测量层、2D居住层和一个对象实例层,其中已知物体被替换为通过新颖模型匹配方法获得的近似模型。测量层和对象实例层组合起来形成环境的强化代表面。实验表明,拟议的形状匹配方法在完成现场无法见的物体部分时,超越了最先进的深层学习方法。管道性能从模拟转换到真实世界,如F1核心分析所显示的那样,通过使用Masyal RCN-N 和对象实例图解的精度来显示如何改进主要导航平台。最后,我们还可以用Semmay RCN-CN-N 来演示如何改进真实-rbolnial-leck 。