Many point-based semantic segmentation methods have been designed for indoor scenarios, but they struggle if they are applied to point clouds that are captured by a LiDAR sensor in an outdoor environment. In order to make these methods more efficient and robust such that they can handle LiDAR data, we introduce the general concept of reformulating 3D point-based operations such that they can operate in the projection space. While we show by means of three point-based methods that the reformulated versions are between 300 and 400 times faster and achieve a higher accuracy, we furthermore demonstrate that the concept of reformulating 3D point-based operations allows to design new architectures that unify the benefits of point-based and image-based methods. As an example, we introduce a network that integrates reformulated 3D point-based operations into a 2D encoder-decoder architecture that fuses the information from different 2D scales. We evaluate the approach on four challenging datasets for semantic LiDAR point cloud segmentation and show that leveraging reformulated 3D point-based operations with 2D image-based operations achieves very good results for all four datasets.
翻译:许多基于点的语义分解方法是为室内情景设计的,但是,如果将这种方法用于指向由激光雷达传感器在室外环境中捕捉到的云层,它们就会挣扎。为了使这些方法更加有效和稳健,以便它们能够处理激光雷达数据,我们引入了重订三维点操作的一般概念,以便它们能够在投影空间中运行。我们通过三种基于点的方法表明,重新拟订的3D点操作速度为300至400倍,并达到更高的准确度,但我们进一步表明,重新拟订的3D点操作概念允许设计新的结构,以统一点基和图像法方法的好处。举例来说,我们引入了一个将重新拟订的三维点操作纳入二维点分解码结构的网络,将不同2D尺度的信息结合在一起。我们评估了四种具有挑战性的数据元的激光雷达点云分解方法,并表明,重新拟订的三维点操作以2D图像为基础的图像操作能够为所有四个数据集取得非常好的结果。