LiDAR sensors are an integral part of modern autonomous vehicles as they provide an accurate, high-resolution 3D representation of the vehicle's surroundings. However, it is computationally difficult to make use of the ever-increasing amounts of data from multiple high-resolution LiDAR sensors. As frame-rates, point cloud sizes and sensor resolutions increase, real-time processing of these point clouds must still extract semantics from this increasingly precise picture of the vehicle's environment. One deciding factor of the run-time performance and accuracy of deep neural networks operating on these point clouds is the underlying data representation and the way it is computed. In this work, we examine the relationship between the computational representations used in neural networks and their performance characteristics. To this end, we propose a novel computational taxonomy of LiDAR point cloud representations used in modern deep neural networks for 3D point cloud processing. Using this taxonomy, we perform a structured analysis of different families of approaches. Thereby, we uncover common advantages and limitations in terms of computational efficiency, memory requirements, and representational capacity as measured by semantic segmentation performance. Finally, we provide some insights and guidance for future developments in neural point cloud processing methods.
翻译:LiDAR 传感器是现代自主飞行器的一个组成部分,因为它们提供了该飞行器周围准确、高分辨率3D的精确表示,然而,在计算上很难利用多个高分辨率LIDAR传感器不断增加的数据数量。随着框架率、点云尺寸和传感器分辨率的增加,这些点云的实时处理仍必须从这一日益精确的车辆环境图中提取出语义学。运行于这些点云上的深神经网络运行的运行时间性能和准确性的一个决定性因素是基本数据表示及其计算方式。在这项工作中,我们研究了神经网络中使用的计算代表及其性能特点之间的关系。为此,我们提议对3D点云处理现代深线网络中使用的LIDAR点云表示进行新的计算分类。我们利用这种分类对不同方法的类别进行结构分析。我们发现了计算效率、记忆要求和以分辨率云分解性表现衡量的代表性能力方面的共同优势和局限性。我们最后为未来线段处理方法提供了一些洞察和导。