Detecting pedestrians is a crucial task in autonomous driving systems to ensure the safety of drivers and pedestrians. The technologies involved in these algorithms must be precise and reliable, regardless of environment conditions. Relying solely on RGB cameras may not be enough to recognize road environments in situations where cameras cannot capture scenes properly. Some approaches aim to compensate for these limitations by combining RGB cameras with TOF sensors, such as LIDARs. However, there are few works that address this problem using exclusively the 3D geometric information provided by LIDARs. In this paper, we propose a PointNet++ based architecture to detect pedestrians in dense 3D point clouds. The aim is to explore the potential contribution of geometric information alone in pedestrian detection systems. We also present a semi-automatic labeling system that transfers pedestrian and non-pedestrian labels from RGB images onto the 3D domain. The fact that our datasets have RGB registered with point clouds enables label transferring by back projection from 2D bounding boxes to point clouds, with only a light manual supervision to validate results. We train PointNet++ with the geometry of the resulting 3D labelled clusters. The evaluation confirms the effectiveness of the proposed method, yielding precision and recall values around 98%.
翻译:检测行人是独立驾驶系统的一项关键任务,以确保驾驶员和行人的安全。这些算法所涉及的技术必须是精确和可靠的,不管环境条件如何。仅仅依靠RGB相机可能不足以在相机无法正确拍摄场景的情况下识别道路环境。有些做法的目的是通过将RGB相机与LIDARs等TOF传感器相结合来弥补这些限制。然而,在完全使用LIDARs提供的3D几何信息来解决这一问题的工程中,很少有人能够使用LIDARs提供的3D几何信息来解决这个问题。在本文中,我们提议以PointNet++为基础的建筑来探测密度3D点云层中的行人。目的是探索行人探测系统中仅使用几何信息的潜在贡献。我们还提出了一个半自动标签系统,将行人和非行人标签从RGB图像转移到3D域。我们的数据集登记有点云的RGB能够通过背投影从2D捆绑箱转移到点云体云体。我们只用一个简单的手册监督来验证结果。我们用3D标签组合组合的精确度来训练PentNet+,并测量结果。