Recent progress on 2D object detection has featured Cascade RCNN, which capitalizes on a sequence of cascade detectors to progressively improve proposal quality, towards high-quality object detection. However, there has not been evidence in support of building such cascade structures for 3D object detection, a challenging detection scenario with highly sparse LiDAR point clouds. In this work, we present a simple yet effective cascade architecture, named 3D Cascade RCNN, that allocates multiple detectors based on the voxelized point clouds in a cascade paradigm, pursuing higher quality 3D object detector progressively. Furthermore, we quantitatively define the sparsity level of the points within 3D bounding box of each object as the point completeness score, which is exploited as the task weight for each proposal to guide the learning of each stage detector. The spirit behind is to assign higher weights for high-quality proposals with relatively complete point distribution, while down-weight the proposals with extremely sparse points that often incur noise during training. This design of completeness-aware re-weighting elegantly upgrades the cascade paradigm to be better applicable for the sparse input data, without increasing any FLOP budgets. Through extensive experiments on both the KITTI dataset and Waymo Open Dataset, we validate the superiority of our proposed 3D Cascade RCNN, when comparing to state-of-the-art 3D object detection techniques. The source code is publicly available at \url{https://github.com/caiqi/Cascasde-3D}.
翻译:在2D天体探测方面最近的进展包括Cascade RCNNN,它利用一系列级联探测器来逐步提高建议质量,达到高质量的物体探测质量;然而,没有证据表明支持为3D天体探测建立这种级联结构,这是三维天体探测的富有挑战性的探测场景,而三维天体云则极为稀少。在这项工作中,我们提出了一个简单而有效的级联结构,名为3D Cascaade RCNN,它根据一个级联模式的氧化点云分配多种探测器,逐步追求质量更高的3D天体探测器。此外,我们还从数量上界定了每个天体3D捆绑框中点的宽度水平,作为点完整性评分,而这是用来指导每个阶段探测器学习的每个建议的任务重量。 后面的精神是给质量建议高的重量定得更高一些,在培训期间往往引起噪音的极小点。 设计了完整觉察觉的3D天体探测器的精度升级模型,以便更好地适用于稀少的输入数据,而没有增加任何FLOP-3天体标的精确度。