Currently, existing state-of-the-art 3D object detectors are in two-stage paradigm. These methods typically comprise two steps: 1) Utilize region proposal network to propose a fraction of high-quality proposals in a bottom-up fashion. 2) Resize and pool the semantic features from the proposed regions to summarize RoI-wise representations for further refinement. Note that these RoI-wise representations in step 2) are considered individually as an uncorrelated entry when fed to following detection headers. Nevertheless, we observe these proposals generated by step 1) offset from ground truth somehow, emerging in local neighborhood densely with an underlying probability. Challenges arise in the case where a proposal largely forsakes its boundary information due to coordinate offset while existing networks lack corresponding information compensation mechanism. In this paper, we propose BANet for 3D object detection from point clouds. Specifically, instead of refining each proposal independently as previous works do, we represent each proposal as a node for graph construction within a given cut-off threshold, associating proposals in the form of local neighborhood graph, with boundary correlations of an object being explicitly exploited. Besiedes, we devise a lightweight Region Feature Aggregation Network to fully exploit voxel-wise, pixel-wise, and point-wise feature with expanding receptive fields for more informative RoI-wise representations. As of Apr. 17th, 2021, our BANet achieves on par performance on KITTI 3D detection leaderboard and ranks $1^{st}$ on $Moderate$ difficulty of $Car$ category on KITTI BEV detection leaderboard. The source code will be released once the paper is accepted.
翻译:目前,现有最先进的3D物体探测器处于两阶段范式中,这些方法通常包括两个步骤:(1) 利用区域建议网络,以自下而上的方式提出部分高质量建议;(2) 调整和汇集拟议区域的语义特征,以总结RoI的表示方式,以便进一步完善。请注意,这些步骤2中的RoI逻辑表示方式,在提供给检测头目的时,单独被视为与图表有关的条目。然而,我们观察到这些提议是由步骤1产生的,从地面真相中抵消,以某种方式抵消,在本地密集的居民区出现,潜在概率很大。如果一项提议主要以自下而上的方式提出高质量的建议,以协调方式提出高质量的建议;(2) 将拟议区域建议的规模从拟议区域变小,从点云中总结3D的表示方式; 具体地说,我们将每一项提议作为在特定截断线阈值内进行图形构造的节点,一旦将本地邻域图形式中的建议与某一物体的边界源进行明确利用。 回避,我们设计了一个较轻的域域域域标值的域标值为B。