Current efficient LiDAR-based detection frameworks are lacking in exploiting object relations, which naturally present in both spatial and temporal manners. To this end, we introduce a simple, efficient, and effective two-stage detector, termed as Ret3D. At the core of Ret3D is the utilization of novel intra-frame and inter-frame relation modules to capture the spatial and temporal relations accordingly. More Specifically, intra-frame relation module (IntraRM) encapsulates the intra-frame objects into a sparse graph and thus allows us to refine the object features through efficient message passing. On the other hand, inter-frame relation module (InterRM) densely connects each object in its corresponding tracked sequences dynamically, and leverages such temporal information to further enhance its representations efficiently through a lightweight transformer network. We instantiate our novel designs of IntraRM and InterRM with general center-based or anchor-based detectors and evaluate them on Waymo Open Dataset (WOD). With negligible extra overhead, Ret3D achieves the state-of-the-art performance, being 5.5% and 3.2% higher than the recent competitor in terms of the LEVEL 1 and LEVEL 2 mAPH metrics on vehicle detection, respectively.
翻译:在探索自然以空间和时间方式自然存在的天体关系方面,目前缺乏以LiDAR为基础的基于LiDAR为基础的高效探测框架。为此目的,我们引入了一个简单、高效和有效的两阶段探测器,称为Ret3D。在Ret3D的核心是利用新的内部和机体间关系模块来相应地捕捉空间和时间关系。更具体地说,内部关系模块(内部关系模块)将机体内物体封成一个稀薄的图表,从而使我们能够通过有效传递信息来改进天体特征。另一方面,机体关系模块(InterRM)在相应的跟踪序列中将每个物体紧密连接起来,并利用这种时间信息通过轻量变压器网络来进一步高效地加强其表现。我们用一般的中基或锚基探测器对我们的IntraRM和机际关系新设计进行回现,并在Waymo Open D数据集(WOD)上对它们进行评价。Ret3D通过可忽略的外加电路多的电路段,其达到最新状态性能性,在1和2MIS级别上分别高于5.5%和3.2%。