Sharing collective perception messages (CPM) between vehicles is investigated to decrease occlusions, so as to improve perception accuracy and safety of autonomous driving. However, highly accurate data sharing and low communication overhead is a big challenge for collective perception, especially when real-time communication is required among connected and automated vehicles. In this paper, we propose an efficient and effective keypoints-based deep feature fusion framework, called FPV-RCNN, for collective perception, which is built on top of the 3D object detector PV-RCNN. We introduce a bounding box proposal matching module and a keypoints selection strategy to compress the CPM size and solve the multi-vehicle data fusion problem. Compared to a bird's-eye view (BEV) keypoints feature fusion, FPV-RCNN achieves improved detection accuracy by about 14% at a high evaluation criterion (IoU 0.7) on a synthetic dataset COMAP dedicated to collective perception. Also, its performance is comparable to two raw data fusion baselines that have no data loss in sharing. Moreover, our method also significantly decreases the CPM size to less than 0.3KB, which is about 50 times smaller than the BEV feature map sharing used in previous works. Even with a further decreased number of CPM feature channels, i.e., from 128 to 32, the detection performance only drops about 1%. The code of our method is available at https://github.com/YuanYunshuang/FPV_RCNN.
翻译:对车辆之间共享集体感知信息(CPM)进行调查,以降低隔离度,从而提高自主驾驶的准确性和安全性。然而,高度准确的数据共享和低通信管理是集体感知的一大挑战,特别是在需要连接和自动化车辆之间进行实时通信的情况下。在本文中,我们提议在3D对象探测器PV-RCN的顶端建立集体感知(FPV-RCN)的高效和有效的基于关键点的深度特征聚合框架,称为FPV-RCNN,用于集体感知。我们引入了一个捆绑框建议匹配模块和关键点选择战略,以压缩CPM大小和解决多车辆数据融合问题。与鸟眼视图(BEV)关键点特征融合相比,FPV-RCNN在用于集体感知的合成数据集COMAP(IU0.7)上实现了约14%的检测准确度提高。此外,其性能与两个没有数据损失的原始数据聚合基准相比。此外,我们的方法也大大降低了CMM/RRC的下降幅度,比B的数值小于0.3KB的性观测功能特征大约50倍。