The development of autonomous vehicles provides an opportunity to have a complete set of camera sensors capturing the environment around the car. Thus, it is important for object detection and tracking to address new challenges, such as achieving consistent results across views of cameras. To address these challenges, this work presents a new Global Association Graph Model with Link Prediction approach to predict existing tracklets location and link detections with tracklets via cross-attention motion modeling and appearance re-identification. This approach aims at solving issues caused by inconsistent 3D object detection. Moreover, our model exploits to improve the detection accuracy of a standard 3D object detector in the nuScenes detection challenge. The experimental results on the nuScenes dataset demonstrate the benefits of the proposed method to produce SOTA performance on the existing vision-based tracking dataset.
翻译:自主车辆的开发为全套摄像传感器捕捉汽车周围的环境提供了机会,因此,物体的探测和跟踪对于应对新的挑战非常重要,如在对相机的观察中取得一致结果;为应对这些挑战,这项工作提出了一个新的全球联系图形模型,与链接预测方法一道预测现有的跟踪点位置,并通过交叉注意运动模型和外观再识别将探测与跟踪点连接起来;这一方法旨在解决3D天体探测不一致所造成的问题;此外,我们的模型利用了提高核天体探测挑战中标准3D天体探测器的探测准确性;关于核天体数据集的实验结果展示了拟议方法在现有的基于愿景的跟踪数据集上产生SOTA性能的好处。