Moving object Detection (MOD) is a critical task in autonomous driving as moving agents around the ego-vehicle need to be accurately detected for safe trajectory planning. It also enables appearance agnostic detection of objects based on motion cues. There are geometric challenges like motion-parallax ambiguity which makes it a difficult problem. In this work, we aim to leverage the vehicle motion information and feed it into the model to have an adaptation mechanism based on ego-motion. The motivation is to enable the model to implicitly perform ego-motion compensation to improve performance. We convert the six degrees of freedom vehicle motion into a pixel-wise tensor which can be fed as input to the CNN model. The proposed model using Vehicle Motion Tensor (VMT) achieves an absolute improvement of 5.6% in mIoU over the baseline architecture. We also achieve state-of-the-art results on the public KITTI_MoSeg_Extended dataset even compared to methods which make use of LiDAR and additional input frames. Our model is also lightweight and runs at 85 fps on a TitanX GPU. Qualitative results are provided in https://youtu.be/ezbfjti-kTk.
翻译:移动物体探测(MOD)是自动驾驶的关键任务,因为需要准确地检测自我车辆周围移动的动力剂,以便安全地进行轨迹规划。它还可以根据运动提示对物体进行外观的不可知性检测。存在运动-平行模糊性等几何挑战,这使得它成为一个困难问题。在这项工作中,我们的目标是利用车辆运动信息并将其输入模型,以建立基于自我动作的适应机制。动机是使模型能够暗中执行自我动作补偿来提高性能。我们把自由车辆运动的6度转换成像素,可以作为对CNN模型的投入。使用车辆移动信号仪(VMT)的拟议模型在基线结构上实现了5.6%的绝对改进。我们还在公共 KITTI_MOSeg_Extfended数据集上实现了最新的结果,甚至与使用LIDAR和额外输入框架的方法相比。我们模型也是轻重的,在TitanX GPU上运行85 fyoups。 Qualitititivez 提供https://s://youb-tivez。