Visual tracker includes network and post-processing. Despite the color distortion and low contrast of underwater images, advanced trackers can still be very competitive in underwater object tracking because deep learning empowers the networks to discriminate the appearance features of the target. However, underwater object tracking also faces another problem. Underwater targets such as fish and dolphins, usually appear in groups, and creatures of the same species usually have similar expressions of appearance features, so it is challenging to distinguish the weak differences characteristics only by the network itself. The existing detection-based post-processing only reflects the results of single frame detection, but cannot locate real targets among similar targets. In this paper, we propose a new post-processing strategy based on motion, which uses Kalman filter (KF) to maintain the motion information of the target and exclude similar targets around. Specifically, we use the KF predicted box and the candidate boxes in the response map and their confidence to calculate the candidate location score to find the real target. Our method does not change the network structure, nor does it perform additional training for the tracker. It can be quickly applied to other tracking fields with similar target problem. We improved SOTA trackers based on our method, and proved the effectiveness of our method on UOT100 and UTB180. The AUC of our method for OSTrack on similar subsequences is improved by more than 3% on average, and the precision and normalization precision are improved by more than 3.5% on average. It has been proved that our method has good compatibility in dealing with similar target problems and can enhance performance of the tracker together with other methods. More details can be found in: https://github.com/LiYunfengLYF/KF_in_underwater_trackers.
翻译:视觉跟踪器包括网络和后处理。尽管水下图像的颜色扭曲和对比较低,但先进的跟踪器在水下物体跟踪中仍然具有很高的竞争力,因为深层学习使网络能够区分目标的外观特征。然而,水下物体跟踪也面临另一个问题。水下目标,例如鱼和海豚通常以群居形式出现,同一物种的生物通常有相似的外观特征表现,因此只有网络本身才能区分薄弱的差异特征是具有挑战性的。现有的基于探测的180号后处理只反映单一框架探测的结果,但无法在类似目标中找到真正的目标。在本文中,我们建议基于运动的后处理战略,使用Kalman过滤器(KF)来维持目标的动作信息,并排除周围的类似目标。具体地,我们使用KF预测的框和候选框来计算候选人的分数以找到真正的目标。我们的方法并不改变网络结构,也没有对跟踪器进行更多的培训。它可以很快应用到类似目标的字段。我们用SOTA追踪器的更精度方法改进了我们的SOTA跟踪器, 和OOL的精度方法也证明了了我们的方法。我们用更精度方法改进了我们的OOOOOOB的精度方法的精度。我们用更精度方法, 改进了OOOOOB的精度方法的精度方法的精度。我们用比的精度方法的精度方法的精度。在SOB的精度方法的精度方法的精度方法的精度。