One of the key factors behind the recent success in visual tracking is the availability of dedicated benchmarks. While greatly benefiting tracking research, existing benchmarks no longer pose the same difficulty as before, with recent trackers achieving higher performance mainly due to (i) the introduction of more sophisticated transformer-based methods and (ii) the lack of diverse scenarios with adverse visibility, such as severe weather conditions, camouflage and imaging effects. We introduce AVisT, a dedicated benchmark for visual tracking in diverse scenarios with adverse visibility. AVisT comprises 120 challenging sequences with 80k annotated frames, spanning 18 diverse scenarios broadly grouped into five attributes, with 42 object categories. The key contribution of AVisT is its diverse and challenging scenarios, covering severe weather conditions such as dense fog, heavy rain and sandstorm; obstruction effects including fire, sun glare and splashing water; adverse imaging effects such as low light; and target effects including small targets and distractor objects, along with camouflage. We further benchmark 17 popular and recent trackers on AVisT with a detailed analysis of their tracking performance across attributes, demonstrating substantial room for improvement. We believe that AVisT can greatly benefit the tracking community by complementing existing benchmarks and by supporting the development of new, creative tracking solutions that continue to push the boundaries of the state of the art. Our dataset, along with the complete tracking performance evaluation, is available at: https://github.com/visionml/pytracking