多目标通用物体跟踪算法：超越 SOT (Beyond SOT: Tracking Multiple Generic Objects at Once) - 专知论文

会员服务 ·

0

物体跟踪 · 基准 · 多目标 · 跟踪算法 · 跟踪器 ·

2023 年 4 月 6 日

Beyond SOT: Tracking Multiple Generic Objects at Once

翻译：多目标通用物体跟踪算法：超越 SOT

Christoph Mayer,Martin Danelljan,Ming-Hsuan Yang,Vittorio Ferrari,Luc Van Gool,Alina Kuznetsova

from arxiv, 16 pages

Generic Object Tracking (GOT) is the problem of tracking target objects, specified by bounding boxes in the first frame of a video. While the task has received much attention in the last decades, researchers have almost exclusively focused on the single object setting. Multi-object GOT benefits from a wider applicability, rendering it more attractive in real-world applications. We attribute the lack of research interest into this problem to the absence of suitable benchmarks. In this work, we introduce a new large-scale GOT benchmark, LaGOT, containing multiple annotated target objects per sequence. Our benchmark allows users to tackle key remaining challenges in GOT, aiming to increase robustness and reduce computation through joint tracking of multiple objects simultaneously. In addition, we propose a transformer-based GOT tracker baseline capable of joint processing of multiple objects through shared computation. Our approach achieves a 4x faster run-time in case of 10 concurrent objects compared to tracking each object independently and outperforms existing single object trackers on our new benchmark. In addition, our approach achieves highly competitive results on single-object GOT datasets, setting a new state of the art on TrackingNet with a success rate AUC of 84.4%. Our benchmark, code, and trained models will be made publicly available.

翻译：通用物体跟踪(GOT)是指在视频的第一帧中，跟踪由边界框定义的目标物体的问题。虽然这个任务在过去几十年中已经得到了很多关注，但研究人员几乎完全集中于单个目标的设置。多目标GOT有更广泛的适用性，使它在实际应用中更具吸引力。我们认为缺乏对这个问题的研究兴趣是因为没有合适的基准。在这项工作中，我们引入了一个新的大规模GOT基准，称为 LaGOT，其中包含每个序列多个注释的目标对象。我们的基准允许用户解决GOT中的关键挑战，旨在通过同时跟踪多个对象来增加鲁棒性和减少计算量。此外，我们提出了一个基于Transformer的GOT跟踪器基线，能够通过共享计算联合处理多个对象。相比于独立跟踪每个对象，我们的方法在10个并发对象的情况下实现了4倍的运行时间，并在我们的新基准上优于现有的单个物体跟踪器。此外，我们的方法在单目标GOT数据集上实现了非常有竞争力的结果，在TrackingNet上的成功率AUC为84.4%，创下了新的最高水平。我们的基准、代码和训练模型将公开提供。

0

相关内容

物体跟踪

自监督学习在CV进展？何恺明等最新ECCV2022教程《自监督表示学习在计算机视觉》，全面讲述自监督视觉学习进展

自监督学习在CV进展？何恺明等最新ECCV2022教程《自监督表示学习在计算机视觉》，全面讲述自监督视觉学习进展

专知会员服务

54+阅读 · 2022年12月10日

【RecSys22教程】多阶段推荐系统的神经重排序，90页ppt

【RecSys22教程】多阶段推荐系统的神经重排序，90页ppt

专知会员服务

27+阅读 · 2022年9月30日

【CVPR 2022-UCSD&英伟达】GroupViT:从文本监督中产生语义分割，Semantic Segmentation Emerges from Text Supervision

【CVPR 2022-UCSD&英伟达】GroupViT:从文本监督中产生语义分割，Semantic Segmentation Emerges from Text Supervision

专知会员服务

12+阅读 · 2022年3月9日

AAAI 2022 | 基于预训练-微调框架的图像差异描述任务

AAAI 2022 | 基于预训练-微调框架的图像差异描述任务

专知会员服务

18+阅读 · 2022年2月26日

【CMU博士论文】开放世界目标检测与跟踪，168页pdf

【CMU博士论文】开放世界目标检测与跟踪，168页pdf

专知会员服务

60+阅读 · 2021年6月14日

【CVPR 2021】变换器跟踪TransT: Transformer Tracking

【CVPR 2021】变换器跟踪TransT: Transformer Tracking

专知会员服务

22+阅读 · 2021年4月20日

【视频目标检测与跟踪：综述论文】Video Object Segmentation and Tracking: A Survey

专知会员服务

66+阅读 · 2020年6月4日

【CVPR2020】视觉跟踪的概率回归，Probabilistic Regression for Visual Tracking

【CVPR2020】视觉跟踪的概率回归，Probabilistic Regression for Visual Tracking

专知会员服务

37+阅读 · 2020年3月27日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【CVPR 2019 | tutorial】视觉识别Visual Recognition and Beyond，Facebook|Ross Girshick，Justin Johnson（李飞飞高徒）

【CVPR 2019 | tutorial】视觉识别Visual Recognition and Beyond，Facebook|Ross Girshick，Justin Johnson（李飞飞高徒）

专知会员服务

29+阅读 · 2019年6月16日

ECCV 2022 | 同时完成四项跟踪任务！Unicorn: 迈向目标跟踪的大统一

ECCV 2022 | 同时完成四项跟踪任务！Unicorn: 迈向目标跟踪的大统一

PaperWeekly

0+阅读 · 2022年7月26日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【跟踪Tracking】15篇论文+代码 | 中秋快乐~

【跟踪Tracking】15篇论文+代码 | 中秋快乐~

专知

18+阅读 · 2018年9月24日

【论文推荐】最新十篇目标跟踪相关论文—多帧光流跟踪、动态图学习、MV-YOLO、姿态估计、深度核相关滤波、Benchmark

【论文推荐】最新十篇目标跟踪相关论文—多帧光流跟踪、动态图学习、MV-YOLO、姿态估计、深度核相关滤波、Benchmark

专知

13+阅读 · 2018年5月26日

车辆目标检测

车辆目标检测

数据挖掘入门与实战

30+阅读 · 2018年3月30日

【论文推荐】最新5篇目标跟踪（Object Tracking）相关论文—并行跟踪和验证、光流、自动跟踪、相关滤波集成、CFNet

【论文推荐】最新5篇目标跟踪（Object Tracking）相关论文—并行跟踪和验证、光流、自动跟踪、相关滤波集成、CFNet

专知

25+阅读 · 2018年2月6日

【论文推荐】最新5篇目标检测相关论文——显著目标检测、弱监督One-Shot检测、多框检测器、携带物体检测、假彩色图像检测

【论文推荐】最新5篇目标检测相关论文——显著目标检测、弱监督One-Shot检测、多框检测器、携带物体检测、假彩色图像检测

专知

74+阅读 · 2018年1月16日

【推荐】YOLO实时目标检测(6fps)

【推荐】YOLO实时目标检测(6fps)

机器学习研究会

20+阅读 · 2017年11月5日

基于多源视频的大范围场景目标跟踪

国家自然科学基金

2+阅读 · 2015年12月31日

基于回归的视角转换框架下的多视角行人步态识别

国家自然科学基金

2+阅读 · 2014年12月31日

求解可分凸规划的并行分裂算法研究

国家自然科学基金

0+阅读 · 2013年12月31日

非一致指数二分与伪轨跟踪

国家自然科学基金

0+阅读 · 2013年12月31日

车载自组网实时协作定位系统及多数据源融合算法研究

国家自然科学基金

0+阅读 · 2012年12月31日

生物启发的视觉目标搜索和定位研究

国家自然科学基金

0+阅读 · 2012年12月31日

拓扑连通性保持与目标任务共同引导的多智能体跨层协同控制

国家自然科学基金

2+阅读 · 2011年12月31日

非曼哈顿结构下带粒子群优化的VLSI总体布线算法研究

国家自然科学基金

0+阅读 · 2011年12月31日

汽车复杂约束下的多目标集成控制研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于先验三维模型的车辆监控关键算法研究

国家自然科学基金

0+阅读 · 2011年12月31日

RobustLoc: Robust Camera Pose Regression in Challenging Driving Environments

Arxiv

0+阅读 · 2023年5月25日

NCHO: Unsupervised Learning for Neural 3D Composition of Humans and Objects

Arxiv

0+阅读 · 2023年5月23日

Siamese Masked Autoencoders

Arxiv

0+阅读 · 2023年5月23日

Active Learning Principles for In-Context Learning with Large Language Models

Arxiv

0+阅读 · 2023年5月23日

Deep Learning for UAV-based Object Detection and Tracking: A Survey

Arxiv

62+阅读 · 2021年10月25日

Transformer Tracking

Arxiv

17+阅读 · 2021年3月29日

Text Detection and Recognition in the Wild: A Review

Arxiv

20+阅读 · 2020年6月8日

LayoutLM: Pre-training of Text and Layout for Document Image Understanding

LayoutLM: Pre-training of Text and Layout for Document Image Understanding

Arxiv

12+阅读 · 2020年2月19日

3D Backbone Network for 3D Object Detection

Arxiv

12+阅读 · 2019年1月24日

Improving Multiple Object Tracking with Optical Flow and Edge Preprocessing

Arxiv

10+阅读 · 2018年1月29日

VIP会员

文章信息

相关主题

相关VIP内容

自监督学习在CV进展？何恺明等最新ECCV2022教程《自监督表示学习在计算机视觉》，全面讲述自监督视觉学习进展

自监督学习在CV进展？何恺明等最新ECCV2022教程《自监督表示学习在计算机视觉》，全面讲述自监督视觉学习进展

专知会员服务

54+阅读 · 2022年12月10日

【RecSys22教程】多阶段推荐系统的神经重排序，90页ppt

【RecSys22教程】多阶段推荐系统的神经重排序，90页ppt

专知会员服务

27+阅读 · 2022年9月30日

【CVPR 2022-UCSD&英伟达】GroupViT:从文本监督中产生语义分割，Semantic Segmentation Emerges from Text Supervision

【CVPR 2022-UCSD&英伟达】GroupViT:从文本监督中产生语义分割，Semantic Segmentation Emerges from Text Supervision

专知会员服务

12+阅读 · 2022年3月9日

AAAI 2022 | 基于预训练-微调框架的图像差异描述任务

AAAI 2022 | 基于预训练-微调框架的图像差异描述任务

专知会员服务

18+阅读 · 2022年2月26日

【CMU博士论文】开放世界目标检测与跟踪，168页pdf

【CMU博士论文】开放世界目标检测与跟踪，168页pdf

专知会员服务

60+阅读 · 2021年6月14日

【CVPR 2021】变换器跟踪TransT: Transformer Tracking

【CVPR 2021】变换器跟踪TransT: Transformer Tracking

专知会员服务

22+阅读 · 2021年4月20日

【视频目标检测与跟踪：综述论文】Video Object Segmentation and Tracking: A Survey

专知会员服务

66+阅读 · 2020年6月4日

【CVPR2020】视觉跟踪的概率回归，Probabilistic Regression for Visual Tracking

【CVPR2020】视觉跟踪的概率回归，Probabilistic Regression for Visual Tracking

专知会员服务

37+阅读 · 2020年3月27日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【CVPR 2019 | tutorial】视觉识别Visual Recognition and Beyond，Facebook|Ross Girshick，Justin Johnson（李飞飞高徒）

【CVPR 2019 | tutorial】视觉识别Visual Recognition and Beyond，Facebook|Ross Girshick，Justin Johnson（李飞飞高徒）

专知会员服务

29+阅读 · 2019年6月16日

热门VIP内容

开通专知VIP会员享更多权益服务

《美陆军徒步机动作战条令手册》最新168页

【博士论文】基于不确定性的可靠性：现代机器学习中的选择性预测与可信部署

军事后勤数字化未来展望

《美海军后勤体系整合与创新挑战》最新报告

相关资讯

ECCV 2022 | 同时完成四项跟踪任务！Unicorn: 迈向目标跟踪的大统一

ECCV 2022 | 同时完成四项跟踪任务！Unicorn: 迈向目标跟踪的大统一

PaperWeekly

0+阅读 · 2022年7月26日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【跟踪Tracking】15篇论文+代码 | 中秋快乐~

【跟踪Tracking】15篇论文+代码 | 中秋快乐~

专知

18+阅读 · 2018年9月24日

【论文推荐】最新十篇目标跟踪相关论文—多帧光流跟踪、动态图学习、MV-YOLO、姿态估计、深度核相关滤波、Benchmark

【论文推荐】最新十篇目标跟踪相关论文—多帧光流跟踪、动态图学习、MV-YOLO、姿态估计、深度核相关滤波、Benchmark

专知

13+阅读 · 2018年5月26日

车辆目标检测

车辆目标检测

数据挖掘入门与实战

30+阅读 · 2018年3月30日

【论文推荐】最新5篇目标跟踪（Object Tracking）相关论文—并行跟踪和验证、光流、自动跟踪、相关滤波集成、CFNet

【论文推荐】最新5篇目标跟踪（Object Tracking）相关论文—并行跟踪和验证、光流、自动跟踪、相关滤波集成、CFNet

专知

25+阅读 · 2018年2月6日

【论文推荐】最新5篇目标检测相关论文——显著目标检测、弱监督One-Shot检测、多框检测器、携带物体检测、假彩色图像检测

【论文推荐】最新5篇目标检测相关论文——显著目标检测、弱监督One-Shot检测、多框检测器、携带物体检测、假彩色图像检测

专知

74+阅读 · 2018年1月16日

【推荐】YOLO实时目标检测(6fps)

【推荐】YOLO实时目标检测(6fps)

机器学习研究会

20+阅读 · 2017年11月5日

相关论文

RobustLoc: Robust Camera Pose Regression in Challenging Driving Environments

Arxiv

0+阅读 · 2023年5月25日

NCHO: Unsupervised Learning for Neural 3D Composition of Humans and Objects

Arxiv

0+阅读 · 2023年5月23日

Siamese Masked Autoencoders

Arxiv

0+阅读 · 2023年5月23日

Active Learning Principles for In-Context Learning with Large Language Models

Arxiv

0+阅读 · 2023年5月23日

Deep Learning for UAV-based Object Detection and Tracking: A Survey

Arxiv

62+阅读 · 2021年10月25日

Transformer Tracking

Arxiv

17+阅读 · 2021年3月29日

Text Detection and Recognition in the Wild: A Review

Arxiv

20+阅读 · 2020年6月8日

LayoutLM: Pre-training of Text and Layout for Document Image Understanding

LayoutLM: Pre-training of Text and Layout for Document Image Understanding

Arxiv

12+阅读 · 2020年2月19日

3D Backbone Network for 3D Object Detection

Arxiv

12+阅读 · 2019年1月24日

Improving Multiple Object Tracking with Optical Flow and Edge Preprocessing

Arxiv

10+阅读 · 2018年1月29日

相关基金

基于多源视频的大范围场景目标跟踪

国家自然科学基金

2+阅读 · 2015年12月31日

基于回归的视角转换框架下的多视角行人步态识别

国家自然科学基金

2+阅读 · 2014年12月31日

求解可分凸规划的并行分裂算法研究

国家自然科学基金

0+阅读 · 2013年12月31日

非一致指数二分与伪轨跟踪

国家自然科学基金

0+阅读 · 2013年12月31日

车载自组网实时协作定位系统及多数据源融合算法研究

国家自然科学基金

0+阅读 · 2012年12月31日

生物启发的视觉目标搜索和定位研究

国家自然科学基金

0+阅读 · 2012年12月31日

拓扑连通性保持与目标任务共同引导的多智能体跨层协同控制

国家自然科学基金

2+阅读 · 2011年12月31日

非曼哈顿结构下带粒子群优化的VLSI总体布线算法研究

国家自然科学基金

0+阅读 · 2011年12月31日

汽车复杂约束下的多目标集成控制研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于先验三维模型的车辆监控关键算法研究

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员