With the recent development of Deep Learning applied to Computer Vision, sport video understanding has gained a lot of attention, providing much richer information for both sport consumers and leagues. This paper introduces DeepSportradar-v1, a suite of computer vision tasks, datasets and benchmarks for automated sport understanding. The main purpose of this framework is to close the gap between academic research and real world settings. To this end, the datasets provide high-resolution raw images, camera parameters and high quality annotations. DeepSportradar currently supports four challenging tasks related to basketball: ball 3D localization, camera calibration, player instance segmentation and player re-identification. For each of the four tasks, a detailed description of the dataset, objective, performance metrics, and the proposed baseline method are provided. To encourage further research on advanced methods for sport understanding, a competition is organized as part of the MMSports workshop from the ACM Multimedia 2022 conference, where participants have to develop state-of-the-art methods to solve the above tasks. The four datasets, development kits and baselines are publicly available.
翻译:随着计算机视野应用的深入学习的最近发展,体育视频理解引起了人们的极大关注,为体育消费者和联盟提供了更丰富的信息。本文介绍了DeepSportradar-v1,这是一套自动体育理解的计算机视觉任务、数据集和基准。这一框架的主要目的是缩小学术研究和现实世界背景之间的差距。为此,数据集提供了高分辨率原始图像、相机参数和高质量的说明。深体育网络目前支持与篮球有关的四项挑战性任务:3D球本地化、相机校准、播放器实例分解和播放器再识别。对这四项任务中的每一项都提供了数据集、目标、性能指标和拟议基线方法的详细说明。为了鼓励进一步研究体育理解的先进方法,在ACM MMSport 2022会议上组织了一场竞赛,作为MMSport讲习班的一部分,参加者必须制定解决上述任务的最新方法。四种数据集、开发工具包和基线都公开提供。