The automatic detection of events in complex sports games like soccer and handball using positional or video data is of large interest in research and industry. One requirement is a fundamental understanding of underlying concepts, i.e., events that occur on the pitch. Previous work often deals only with so-called low-level events based on well-defined rules such as free kicks, free throws, or goals. High-level events, such as passes, are less frequently approached due to a lack of consistent definitions. This introduces a level of ambiguity that necessities careful validation when regarding event annotations. Yet, this validation step is usually neglected as the majority of studies adopt annotations from commercial providers on private datasets of unknown quality and focuses on soccer only. To address these issues, we present (1) a universal taxonomy that covers a wide range of low and high-level events for invasion games and is exemplarily refined to soccer and handball, and (2) release two multi-modal datasets comprising video and positional data with gold-standard annotations to foster research in fine-grained and ball-centered event spotting. Experiments on human performance demonstrate the robustness of the proposed taxonomy, and that disagreements and ambiguities in the annotation increase with the complexity of the event. An I3D model for video classification is adopted for event spotting and reveals the potential for benchmarking. Datasets are available at: https://github.com/mm4spa/eigd
翻译:利用定位或视频数据自动检测足球和手球等复杂体育比赛中的事件,对于研究和行业来说,这是一个非常感兴趣的问题。一项要求是对基本概念的基本理解,即现场发生的事件。以前的工作通常只涉及基于自由踢球、免费投球或目标等明确界定的规则的所谓低层次活动。由于缺乏一致的定义,诸如通行证等高级别活动较不经常得到处理。这造成了一定程度的模糊,在对事件说明进行仔细验证时需要仔细验证。然而,这一验证步骤通常被忽视,因为大多数研究都采纳商业提供者关于私人数据组合的说明,即只注重足球。为了解决这些问题,我们提出(1) 一种普遍性的分类学,涵盖广泛的低层次和高级别事件,例如免费踢、免费投球或目标。 由于缺乏一致的定义,因此较不经常地处理诸如通行证等高级别活动。(2) 公布两个包含视频和定位数据的多模式数据集,配有黄金标准说明,以促进对精细度和以球为中心的活动进行考察。关于人类业绩的实验表明,在拟议的分类中,在可能发生的不透明性、不透明性事件方面,在提议的分类中显示数据分析中,在采用的数据显示数据分析/估价方面,其复杂性是可能的。