To understand human behaviors, action recognition based on videos is a common approach. Compared with image-based action recognition, videos provide much more information. Reducing the ambiguity of actions and in the last decade, many works focused on datasets, novel models and learning approaches have improved video action recognition to a higher level. However, there are challenges and unsolved problems, in particular in sports analytics where data collection and labeling are more sophisticated, requiring sport professionals to annotate data. In addition, the actions could be extremely fast and it becomes difficult to recognize them. Moreover, in team sports like football and basketball, one action could involve multiple players, and to correctly recognize them, we need to analyse all players, which is relatively complicated. In this paper, we present a survey on video action recognition for sports analytics. We introduce more than ten types of sports, including team sports, such as football, basketball, volleyball, hockey and individual sports, such as figure skating, gymnastics, table tennis, tennis, diving and badminton. Then we compare numerous existing frameworks for sports analysis to present status quo of video action recognition in both team sports and individual sports. Finally, we discuss the challenges and unsolved problems in this area and to facilitate sports analytics, we develop a toolbox using PaddlePaddle, which supports football, basketball, table tennis and figure skating action recognition.
翻译:为了理解人类的行为,基于视频的行动识别是一种常见的方法。 与基于图像的行动识别相比, 视频可以提供更多的信息。 减少行动的模糊性, 在过去十年里,许多侧重于数据集、新模式和学习方法的工作提高了视频动作识别的高度。 然而,存在挑战和未解决的问题,特别是在体育分析方面,数据收集和标签更为精密,要求体育专业人员对数据进行批注。此外,行动可能非常快,难以识别。 此外,在足球和篮球等团队运动中,一项行动可能涉及多个球员,并且为了正确识别它们,我们需要分析所有球员,而这是相对复杂的。 在本文中,我们对体育解析的视频动作识别进行了调查。 我们引入了十多种体育类型,包括团队运动,如足球、篮球、排球、曲和个人体育,例如图解、体操、网球、网球、潜水和羽毛球。 然后,我们将许多现有的体育分析框架与目前状态的足球行动识别状况进行了比较,我们最后在团队和个体体育中,我们用脚踏的动作识别方式, 来讨论一个非体育识别领域。