Action recognition, which is formulated as a task to identify various human actions in a video, has attracted increasing interest from computer vision researchers due to its importance in various applications. Recently, appearance-based methods have achieved promising progress towards accurate action recognition. In general, these methods mainly fulfill the task by applying various schemes to model spatial and temporal visual information effectively. To better understand the current progress of appearance-based action recognition, we provide a comprehensive review of recent achievements in this area. In particular, we summarise and discuss several dozens of related research papers, which can be roughly divided into four categories according to different appearance modelling strategies. The obtained categories include 2D convolutional methods, 3D convolutional methods, motion representation-based methods, and context representation-based methods. We analyse and discuss representative methods from each category, comprehensively. Empirical results are also summarised to better illustrate cutting-edge algorithms. We conclude by identifying important areas for future research gleaned from our categorisation.
翻译:行动承认是作为在视频中确定各种人类行动的一项任务而拟订的,它由于在各种应用中的重要性而吸引了计算机视觉研究人员越来越多的兴趣。最近,以外观为基础的方法在准确行动承认方面取得了有希望的进展。一般而言,这些方法主要是通过应用各种办法有效地模拟空间和时间视觉信息来完成这项任务。为了更好地了解以外观为基础的行动承认目前的进展,我们全面审查了这一领域最近的成绩。特别是,我们总结并讨论了几十份相关的研究论文,根据不同的外观建模战略可以大致分为四类。获得的分类包括2D演进法、3D演进法、以动态代表为基础的方法和以背景代表为基础的方法。我们全面分析和讨论每个类别的代表性方法。我们还总结了经验性结果,以更好地说明尖端算法。我们最后通过从分类中找出未来研究的重要领域。我们的结论是,我们从一个分类中可以找到重要的领域。