瓦迪斯,Skeleton行动识别? (Quo Vadis, Skeleton Action Recognition ?)

In this paper, we study current and upcoming frontiers across the landscape of skeleton-based human action recognition. To study skeleton-action recognition in the wild, we introduce Skeletics-152, a curated and 3-D pose-annotated subset of RGB videos sourced from Kinetics-700, a large-scale action dataset. We extend our study to include out-of-context actions by introducing Skeleton-Mimetics, a dataset derived from the recently introduced Mimetics dataset. We also introduce Metaphorics, a dataset with caption-style annotated YouTube videos of the popular social game Dumb Charades and interpretative dance performances. We benchmark state-of-the-art models on the NTU-120 dataset and provide multi-layered assessment of the results. The results from benchmarking the top performers of NTU-120 on the newly introduced datasets reveal the challenges and domain gap induced by actions in the wild. Overall, our work characterizes the strengths and limitations of existing approaches and datasets. Via the introduced datasets, our work enables new frontiers for human action recognition.

翻译：在本文中,我们研究基于骨骼的人类行动认知的地貌的当前和即将到来的边界。为了研究野生的骨架行动识别,我们引入了Sleetictics-152,一个由“动因-700”组成的大规模行动数据集,根据“动因-700”提供的RGB视频集集集集整理和3D“3-D”附加说明。我们扩展了我们的研究,通过引入由最近推出的“模拟数据集”产生的数据集,包括了“超脱脂行动”。我们还引入了“Memetics”数据集,这是一个带有流行的社会游戏“哑剧”和解释性舞蹈表演的注解的YouTube视频的插图式数据集。我们在NTU-120数据集上对最新版“NTU-120”的顶级表演者基准测试结果进行基准评估,揭示了野生行动带来的挑战和领域差距。总体而言,我们的工作体现了现有方法和数据集的优势和局限性。通过引入的数据集,我们的工作为人类行动提供了新的前沿。

相关内容

数据集

关注 88

数据集，又称为资料集、数据集合或资料集合，是一种由数据所组成的集合。
Data set（或dataset）是一个数据的集合，通常以表格形式出现。每一列代表一个特定变量。每一行都对应于某一成员的数据集的问题。它列出的价值观为每一个变量，如身高和体重的一个物体或价值的随机数。每个数值被称为数据资料。对应于行数，该数据集的数据可能包括一个或多个成员。

专知会员服务

39+阅读 · 2020年11月3日

近期必读的五篇计算机视觉顶会CVPR 2020【图神经网络 (GNN) 】相关论文-Part 3

专知会员服务

90+阅读 · 2020年5月19日

【CVPR2020-小鹏汽车】判别性多模态语音识别, Discriminative Multi-modality SR

专知会员服务

41+阅读 · 2020年5月13日

【CVPR2020】用于细粒度动作识别的多模式域自适应，Multi-Modal Domain Adaptation for Fine-Grained Action Recognition

专知会员服务

78+阅读 · 2020年2月25日

【NeurIPS2019】高性能浅层RNN的类脑目标识别（Brain-Like Object Recognition with High-Performing Shallow Recurrent ANNs）