JRDB-法案:关于时空行动、社会群体和活动探测的大规模数据集 (JRDB-Act: A Large-scale Dataset for Spatio-temporal Action, Social Group and Activity Detection)

The availability of large-scale video action understanding datasets has facilitated advances in the interpretation of visual scenes containing people. However, learning to recognise human actions and their social interactions in an unconstrained real-world environment comprising numerous people, with potentially highly unbalanced and long-tailed distributed action labels from a stream of sensory data captured from a mobile robot platform remains a significant challenge, not least owing to the lack of a reflective large-scale dataset. In this paper, we introduce JRDB-Act, as an extension of the existing JRDB, which is captured by a social mobile manipulator and reflects a real distribution of human daily-life actions in a university campus environment. JRDB-Act has been densely annotated with atomic actions, comprises over 2.8M action labels, constituting a large-scale spatio-temporal action detection dataset. Each human bounding box is labeled with one pose-based action label and multiple~(optional) interaction-based action labels. Moreover JRDB-Act provides social group annotation, conducive to the task of grouping individuals based on their interactions in the scene to infer their social activities~(common activities in each social group). Each annotated label in JRDB-Act is tagged with the annotators' confidence level which contributes to the development of reliable evaluation strategies. In order to demonstrate how one can effectively utilise such annotations, we develop an end-to-end trainable pipeline to learn and infer these tasks, i.e. individual action and social group detection. The data and the evaluation code is publicly available at https://jrdb.erc.monash.edu/.

翻译：大规模视频行动理解数据集的可用性促进了对包含人的视觉场景的解读。然而,在由众多人组成的不受限制的现实世界环境中,学习承认人类行动及其社会互动,而从移动机器人平台获取的感官数据流中,可能高度不平衡和长尾分布的动作标签,这仍是一个重大挑战,尤其是由于缺乏一个反映性大规模数据集,我们在此文件中引入JRDB-Act,作为现有JRDB的延伸,该JRDB-Act是社会流动操纵器的捕捉,反映大学校园环境中人类日常生活行动的真实分布。JRDB-Ac 以原子行动为高度,由超过2.8M动作标签组成,构成大规模波形时空动作检测数据集。每个人类捆绑框都标有一种基于表面的动作标签和多个(可选)基于互动的动作标签。此外,JRDB-AC提供社会群体的评估,有助于根据他们在现场的交互互动情况对个人进行分组任务进行分类。JRD-Ac在每组中,每个组织都展示一个最新的社交活动。

相关内容

GROUP

关注 1

Group一直是研究计算机支持的合作工作、人机交互、计算机支持的协作学习和社会技术研究的主要场所。该会议将社会科学、计算机科学、工程、设计、价值观以及其他与小组工作相关的多个不同主题的工作结合起来，并进行了广泛的概念化。官网链接：https://group.acm.org/conferences/group20/

【DSAA教程】可解释人工智能金融服务，325页ppt，Explainable AI in Financial Services

专知会员服务

43+阅读 · 2021年10月21日

生成式对抗网络异常检测，GANs for Anomaly Detection

专知会员服务

34+阅读 · 2021年9月16日

【深度伪造综述论文】The Creation and Detection of Deepfakes: A Survey

专知会员服务

55+阅读 · 2020年4月26日

【北卡罗莱纳州立大学】单场景视频异常检测综述，A Survey of Single-Scene Video Anomaly Detection

专知会员服务

31+阅读 · 2020年4月13日