To facilitate depth-based 3D action recognition, 3D dynamic voxel (3DV) is proposed as a novel 3D motion representation. With 3D space voxelization, the key idea of 3DV is to encode 3D motion information within depth video into a regular voxel set (i.e., 3DV) compactly, via temporal rank pooling. Each available 3DV voxel intrinsically involves 3D spatial and motion feature jointly. 3DV is then abstracted as a point set and input into PointNet++ for 3D action recognition, in the end-to-end learning way. The intuition for transferring 3DV into the point set form is that, PointNet++ is lightweight and effective for deep feature learning towards point set. Since 3DV may lose appearance clue, a multi-stream 3D action recognition manner is also proposed to learn motion and appearance feature jointly. To extract richer temporal order information of actions, we also divide the depth video into temporal splits and encode this procedure in 3DV integrally. The extensive experiments on 4 well-established benchmark datasets demonstrate the superiority of our proposition. Impressively, we acquire the accuracy of 82.4% and 93.5% on NTU RGB+D 120 [13] with the cross-subject and crosssetup test setting respectively. 3DV's code is available at https://github.com/3huo/3DV-Action.


翻译:为了便利基于深度的 3D 3D 动作识别, 3D 动态 voxel (3DV) 将3D 动态 voxel (3D 3D 3D V) 作为一种新型 3D 动作演示。 3D V 3D 3D 3D 空间 voxel (3D 3D 3D 3D 3D 3D 3D 动作演示, 3D 3D 3D 3D 3xelization 3D 3D 3D 3D 3 的直觉将3D 运动信息在深度视频中通过时间级集合将3D 运动信息集中编码成普通的 Voxel 3V 3D 3D 3D 3D 3D 3D 3 3D 3D 3 3D 3D 3 3D 3D 3D 3D 3D 3 3D 3D 3D 3 3 3D 3D 3D 3D 3D 3D 3D 3D 3D 3D 3D 3D 3D 3D 3 3D 3D 3D 3D 3D 3D 3D 3D 3D 3D 3D 3D 3D 3D 3D 3D 3 3D 3D 3D 3D 3D 3D 3D 3D 3D 3D 3D 3D 3D 3D 3D 3D 3D 3D 3D 3D 3D 3D 3D 3D 3D 3D 3D 3D 3D 3D 3 3 3 3 3D 3 3 3D 3D 3D 3 3 3 3 3D 3D 3 3 3 3 3 3 3 3D 3 3 3 3D 3D 3 3 3 3 3 3 3 3D 3 3 3 3

0
下载
关闭预览

相关内容

3DV(3D视觉)会议提供了一个绝佳的平台,用于传播研究结果,涵盖计算机视觉和图形3D研究领域的广泛主题,包括新型光学传感器,信号处理,几何建模,表示和传输,可视化和交互以及各种应用程序。 官网地址:https://dblp.uni-trier.de/db/conf/3dim/
CVPR2020 | 商汤-港中文等提出PV-RCNN:3D目标检测新网络
专知会员服务
43+阅读 · 2020年4月17日
【阿里巴巴-CVPR2020】频域学习,Learning in the Frequency Domain
抢鲜看!13篇CVPR2020论文链接/开源代码/解读
专知会员服务
49+阅读 · 2020年2月26日
【泡泡汇总】CVPR2019 SLAM Paperlist
泡泡机器人SLAM
14+阅读 · 2019年6月12日
简评 | Video Action Recognition 的近期进展
极市平台
20+阅读 · 2019年4月21日
Unsupervised Learning via Meta-Learning
CreateAMind
41+阅读 · 2019年1月3日
SlowFast Networks for Video Recognition
Arxiv
4+阅读 · 2019年4月18日
Arxiv
5+阅读 · 2018年4月30日
VIP会员
相关VIP内容
Top
微信扫码咨询专知VIP会员