对 3D 人类形状和粒子估计多层次关注的编码器编码器编码器编码器和编码器编码器 (Encoder-decoder with Multi-level Attention for 3D Human Shape and Pose Estimation) - 专知论文

会员服务 ·

0

估计/估计量 · 注意力机制 · 3D · 块 · 塑造 ·

2021 年 9 月 6 日

Encoder-decoder with Multi-level Attention for 3D Human Shape and Pose Estimation

翻译：对 3D 人类形状和粒子估计多层次关注的编码器编码器编码器编码器和编码器编码器

Ziniu Wan,Zhengjia Li,Maoqing Tian,Jianbo Liu,Shuai Yi,Hongsheng Li

3D human shape and pose estimation is the essential task for human motion analysis, which is widely used in many 3D applications. However, existing methods cannot simultaneously capture the relations at multiple levels, including spatial-temporal level and human joint level. Therefore they fail to make accurate predictions in some hard scenarios when there is cluttered background, occlusion, or extreme pose. To this end, we propose Multi-level Attention Encoder-Decoder Network (MAED), including a Spatial-Temporal Encoder (STE) and a Kinematic Topology Decoder (KTD) to model multi-level attentions in a unified framework. STE consists of a series of cascaded blocks based on Multi-Head Self-Attention, and each block uses two parallel branches to learn spatial and temporal attention respectively. Meanwhile, KTD aims at modeling the joint level attention. It regards pose estimation as a top-down hierarchical process similar to SMPL kinematic tree. With the training set of 3DPW, MAED outperforms previous state-of-the-art methods by 6.2, 7.2, and 2.4 mm of PA-MPJPE on the three widely used benchmarks 3DPW, MPI-INF-3DHP, and Human3.6M respectively. Our code is available at https://github.com/ziniuwan/maed.

翻译：3D人类形状和估计是人类运动分析的基本任务,在许多3D应用中广泛使用。然而,现有方法不能同时捕捉多层次的关系,包括空间-时空水平和人类联合水平。因此,当背景、封闭性或极端面形成时,它们无法在某些硬情景中作出准确预测。为此,我们提议多层关注编码-Decoder网络(MAED),包括空间-时空编码器和九元表层解码器(KTD),以在统一的框架内模拟多层次的关注。STE由一系列基于多领导人自我关注的级联队组成,每个区都使用两个平行分支分别学习时空关注。与此同时,KTD旨在模拟联合关注的模型。它把估计视为一个上下层的级别进程,类似于SMPL运动树。在3DPW的培训中,MAED超越了以6.2、7.2、7.2和2.4毫米MAM-MDM 分别用于3MA-MAP 和2.4毫米 MAMPA-MA-MA-MA-MA-M-M-C 3号基准中。

0

相关内容

估计/估计量

估计/估计量

“CVPR 2021 接受论文列表 1663篇论文都在这了

专知会员服务

32+阅读 · 2021年6月12日

【CVPR2021】基于结构保持的弱监督目标定位

专知会员服务

16+阅读 · 2021年6月6日

【CVPR 2021】变换器跟踪TransT: Transformer Tracking

【CVPR 2021】变换器跟踪TransT: Transformer Tracking

专知会员服务

22+阅读 · 2021年4月20日

【MLSS2020】最新《深度学习基础》视频讲解，42页ppt

【MLSS2020】最新《深度学习基础》视频讲解，42页ppt

专知会员服务

47+阅读 · 2020年8月5日

【三维物体和手部姿态估计】综述论文最新进展，Recent Advances in 3D Object and Hand Pose Estimation

【三维物体和手部姿态估计】综述论文最新进展，Recent Advances in 3D Object and Hand Pose Estimation

专知会员服务

21+阅读 · 2020年6月13日

【视频目标检测与跟踪：综述论文】Video Object Segmentation and Tracking: A Survey

专知会员服务

66+阅读 · 2020年6月4日

【北京大学】CVPR 2020 | PQ-NET：序列化的三维形状生成网络

【北京大学】CVPR 2020 | PQ-NET：序列化的三维形状生成网络

专知会员服务

10+阅读 · 2020年3月20日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

跟踪SLAM前沿动态系列之ICCV2019

跟踪SLAM前沿动态系列之ICCV2019

泡泡机器人SLAM

7+阅读 · 2019年11月23日

极市直播| 重磅！旷视科技研发总监俞刚带来Human pose Estimation直播分享，附代码链接

极市直播| 重磅！旷视科技研发总监俞刚带来Human pose Estimation直播分享，附代码链接

极市平台

4+阅读 · 2019年8月18日

CVPR2019| 05-09更新10篇论文及代码合集（含图像恢复/图神经网络/人体形状重建等）

CVPR2019| 05-09更新10篇论文及代码合集（含图像恢复/图神经网络/人体形状重建等）

极市平台

17+阅读 · 2019年5月9日

Call for Participation: Shared Tasks in NLPCC 2019

Call for Participation: Shared Tasks in NLPCC 2019

中国计算机学会

5+阅读 · 2019年3月22日

CVPR2019 | Stereo R-CNN 3D 目标检测

CVPR2019 | Stereo R-CNN 3D 目标检测

极市平台

27+阅读 · 2019年3月10日

TCN v2 + 3Dconv 运动信息

TCN v2 + 3Dconv 运动信息

CreateAMind

4+阅读 · 2019年1月8日

人体姿态估计资源大列表（Human Pose Estimation）

人体姿态估计资源大列表（Human Pose Estimation）

专知

9+阅读 · 2018年10月6日

计算机视觉领域顶会CVPR 2018 接受论文列表

计算机视觉领域顶会CVPR 2018 接受论文列表

专知

7+阅读 · 2018年5月26日

【CV-Pose estimation】王晓刚教授团队论文PyraNet阅读笔记

【CV-Pose estimation】王晓刚教授团队论文PyraNet阅读笔记

极市平台

6+阅读 · 2017年12月16日

【论文】【论文】王晓刚老师课题组ICCV2017论文：学习特征金字塔用于人体姿态估计（附代码）

【论文】【论文】王晓刚老师课题组ICCV2017论文：学习特征金字塔用于人体姿态估计（附代码）

机器学习研究会

6+阅读 · 2017年8月5日

Deep Two-Stream Video Inference for Human Body Pose and Shape Estimation

Arxiv

0+阅读 · 2021年10月22日

Occlusion-Robust Object Pose Estimation with Holistic Representation

Arxiv

0+阅读 · 2021年10月22日

Modeling Multi-Label Action Dependencies for Temporal Action Localization

Arxiv

3+阅读 · 2021年3月4日

Reverse Attention for Salient Object Detection

Arxiv

11+阅读 · 2019年4月15日

3D Hand Shape and Pose Estimation from a Single RGB Image

3D Hand Shape and Pose Estimation from a Single RGB Image

Arxiv

17+阅读 · 2019年3月3日

Object detection at 200 Frames Per Second

Arxiv

5+阅读 · 2018年5月16日

Fine-Grained Head Pose Estimation Without Keypoints

Arxiv

5+阅读 · 2018年4月13日

3D Pose Estimation and 3D Model Retrieval for Objects in the Wild

Arxiv

7+阅读 · 2018年3月30日

2D-3D Pose Consistency-based Conditional Random Fields for 3D Human Pose Estimation

Arxiv

3+阅读 · 2017年12月28日

Detect-and-Track: Efficient Pose Estimation in Videos

Arxiv

7+阅读 · 2017年12月26日

VIP会员

文章信息

相关主题

估计/估计量

注意力机制

相关VIP内容

“CVPR 2021 接受论文列表 1663篇论文都在这了

专知会员服务

32+阅读 · 2021年6月12日

【CVPR2021】基于结构保持的弱监督目标定位

专知会员服务

16+阅读 · 2021年6月6日

【CVPR 2021】变换器跟踪TransT: Transformer Tracking

【CVPR 2021】变换器跟踪TransT: Transformer Tracking

专知会员服务

22+阅读 · 2021年4月20日

【MLSS2020】最新《深度学习基础》视频讲解，42页ppt

【MLSS2020】最新《深度学习基础》视频讲解，42页ppt

专知会员服务

47+阅读 · 2020年8月5日

【三维物体和手部姿态估计】综述论文最新进展，Recent Advances in 3D Object and Hand Pose Estimation

【三维物体和手部姿态估计】综述论文最新进展，Recent Advances in 3D Object and Hand Pose Estimation

专知会员服务

21+阅读 · 2020年6月13日

【视频目标检测与跟踪：综述论文】Video Object Segmentation and Tracking: A Survey

专知会员服务

66+阅读 · 2020年6月4日

【北京大学】CVPR 2020 | PQ-NET：序列化的三维形状生成网络

【北京大学】CVPR 2020 | PQ-NET：序列化的三维形状生成网络

专知会员服务

10+阅读 · 2020年3月20日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

《巡飞弹药（爆炸性无人机）威胁态势分析》最新24页报告

《军用后勤无人机：破解战场运输挑战的创新方案》

人工智能战争：以色列、伊朗与新型AI战争形态

《俄乌战争：现代战争未来的启示与经验》

相关资讯

跟踪SLAM前沿动态系列之ICCV2019

跟踪SLAM前沿动态系列之ICCV2019

泡泡机器人SLAM

7+阅读 · 2019年11月23日

极市直播| 重磅！旷视科技研发总监俞刚带来Human pose Estimation直播分享，附代码链接

极市直播| 重磅！旷视科技研发总监俞刚带来Human pose Estimation直播分享，附代码链接

极市平台

4+阅读 · 2019年8月18日

CVPR2019| 05-09更新10篇论文及代码合集（含图像恢复/图神经网络/人体形状重建等）

CVPR2019| 05-09更新10篇论文及代码合集（含图像恢复/图神经网络/人体形状重建等）

极市平台

17+阅读 · 2019年5月9日

Call for Participation: Shared Tasks in NLPCC 2019

Call for Participation: Shared Tasks in NLPCC 2019

中国计算机学会

5+阅读 · 2019年3月22日

CVPR2019 | Stereo R-CNN 3D 目标检测

CVPR2019 | Stereo R-CNN 3D 目标检测

极市平台

27+阅读 · 2019年3月10日

TCN v2 + 3Dconv 运动信息

TCN v2 + 3Dconv 运动信息

CreateAMind

4+阅读 · 2019年1月8日

人体姿态估计资源大列表（Human Pose Estimation）

人体姿态估计资源大列表（Human Pose Estimation）

专知

9+阅读 · 2018年10月6日

计算机视觉领域顶会CVPR 2018 接受论文列表

计算机视觉领域顶会CVPR 2018 接受论文列表

专知

7+阅读 · 2018年5月26日

【CV-Pose estimation】王晓刚教授团队论文PyraNet阅读笔记

【CV-Pose estimation】王晓刚教授团队论文PyraNet阅读笔记

极市平台

6+阅读 · 2017年12月16日

【论文】【论文】王晓刚老师课题组ICCV2017论文：学习特征金字塔用于人体姿态估计（附代码）

【论文】【论文】王晓刚老师课题组ICCV2017论文：学习特征金字塔用于人体姿态估计（附代码）

机器学习研究会

6+阅读 · 2017年8月5日

相关论文

Deep Two-Stream Video Inference for Human Body Pose and Shape Estimation

Arxiv

0+阅读 · 2021年10月22日

Occlusion-Robust Object Pose Estimation with Holistic Representation

Arxiv

0+阅读 · 2021年10月22日

Modeling Multi-Label Action Dependencies for Temporal Action Localization

Arxiv

3+阅读 · 2021年3月4日

Reverse Attention for Salient Object Detection

Arxiv

11+阅读 · 2019年4月15日

3D Hand Shape and Pose Estimation from a Single RGB Image

3D Hand Shape and Pose Estimation from a Single RGB Image

Arxiv

17+阅读 · 2019年3月3日

Object detection at 200 Frames Per Second

Arxiv

5+阅读 · 2018年5月16日

Fine-Grained Head Pose Estimation Without Keypoints

Arxiv

5+阅读 · 2018年4月13日

3D Pose Estimation and 3D Model Retrieval for Objects in the Wild

Arxiv

7+阅读 · 2018年3月30日

2D-3D Pose Consistency-based Conditional Random Fields for 3D Human Pose Estimation

Arxiv

3+阅读 · 2017年12月28日

Detect-and-Track: Efficient Pose Estimation in Videos

Arxiv

7+阅读 · 2017年12月26日

微信扫码咨询专知VIP会员