HybrIK-X: Hybrid Analytical-Neural Inverse Kinematics for Whole-body Mesh Recovery - Zhuanzhi Papers

Tags: mesh · reconstruction · point estimation · hybrid · inference

April 12, 2023

HybrIK-X: Hybrid Analytical-Neural Inverse Kinematics for Whole-body Mesh Recovery


Jiefeng Li,Siyuan Bian,Chao Xu,Zhicun Chen,Lixin Yang,Cewu Lu

From arXiv. Comments: an eXpressive extension of HybrIK [arXiv:2011.14672] that supports SMPL-X. arXiv admin note: substantial text overlap with arXiv:2011.14672.

Recovering whole-body mesh by inferring the abstract pose and shape parameters from visual content can obtain 3D bodies with realistic structures. However, the inferring process is highly non-linear and suffers from image-mesh misalignment, resulting in inaccurate reconstruction. In contrast, 3D keypoint estimation methods utilize the volumetric representation to achieve pixel-level accuracy but may predict unrealistic body structures. To address these issues, this paper presents a novel hybrid inverse kinematics solution, HybrIK, that integrates the merits of 3D keypoint estimation and body mesh recovery in a unified framework. HybrIK directly transforms accurate 3D joints to body-part rotations via twist-and-swing decomposition. The swing rotations are analytically solved with 3D joints, while the twist rotations are derived from visual cues through neural networks. To capture comprehensive whole-body details, we further develop a holistic framework, HybrIK-X, which enhances HybrIK with articulated hands and an expressive face. HybrIK-X is fast and accurate by solving the whole-body pose with a one-stage model. Experiments demonstrate that HybrIK and HybrIK-X preserve both the accuracy of 3D joints and the realistic structure of the parametric human model, leading to pixel-aligned whole-body mesh recovery. The proposed method significantly surpasses the state-of-the-art methods on various benchmarks for body-only, hand-only, and whole-body scenarios. Code and results can be found at https://jeffli.site/HybrIK-X/
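The twist-and-swing decomposition mentioned in the abstract can be illustrated with a small sketch. This is not the authors' implementation; it is a generic swing-twist construction under stated assumptions: a single bone with a rest-pose (template) direction `t` and a target direction `p` taken from estimated 3D joints, and a twist angle that is simply passed in as a number (in HybrIK it would come from a neural network). All function names here are hypothetical.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]

def normalize(v):
    n = math.sqrt(dot(v, v))
    return [x / n for x in v]

def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def matvec(A, v):
    return [dot(row, v) for row in A]

def rodrigues(axis, angle):
    """Rotation matrix for `angle` radians about the unit vector `axis`."""
    x, y, z = axis
    c, s = math.cos(angle), math.sin(angle)
    k = 1.0 - c
    return [[c + x * x * k, x * y * k - z * s, x * z * k + y * s],
            [y * x * k + z * s, c + y * y * k, y * z * k - x * s],
            [z * x * k - y * s, z * y * k + x * s, c + z * z * k]]

def swing_rotation(t, p):
    """Analytic swing: the minimal rotation taking direction t to direction p."""
    t, p = normalize(t), normalize(p)
    axis = cross(t, p)
    s = math.sqrt(dot(axis, axis))  # sine of the angle between t and p
    c = dot(t, p)                   # cosine of the angle between t and p
    if s < 1e-8:
        if c > 0:
            # Already aligned: identity rotation.
            return [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
        # Opposite directions: rotate by pi about any axis perpendicular to t.
        helper = [1.0, 0.0, 0.0] if abs(t[0]) < 0.9 else [0.0, 1.0, 0.0]
        return rodrigues(normalize(cross(t, helper)), math.pi)
    return rodrigues([a / s for a in axis], math.atan2(s, c))

def compose_rotation(t, p, twist_angle):
    """Full bone rotation R = R_swing * R_twist.

    The twist spins the bone about its own rest direction t, so it does not
    change where the bone points; the swing then aligns t with p.
    """
    r_swing = swing_rotation(t, p)
    r_twist = rodrigues(normalize(t), twist_angle)
    return matmul(r_swing, r_twist)

# Example: a bone pointing along +y in the rest pose, observed along +x.
R = compose_rotation([0.0, 1.0, 0.0], [1.0, 0.0, 0.0], twist_angle=0.7)
bone_direction = matvec(R, [0.0, 1.0, 0.0])
```

Whatever twist angle is supplied, the rotated bone still points along the target direction, which is why the joint positions alone determine the swing but leave the twist to be estimated from other cues.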

