MetaAvatar:从很少深度的图像中学习有活力布衣人类模型 (MetaAvatar: Learning Animatable Clothed Human Models from Few Depth Images)

In this paper, we aim to create generalizable and controllable neural signed distance fields (SDFs) that represent clothed humans from monocular depth observations. Recent advances in deep learning, especially neural implicit representations, have enabled human shape reconstruction and controllable avatar generation from different sensor inputs. However, to generate realistic cloth deformations from novel input poses, watertight meshes or dense full-body scans are usually needed as inputs. Furthermore, due to the difficulty of effectively modeling pose-dependent cloth deformations for diverse body shapes and cloth types, existing approaches resort to per-subject/cloth-type optimization from scratch, which is computationally expensive. In contrast, we propose an approach that can quickly generate realistic clothed human avatars, represented as controllable neural SDFs, given only monocular depth images. We achieve this by using meta-learning to learn an initialization of a hypernetwork that predicts the parameters of neural SDFs. The hypernetwork is conditioned on human poses and represents a clothed neural avatar that deforms non-rigidly according to the input poses. Meanwhile, it is meta-learned to effectively incorporate priors of diverse body shapes and cloth types and thus can be much faster to fine-tune, compared to models trained from scratch. We qualitatively and quantitatively show that our approach outperforms state-of-the-art approaches that require complete meshes as inputs while our approach requires only depth frames as inputs and runs orders of magnitudes faster. Furthermore, we demonstrate that our meta-learned hypernetwork is very robust, being the first to generate avatars with realistic dynamic cloth deformations given as few as 8 monocular depth frames.

翻译：在本文中,我们的目标是创建可笼统且可控的神经标志的距离场(SDFs),它们代表着单心深度观测的人类。最近深层学习的进展,特别是神经隐含的表达方式,使得人类形状的重建和可控的阿凡达体生成了不同感官输入物。然而,为了产生现实的布质变形,我们通常需要用新输入物来生成,水密胶片或密集的全体扫描通常需要作为输入物。此外,由于难以有效模拟不同身体形状和布型的成形自制布质变形,现有方法从擦入开始就采用每个部位/布型的优化,这是计算成本昂贵的。相比之下,我们建议的一种方法可以快速生成现实的衣质变的人类腹形变形变形变形变形变形变形,因为只有单心深度的图像,我们通过元化学习来学习一种超网络的初始化,可以预测神经变形变形变形变形变形变形的参数。超级网络只能以人体变形变形变形和变形变形的内脏方法为条件,我们首先需要穿的神经变形变形变形变形变形变形的神经的神经变形,而要将变形变形变形变形变形变形变形变形变形变形的内变形变形变形,同时要显示和变形变形变形的变形的变形变形变形的变形的变形的变形的变形的变形变形变形变形的变形变形变形变形变形变形变形变形变形的变形的变形的变形的变形的变形变形变形的变形变形变形的变形变形变形变形变形变形变形变形体,要的变形变形变形的变形体,要的变形和变形变形的变形和变形的变形的变形的变形变形变形的变形的变形变形的变形变形的变形的变形的变形和变形的变形的变形的变形的变形变形变形变形变形变形的变形的变形变形变形的变形体,

相关内容

MoDELS

关注 30

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

161+阅读 · 2020年3月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

52+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

45+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

专知会员服务

56+阅读 · 2019年10月17日