While recent work has shown progress on extracting clothed 3D human avatars from a single image, video, or a set of 3D scans, several limitations remain. Most methods use a holistic representation to jointly model the body and clothing, which means that the clothing and body cannot be separated for applications like virtual try-on. Other methods separately model the body and clothing, but they require training from a large set of 3D clothed human meshes obtained from 3D/4D scanners or physics simulations. Our insight is that the body and clothing have different modeling requirements. While the body is well represented by a mesh-based parametric 3D model, implicit representations and neural radiance fields are better suited to capturing the large variety in shape and appearance present in clothing. Building on this insight, we propose SCARF (Segmented Clothed Avatar Radiance Field), a hybrid model combining a mesh-based body with a neural radiance field. Integrating the mesh into the volumetric rendering in combination with a differentiable rasterizer enables us to optimize SCARF directly from monocular videos, without any 3D supervision. The hybrid modeling enables SCARF to (i) animate the clothed body avatar by changing body poses (including hand articulation and facial expressions), (ii) synthesize novel views of the avatar, and (iii) transfer clothing between avatars in virtual try-on applications. We demonstrate that SCARF reconstructs clothing with higher visual quality than existing methods, that the clothing deforms with changing body pose and body shape, and that clothing can be successfully transferred between avatars of different subjects. The code and models are available at https://github.com/YadiraF/SCARF.
翻译:虽然最近的工作显示,在从单一图像、视频或一组3D扫描中提取3D人的衣着模具方面取得了进展,但仍有一些限制。大多数方法都使用整体表示法来共同模拟体形和服装,这意味着不能将衣着和身体分开用于虚拟试穿等应用。其他方法分别模拟体型和服装,但需要从3D/4D扫描仪或物理模拟中获取的大批3D穿人衣模具进行训练。我们的见识是,体形和服装有不同的虚拟模型要求。虽然体形和服装由基于网状的3D参数的3D模型、隐含性表示法和神经光亮场字段来很好地代表,以共同的体形和外观来模拟体形和外观。我们建议SACRF(Sload Affatar Radiance Fieldation Field) 混合模型,将基于色色色的体形体形和面体型结构化(我们将内变形的体形和面型体形的体型化) 能够通过单色的变体变色显示,使机型的体型和体型的体型结构变形显示能够的变形显示。