Although much progress has been made in 3D clothed human reconstruction, most of the existing methods fail to produce robust results from in-the-wild images, which contain diverse human poses and appearances. This is mainly due to the large domain gap between training datasets and in-the-wild datasets. The training datasets are usually synthetic ones, which contain rendered images from GT 3D scans. However, such datasets contain simple human poses and less natural image appearances compared to those of real in-the-wild datasets, which makes generalization of it to in-the-wild images extremely challenging. To resolve this issue, in this work, we propose ClothWild, a 3D clothed human reconstruction framework that firstly addresses the robustness on in-thewild images. First, for the robustness to the domain gap, we propose a weakly supervised pipeline that is trainable with 2D supervision targets of in-the-wild datasets. Second, we design a DensePose-based loss function to reduce ambiguities of the weak supervision. Extensive empirical tests on several public in-the-wild datasets demonstrate that our proposed ClothWild produces much more accurate and robust results than the state-of-the-art methods. The codes are available in here: https://github.com/hygenie1228/ClothWild_RELEASE.
翻译:尽管在3D布衣人重建方面取得了很大进展,但大多数现有方法都未能从含有不同人形和外观的圆形图像中产生强有力的结果,这主要是由于培训数据集和圆形数据集之间存在巨大的领域差距。培训数据集通常是合成数据集,包含GT 3D扫描的图像。然而,这些数据集包含简单的人形,与实在维格数据集相比,其自然形象的显示较少。第二,我们设计了一个基于登斯-波斯的丢失功能,以减少监管不力的模糊性。在这项工作中,我们提议ClothWild,一个3D布衣人重建框架,首先解决网上图像的稳健性。首先,为了对域格扫描的图像的稳健性,我们建议一个监管不力的管道,由2D监督目标在网络内部数据集中进行训练。第二,我们设计了一个基于Dense-Pose的丢失功能,以降低监管不力的模糊性。广度的经验测试显示,在几个公众中,在网上生产更稳健固的、更精确的版本数据方法中,这是我们提出的。