We present a system for realistic one-shot mesh-based human head avatars creation, ROME for short. Using a single photograph, our model estimates a person-specific head mesh and the associated neural texture, which encodes both local photometric and geometric details. The resulting avatars are rigged and can be rendered using a neural network, which is trained alongside the mesh and texture estimators on a dataset of in-the-wild videos. In the experiments, we observe that our system performs competitively both in terms of head geometry recovery and the quality of renders, especially for the cross-person reenactment. See results https://samsunglabs.github.io/rome/
翻译:我们提出了一个现实的以一发网格为基础的人类头形生成系统,即短短的ROME。使用一张照片,我们的模型估计一个人特有的头网和相关的神经纹理,它同时编码了当地的光度和几何细节。由此产生的动因是操纵的,可以使用神经网络来制造,该神经网络与网格和纹理估测器一起,在网上视频数据集上接受培训。在实验中,我们观察到我们的系统在头几何恢复和成份质量两方面都具有竞争力,特别是在跨人再反应方面。见结果https://samsunglabs.github.io/rome/