Real 图像基于前端编辑的“ 真实图像” 轴向图 (Pivotal Tuning for Latent-based Editing of Real Images)

Recently, a surge of advanced facial editing techniques have been proposed that leverage the generative power of a pre-trained StyleGAN. To successfully edit an image this way, one must first project (or invert) the image into the pre-trained generator's domain. As it turns out, however, StyleGAN's latent space induces an inherent tradeoff between distortion and editability, i.e. between maintaining the original appearance and convincingly altering some of its attributes. Practically, this means it is still challenging to apply ID-preserving facial latent-space editing to faces which are out of the generator's domain. In this paper, we present an approach to bridge this gap. Our technique slightly alters the generator, so that an out-of-domain image is faithfully mapped into an in-domain latent code. The key idea is pivotal tuning - a brief training process that preserves the editing quality of an in-domain latent region, while changing its portrayed identity and appearance. In Pivotal Tuning Inversion (PTI), an initial inverted latent code serves as a pivot, around which the generator is fined-tuned. At the same time, a regularization term keeps nearby identities intact, to locally contain the effect. This surgical training process ends up altering appearance features that represent mostly identity, without affecting editing capabilities. We validate our technique through inversion and editing metrics, and show preferable scores to state-of-the-art methods. We further qualitatively demonstrate our technique by applying advanced edits (such as pose, age, or expression) to numerous images of well-known and recognizable identities. Finally, we demonstrate resilience to harder cases, including heavy make-up, elaborate hairstyles and/or headwear, which otherwise could not have been successfully inverted and edited by state-of-the-art methods.

翻译：最近, 推荐了一个先进的面部编辑技术, 利用受过训练的 StyleGAN 的基因能力。要成功编辑一个图像, 就必须首先将图像投放到经过训练的发电机域。但是, 事实证明, StyleGAN 的潜伏空间在扭曲和可编辑性之间产生了内在的权衡, 也就是说, 在保持原始外观和令人信服地改变其某些属性之间, 也就是说, 使用身份识别和外观编辑仍然具有挑战性。在Pivatal Talning Inversion (PTI) 中, 初始的反向暗面显示的是更高级的图像, 在本文中, 我们的技术稍稍有改变, 使外向外的图像忠实地映射到一个内部隐蔽的代码。关键的想法是调整一个简短的培训过程, 保持一个内在隐蔽的区域的编辑质量, 同时改变其描绘的身份和外观。在Pivotal Tivate (PTI) 中, 初始的隐形代码作为更高级的直角,, 在本文中, 显示更高级的直观的直观的直观的图像,, 显示着的直观的直观的直观的直观的直观的直观,, 显示着的直观的直观的直观, 显示着的直方, 直观, 直观, 直观, 显示着的直观的直观, 直观的直观的直观的直观的直观, 直观, 直观的直观的直观, 直观, 直观的直观, 直观的直观, 直观的直观的直观的直观的直观的直观的直观的直观的直观的直观, 显示到直观, 直观, 直观, 直观, 直观, 直观, 直观, 直观, 直观的直观, 直观, 直观, 直观, 直观, 直观, 直观, 直观, 直观, 直观, 直观, 直观, 显示的直观, 直观, 直观, 直观, 直观, 直观,