We present a novel high-fidelity generative adversarial network (GAN) inversion framework that enables attribute editing with image-specific details well-preserved (e.g., background, appearance and illumination). We first formulate GAN inversion as a lossy data compression problem and carefully discuss the Rate-Distortion-Edit trade-off. Due to this trade-off, previous works fail to achieve high-fidelity reconstruction while keeping compelling editing ability with a low bit-rate latent code only. In this work, we propose a distortion consultation approach that employs the distortion map as a reference for reconstruction. In the distortion consultation inversion (DCI), the distortion map is first projected to a high-rate latent map, which then complements the basic low-rate latent code with (lost) details via consultation fusion. To achieve high-fidelity editing, we propose an adaptive distortion alignment (ADA) module with a self-supervised training scheme. Extensive experiments in the face and car domains show a clear improvement in terms of both inversion and editing quality.
翻译:我们提出了一个新颖的高纤维基因对抗网络(GAN)倒置框架,使编辑与图像特有细节(例如背景、外观和光照)的属性得到良好的保护(例如背景、外观和光化),我们首先将GAN反向写成数据压缩丢失的问题,并仔细讨论利率扭曲-电磁交换交易。由于这一权衡,先前的工程未能实现高纤维重建,同时只保留低位速率潜伏代码的编辑能力。在这项工作中,我们建议采用扭曲协商方法,将扭曲地图用作重建的参考。在扭曲反向协商(DCI)中,扭曲地图首先被预测为高比例潜伏图,然后通过协商聚合(Loission)对基本低利率潜伏代码进行补充。为了实现高纤维化编辑,我们建议采用自强的训练计划,调整扭曲调整模块。在面部和汽车领域的广泛实验显示在转换和编辑质量两方面都有明显改进。