Facial image inpainting is a challenging problem as it requires generating new pixels that include semantic information for masked key components in a face, e.g., eyes and nose. Recently, remarkable methods have been proposed in this field. Most of these approaches use encoder-decoder architectures and have different limitations such as allowing unique results for a given image and a particular mask. Alternatively, some optimization-based approaches generate promising results using different masks with generator networks. However, these approaches are computationally more expensive. In this paper, we propose an efficient solution to the facial image inpainting problem using the Cyclic Reverse Generator (CRG) architecture, which provides an encoder-generator model. We use the encoder to embed a given image to the generator space and incrementally inpaint the masked regions until a plausible image is generated; we trained a discriminator model to assess the quality of the generated images during the iterations and determine the convergence. After the generation process, for the post-processing, we utilize a Unet model that we trained specifically for this task to remedy the artifacts close to the mask boundaries. We empirically observed that only a few iterations are sufficient to generate realistic images with the proposed model. Since the models are not trained for particular mask types, our method allows applying sketch-based inpaintings, using a variety of mask types, and producing multiple and diverse results. We compared our method with the state-of-the-art models both quantitatively and qualitatively, and observed that our method can compete with the other models in all mask types; it is particularly better in images where larger masks are utilized. Our code, dataset and models are available at: https://github.com/yahyadogan72/iterative facial image inpainting.
翻译:涂鸦中的表面图像是一个具有挑战性的问题, 因为它需要生成新的像素, 包括面部遮盖关键部件的语义信息, 例如眼睛和鼻子。 最近, 在这一领域提出了显著的方法 。 这些方法大多使用 encoder- decoder 结构, 并有不同的局限性, 例如允许给特定图像和特定遮罩带来独特的结果 。 或者, 某些基于优化的方法使用发电机网络的不同遮罩产生有希望的结果 。 然而, 这些方法在计算上成本更高 。 在本文中, 我们建议用 Cyclic Reverse 生成器( CRG) 结构来有效解决面部涂色图像的问题 ; 提供一种编码器- 更高级的图像 。 我们使用编码器将一个特定图像嵌入到生成器空间中, 并逐渐插入遮蔽区域, 直到产生一个可信的图像 。 我们训练了一个分析器模型, 用于所有生成后处理, 我们使用一个Unet 模型, 我们专门训练了这个模型, 用来在这种模型中使用了 比较的 类的 。