Most existing methods for image inpainting focus on learning the intra-image priors from the known regions of the current input image to infer the content of the corrupted regions in the same image. While such methods perform well on images with small corrupted regions, it is challenging for these methods to deal with images with large corrupted area due to two potential limitations: 1) such methods tend to overfit each single training pair of images relying solely on the intra-image prior knowledge learned from the limited known area; 2) the inter-image prior knowledge about the general distribution patterns of visual semantics, which can be transferred across images sharing similar semantics, is not exploited. In this paper, we propose the Generative Memory-Guided Semantic Reasoning Model (GM-SRM), which not only learns the intra-image priors from the known regions, but also distills the inter-image reasoning priors to infer the content of the corrupted regions. In particular, the proposed GM-SRM first pre-learns a generative memory from the whole training data to capture the semantic distribution patterns in a global view. Then the learned memory are leveraged to retrieve the matching inter-image priors for the current corrupted image to perform semantic reasoning during image inpainting. While the intra-image priors are used for guaranteeing the pixel-level content consistency, the inter-image priors are favorable for performing high-level semantic reasoning, which is particularly effective for inferring semantic content for large corrupted area. Extensive experiments on Paris Street View, CelebA-HQ, and Places2 benchmarks demonstrate that our GM-SRM outperforms the state-of-the-art methods for image inpainting in terms of both the visual quality and quantitative metrics.
翻译:多数现有图像绘制方法都侧重于从已知区域中学习当前输入图像的当前图像的图像内部偏好分布模式,以推断同一图像中腐败区域的内容。虽然这些方法在小腐败区域的图像中表现良好,但由于两个潜在的局限性,这些方法在处理大腐败区域的图像时具有挑战性:1)这类方法往往过分适合仅仅依靠从已知有限区域所学到的先前图像内部知识的每套图像;2)关于当前输入图像的当前图像图像中一般分布模式的预感的预感性知识没有被利用。虽然在本文中,这些方法在小腐败区域的图像中表现良好,但不仅学习了已知区域的图像内部图像前科,而且还在推断了腐败区域的内容。拟议的GM-SRM前期图像中,从整个培训数据中传递的可感知性推理学性记忆,以捕捉当前具有类似语义特征的图像中包含的语义分布模式,在之前的图像中,正在将我们所学到的内脏数据前的内存状态,在之前的图像中进行高级内置的内变。