Using only a model that was trained to predict where people look at images, and no additional training data, we can produce a range of powerful editing effects for reducing distraction in images. Given an image and a mask specifying the region to edit, we backpropagate through a state-of-the-art saliency model to parameterize a differentiable editing operator, such that the saliency within the masked region is reduced. We demonstrate several operators, including: a recoloring operator, which learns to apply a color transform that camouflages and blends distractors into their surroundings; a warping operator, which warps less salient image regions to cover distractors, gradually collapsing objects into themselves and effectively removing them (an effect akin to inpainting); a GAN operator, which uses a semantic prior to fully replace image regions with plausible, less salient alternatives. The resulting effects are consistent with cognitive research on the human visual system (e.g., since color mismatch is salient, the recoloring operator learns to harmonize objects' colors with their surrounding to reduce their saliency), and, importantly, are all achieved solely through the guidance of the pretrained saliency model, with no additional supervision. We present results on a variety of natural images and conduct a perceptual study to evaluate and validate the changes in viewers' eye-gaze between the original images and our edited results.
翻译:我们只使用一个经过训练的模型来预测人们看图像的位置,而没有额外的培训数据,我们就可以产生一系列强大的编辑效应来减少图像中的分心。一个图像和面罩可以指定编辑的区域。一个图像和面罩可以指定要编辑的区域,我们通过一个最先进的突出模型进行反演,以参数化一个不同的编辑操作员,这样可以降低遮盖区域内的显著特征。我们展示了几个操作员,包括:一个再色操作员,学会应用颜色变换,迷彩和将分散器混入周围的颜色;一个扭曲操作员,它将不那么突出的图像区域扭曲,以覆盖分散器,逐渐将物体排入到自己体内,并有效地去除它们(一种类似于油漆的效应);一个GAN操作员,在完全取代图像区域之前使用语义化,以貌似、不太突出的替代品取代不同的编辑操作员。由此产生的效果与对人类视觉系统的认知研究是一致的(例如,由于色彩变色明显,再颜色操作员学会将物体的颜色颜色与周围的颜色相协调,以减少其突出的特性),而且重要的是,在原始的图像的观察结果之间, 全部都是通过我们对前的观察前的观察结果的判断和观察结果。