In applications such as optical see-through and projector augmented reality, producing images amounts to solving non-negative image generation, where one can only add light to an existing image. Most image generation methods, however, are ill-suited to this problem setting, as they make the assumption that one can assign arbitrary color to each pixel. In fact, naive application of existing methods fails even in simple domains such as MNIST digits, since one cannot create darker pixels by adding light. We know, however, that the human visual system can be fooled by optical illusions involving certain spatial configurations of brightness and contrast. Our key insight is that one can leverage this behavior to produce high quality images with negligible artifacts. For example, we can create the illusion of darker patches by brightening surrounding pixels. We propose a novel optimization procedure to produce images that satisfy both semantic and non-negativity constraints. Our approach can incorporate existing state-of-the-art methods, and exhibits strong performance in a variety of tasks including image-to-image translation and style transfer.
翻译:在光学透视和投影增强现实等应用中,制作图像等于解决非消极图像生成,只能给现有图像增加光线。但是,大多数图像生成方法都不适合这一问题设置,因为它们假设每个像素可以任意指定颜色。事实上,即使像像素这样的简单领域,如MNIST数字,现有方法的天真应用也失败,因为增加光不能创造更暗的像素。然而,我们知道,人类视觉系统可能被包含某些亮度和对比的空间配置的光学幻觉所蒙骗。我们的关键洞察力是,人们可以利用这种行为来用可忽略的文物生成高质量的图像。例如,我们可以通过周围像素亮亮亮的像素来创造更暗的错觉。我们提出一个新颖的优化程序来产生既满足语义性又不透明性约束的图像。我们的方法可以将现有的最先进的方法纳入到各种任务中,包括图像到图像翻译和风格转换。