Semantic image editing uses local semantic label maps to generate the desired content in the edited region. A recent work borrows the SPADE block to achieve semantic image editing. However, it cannot produce pleasing results because of the style discrepancy between the edited region and the surrounding pixels. We attribute this to the fact that SPADE uses only an image-independent local semantic layout and ignores the image-specific styles contained in the known pixels. To address this issue, we propose a style-preserved modulation (SPM) comprising two modulation processes: the first modulation incorporates the contextual style and the semantic layout, and then generates two fused modulation parameters; the second modulation employs the fused parameters to modulate the feature maps. With these two modulations, SPM can inject the given semantic layout while preserving the image-specific context style. Moreover, we design a progressive architecture that generates the edited content in a coarse-to-fine manner. The proposed method obtains context-consistent results and significantly alleviates the unpleasant boundary between the generated regions and the known pixels.
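The two-step modulation described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract gives no layer details, so the 1x1 projections, the instance normalization, the `(1 + gamma)` scaling form, the random weights, and every name here (`StylePreservedModulation`, `fuse`, `conv1x1`, `instance_norm`) are assumptions chosen only to show how a first modulation might fuse the context style with the semantic layout into two parameters, and how a second modulation might apply them to a feature map.

```python
import numpy as np


def instance_norm(x, eps=1e-5):
    """Normalize each channel of x (C, H, W) over its spatial dimensions."""
    mean = x.mean(axis=(1, 2), keepdims=True)
    var = x.var(axis=(1, 2), keepdims=True)
    return (x - mean) / np.sqrt(var + eps)


def conv1x1(x, weight, bias):
    """Per-pixel linear projection: x (C_in, H, W) -> (C_out, H, W)."""
    return np.einsum("oc,chw->ohw", weight, x) + bias[:, None, None]


class StylePreservedModulation:
    """Hypothetical two-step modulation; all layer choices are assumptions."""

    def __init__(self, n_labels, n_style, n_feat, seed=0):
        rng = np.random.default_rng(seed)
        # Layout -> per-pixel scale/shift applied to the context style features.
        self.w_layout = rng.normal(0.0, 0.1, (2 * n_style, n_labels))
        self.b_layout = np.zeros(2 * n_style)
        # Fused style -> the two fused modulation parameters (gamma, beta).
        self.w_fused = rng.normal(0.0, 0.1, (2 * n_feat, n_style))
        self.b_fused = np.zeros(2 * n_feat)

    def fuse(self, layout, style):
        """First modulation: combine layout and context style into gamma, beta."""
        scale, shift = np.split(conv1x1(layout, self.w_layout, self.b_layout), 2, axis=0)
        fused = style * (1.0 + scale) + shift
        gamma, beta = np.split(conv1x1(fused, self.w_fused, self.b_fused), 2, axis=0)
        return gamma, beta

    def __call__(self, feat, layout, style):
        """Second modulation: apply the fused parameters to the feature map."""
        gamma, beta = self.fuse(layout, style)
        return instance_norm(feat) * (1.0 + gamma) + beta
```

A call such as `StylePreservedModulation(n_labels=4, n_style=8, n_feat=16)(feat, layout, style)` expects a one-hot layout of shape `(4, H, W)`, context style features of shape `(8, H, W)`, and a feature map of shape `(16, H, W)`, and returns a modulated feature map of the same shape as `feat`. In a progressive, coarse-to-fine setup, one such block would presumably be applied at each resolution, although the abstract does not specify this.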