With recent advances in image-to-image translation tasks, remarkable progress has been witnessed in generating face images from sketches. However, existing methods frequently fail to generate images with details that are semantically and geometrically consistent with the input sketch, especially when various decoration strokes are drawn. To address this issue, we introduce a novel W-W+ encoder architecture to take advantage of the high expressive power of W+ space and semantic controllability of W space. We introduce an explicit intermediate representation for sketch semantic embedding. With a semantic feature matching loss for effective semantic supervision, our sketch embedding precisely conveys the semantics in the input sketches to the synthesized images. Moreover, a novel sketch semantic interpretation approach is designed to automatically extract semantics from vectorized sketches. We conduct extensive experiments on both synthesized sketches and hand-drawn sketches, and the results demonstrate the superiority of our method over existing approaches on both semantics-preserving and generalization ability.
翻译:随着图像到图像翻译任务的最新进展,在从草图生成面部图像方面取得了显著进展;然而,现有方法往往未能产生与输入草图相一致的语义和几何性详细图像,特别是在绘制各种装饰划线时。为解决这一问题,我们引入了一个新的W-W+编码器结构,以利用W+空间和W空间的语义可控性的高度表达力和W空间的语义可控性。我们为草图的语义嵌入引入了明确的中间表示器。用语义特征匹配损失以进行有效的语义监督,我们的草图将语义精确地传递到合成图像输入草图中的语义。此外,我们设计了一个新的草图语义解释方法,以自动从矢量的草图中提取语义学。我们对合成的草图和手写草图进行了广泛的实验,结果表明我们的方法优于现有的语义保存和概括能力方法。