Image cartoonization is recently dominated by generative adversarial networks (GANs) from the perspective of unsupervised image-to-image translation, in which an inherent challenge is to precisely capture and sufficiently transfer characteristic cartoon styles (e.g., clear edges, smooth color shading, abstract fine structures, etc.). Existing advanced models try to enhance cartoonization effect by learning to promote edges adversarially, introducing style transfer loss, or learning to align style from multiple representation space. This paper demonstrates that more distinct and vivid cartoonization effect could be easily achieved with only basic adversarial loss. Observing that cartoon style is more evident in cartoon-texture-salient local image regions, we build a region-level adversarial learning branch in parallel with the normal image-level one, which constrains adversarial learning on cartoon-texture-salient local patches for better perceiving and transferring cartoon texture features. To this end, a novel cartoon-texture-saliency-sampler (CTSS) module is proposed to dynamically sample cartoon-texture-salient patches from training data. With extensive experiments, we demonstrate that texture saliency adaptive attention in adversarial learning, as a missing ingredient of related methods in image cartoonization, is of significant importance in facilitating and enhancing image cartoon stylization, especially for high-resolution input pictures.
翻译:最近,从未经监督的图像到图像翻译的角度,图像图像的变异式网络(GANs)最近主导了图像漫画的演化。 从不受监督的图像到图像翻译(GANs)的角度看,一个固有的挑战是精确捕捉和充分转移典型的漫画风格(例如,清晰边缘、光色阴影、抽象的精细结构等)。现有的先进模型试图通过学习促进边缘对立、引入风格转移损失或学习多代表空间的风格来增强漫画的演化效果。本文表明,只有基本的对抗性损失,才能更容易实现更独特和生动的漫画效果。观察,漫画风格在具有高度适应性的地方图像区域形象区域中更加明显,我们与普通的图像级别一平行地建立一个区域级的对抗性学习分支,这制约了对漫画-文字适应性地方补丁的学习,以便更好地了解和转移漫画纹理特征。为此,一个新的漫画-纹理-感官(CTSS)模块模块被推荐为动态模样的漫画-文字适应性补版,从培训数据中更加明显,我们在高调的图像的图像输入中,我们展示了一种重要的感官能的感官化方法,在高调性图像上学习中,我们展示了与感官的感官的感官的感官成像学的感。