Given an input face photo, the goal of caricature generation is to produce stylized, exaggerated caricatures that share the same identity as the photo. It requires simultaneous style transfer and shape exaggeration with rich diversity, and meanwhile preserving the identity of the input. To address this challenging problem, we propose a novel framework called Multi-Warping GAN (MW-GAN), including a style network and a geometric network that are designed to conduct style transfer and geometric exaggeration respectively. We bridge the gap between the style and landmarks of an image with corresponding latent code spaces by a dual way design, so as to generate caricatures with arbitrary styles and geometric exaggeration, which can be specified either through random sampling of latent code or from a given caricature sample. Besides, we apply identity preserving loss to both image space and landmark space, leading to a great improvement in quality of generated caricatures. Experiments show that caricatures generated by MW-GAN have better quality than existing methods.
翻译:根据一张输入面部照片,漫画生成的目的是生成与照片具有相同特性的星系化、夸大式漫画。它需要同时进行风格传输和形状夸大,同时要保存输入的特性。为了解决这个具有挑战性的问题,我们提议了一个名为多警告GAN(MW-GAN)的新框架,包括一个风格网络和一个几何网络,分别设计用来进行风格传输和几何夸大。我们通过双轨方式设计来弥合图像样式和标志与相应的潜在代码空间之间的差距,以便生成带有任意风格和几何式夸大特征的漫画,可以通过随机抽样采集潜在代码或某个特定图形样本加以具体说明。此外,我们还将身份保护损失应用到图像空间和里程碑空间,导致生成的漫画质量大幅提高。实验显示,MW-GAN产生的漫画质量高于现有方法。