Face sketch synthesis has made significant progress with the development of deep neural networks in these years. The delicate depiction of sketch portraits facilitates a wide range of applications like digital entertainment and law enforcement. However, accurate and realistic face sketch generation is still a challenging task due to the illumination variations and complex backgrounds in the real scenes. To tackle these challenges, we propose a novel Semantic-Driven Generative Adversarial Network (SDGAN) which embeds global structure-level style injection and local class-level knowledge re-weighting. Specifically, we conduct facial saliency detection on the input face photos to provide overall facial texture structure, which could be used as a global type of prior information. In addition, we exploit face parsing layouts as the semantic-level spatial prior to enforce globally structural style injection in the generator of SDGAN. Furthermore, to enhance the realistic effect of the details, we propose a novel Adaptive Re-weighting Loss (ARLoss) which dedicates to balance the contributions of different semantic classes. Experimentally, our extensive experiments on CUFS and CUFSF datasets show that our proposed algorithm achieves state-of-the-art performance.
翻译:近些年来,随着深层神经网络的发展,面部素描合成工作取得了显著进展。对素描肖像的微妙描绘促进了数字娱乐和执法等广泛应用。然而,由于真实场景中的光照变化和背景复杂,准确和现实的面部素描制作工作仍是一项艰巨的任务。为了应对这些挑战,我们提议建立一个新型的语义-多功能基因反转网络(SDGAN),它包含全球结构层次风格注射和当地等级知识重新加权。具体地说,我们在投入面部照片上进行面部显眼检测,以提供总体面部纹理结构,可以用作全球先前信息的类型。此外,我们利用面部面面部面面面部面面面面貌作为语义空间层次,然后在SDGAN生成器中实施全球结构风格注入。此外,为了提高细节的现实效果,我们提议了一个新的适应性再加权损失(ARLoss),专门平衡不同语系类的贡献。实验中,我们在CUFS和CUFSFSD数据库中进行的广泛实验,显示我们拟议的状态表现。