DifffaceSketch:高纤维面相图像合成,配有 Strach-Guided 延迟传播模型</s> (DiffFaceSketch: High-Fidelity Face Image Synthesis with Sketch-Guided Latent Diffusion Model)

Synthesizing face images from monochrome sketches is one of the most fundamental tasks in the field of image-to-image translation. However, it is still challenging to (1)~make models learn the high-dimensional face features such as geometry and color, and (2)~take into account the characteristics of input sketches. Existing methods often use sketches as indirect inputs (or as auxiliary inputs) to guide the models, resulting in the loss of sketch features or the alteration of geometry information. In this paper, we introduce a Sketch-Guided Latent Diffusion Model (SGLDM), an LDM-based network architect trained on the paired sketch-face dataset. We apply a Multi-Auto-Encoder (AE) to encode the different input sketches from different regions of a face from pixel space to a feature map in latent space, which enables us to reduce the dimension of the sketch input while preserving the geometry-related information of local face details. We build a sketch-face paired dataset based on the existing method that extracts the edge map from an image. We then introduce a Stochastic Region Abstraction (SRA), an approach to augment our dataset to improve the robustness of SGLDM to handle sketch input with arbitrary abstraction. The evaluation study shows that SGLDM can synthesize high-quality face images with different expressions, facial accessories, and hairstyles from various sketches with different abstraction levels.

翻译：从单色草图中合成面容图像是图像到图像翻译领域最根本的任务之一。然而,对于(1) 建立模型以学习高维面貌特征,例如几何和颜色,以及(2) 考虑输入草图的特征,仍然具有挑战性。现有方法经常使用草图作为间接投入(或辅助投入)来指导模型,从而导致草图特征的丧失或几何信息的变化。在本文中,我们引入了一个基于图像到图像化成图像领域的基于LDM的网络设计师,即基于LDM的网络设计师。我们采用了多自动- Encoder(AE)来将来自面部不同区域的不同输入草图进行编码,从像素空间到潜在空间的地貌图图,这使我们能够减少草图投入的维度,同时保存与当地面貌细节相关的地理信息信息。我们根据从图像上提取边缘地图的现有方法,建立了一个以LDMDM为主的网络设计师。我们随后采用了一个多层次的直径直径直径直径,用Sto-Ecoder-Degraphal deal commadial shal shal shal Shamagraphal 研究,我们用一个高的Sq Somal-graphal-graphal-graphal-graphal-graphal-graphal-graphyal-graphal-graphal-graphal 研究,我们用Sho化了一种高可增加了一种高的Sq-graphal-Sq-Sq-Sq-Sqsmal-graphyal-Sqsmal-graphal-graphyal-graphal-graphal-graphal-graphal-graphal-graphal-graphal-graphal-graphal-graphal-SG-SD-Sq-Sq-SG-SG-SG-SG-SG-Sq-Sq-Sq-Sq-SG-graphal-graphal-SG-SG-SG-SG-SG-SG-SD-SG-SG-SG-SG-SG-SG-SG-SG-SG-S</s>

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

NeurlPS 2022 | 自然语言处理相关论文分类整理

专知会员服务

51+阅读 · 2022年10月2日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

专知会员服务

50+阅读 · 2020年2月26日