Large text-to-image models achieved a remarkable leap in the evolution of AI, enabling high-quality and diverse synthesis of images from a given text prompt. However, these models lack the ability to mimic the appearance of subjects in a given reference set and synthesize novel renditions of them in different contexts. In this work, we present a new approach for "personalization" of text-to-image diffusion models. Given as input just a few images of a subject, we fine-tune a pretrained text-to-image model such that it learns to bind a unique identifier with that specific subject. Once the subject is embedded in the output domain of the model, the unique identifier can be used to synthesize novel photorealistic images of the subject contextualized in different scenes. By leveraging the semantic prior embedded in the model with a new autogenous class-specific prior preservation loss, our technique enables synthesizing the subject in diverse scenes, poses, views and lighting conditions that do not appear in the reference images. We apply our technique to several previously-unassailable tasks, including subject recontextualization, text-guided view synthesis, and artistic rendering, all while preserving the subject's key features. We also provide a new dataset and evaluation protocol for this new task of subject-driven generation. Project page: https://dreambooth.github.io/
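The fine-tuning objective described above can be summarized as a standard diffusion denoising loss on the few subject images (captioned with the unique identifier) plus a weighted prior-preservation term computed on images that the frozen pretrained model generates for the bare class prompt. The sketch below illustrates this structure under stated assumptions: `Denoiser`, the toy noise schedule, and the random tensors are illustrative stand-ins, not the authors' implementation; a real setup would fine-tune a pretrained text-to-image diffusion model and use its actual noise schedule and text encoder.

```python
# Minimal sketch (assumed, not the paper's code) of DreamBooth-style
# fine-tuning with a class-specific prior-preservation loss.
import torch
import torch.nn as nn
import torch.nn.functional as F

class Denoiser(nn.Module):
    """Toy stand-in for a conditional diffusion denoising network."""
    def __init__(self, img_dim=64, cond_dim=16):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(img_dim + cond_dim + 1, 128), nn.ReLU(),
            nn.Linear(128, img_dim),
        )

    def forward(self, x_t, cond, t):
        return self.net(torch.cat([x_t, cond, t[:, None].float()], dim=-1))

def diffusion_loss(model, x0, cond, T=1000):
    """Epsilon-prediction loss at a random timestep (toy cosine schedule)."""
    t = torch.randint(0, T, (x0.shape[0],))
    alpha_bar = torch.cos(0.5 * torch.pi * t / T)[:, None] ** 2
    noise = torch.randn_like(x0)
    x_t = alpha_bar.sqrt() * x0 + (1 - alpha_bar).sqrt() * noise
    return F.mse_loss(model(x_t, cond, t), noise)

model = Denoiser()
opt = torch.optim.Adam(model.parameters(), lr=1e-4)
lambda_prior = 1.0  # weight of the prior-preservation term

for step in range(100):
    # Few-shot subject images, conditioned on the "a [V] <class>" prompt.
    x_subj, c_subj = torch.randn(4, 64), torch.randn(4, 16)
    # Samples the frozen pretrained model produced for the bare class
    # prompt ("a <class>"); supervising on them preserves the class prior.
    x_prior, c_prior = torch.randn(4, 64), torch.randn(4, 16)

    loss = diffusion_loss(model, x_subj, c_subj) \
         + lambda_prior * diffusion_loss(model, x_prior, c_prior)
    opt.zero_grad()
    loss.backward()
    opt.step()
```

Keeping the prior term on the model's own class samples (hence "autogenous") is what lets the fine-tuned model bind the identifier to the subject without drifting away from the broader class prior.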