Inversion-Based Style Transfer with Diffusion Models (Inversion-Based Style Transfer with Diffusion Models)

The artistic style within a painting is the means of expression, which includes not only the painting material, colors, and brushstrokes, but also the high-level attributes including semantic elements, object shapes, etc. Previous arbitrary example-guided artistic image generation methods often fail to control shape changes or convey elements. The pre-trained text-to-image synthesis diffusion probabilistic models have achieved remarkable quality, but it often requires extensive textual descriptions to accurately portray attributes of a particular painting. We believe that the uniqueness of an artwork lies precisely in the fact that it cannot be adequately explained with normal language. Our key idea is to learn artistic style directly from a single painting and then guide the synthesis without providing complex textual descriptions. Specifically, we assume style as a learnable textual description of a painting. We propose an inversion-based style transfer method (InST), which can efficiently and accurately learn the key information of an image, thus capturing and transferring the artistic style of a painting. We demonstrate the quality and efficiency of our method on numerous paintings of various artists and styles. Code and models are available at https://github.com/zyxElsa/InST.

翻译：---- 逆向扩散模型下的风格转移摘要：绘画中的艺术风格是表达的手段，不仅包括绘画材料、色彩和笔触，还包括高级属性，包括语义元素、对象形状等。以往的任意示例指导的艺术图像生成方法往往无法控制形状变化或传达元素。预训练的文本到图像生成扩散概率模型取得了显著的质量，但通常需要详细的文本描述才能准确描述特定绘画的属性。我们认为，艺术品的独特之处恰恰在于它不能用正常语言充分解释。我们的核心思想是直接从单幅画中学习艺术风格，然后在不提供复杂文本描述的情况下指导合成。具体而言，我们将风格视为绘画的可学习文本描述。我们提出了一种逆向风格转移方法（InST），可以高效精确地学习图像的关键信息，从而捕捉和转移绘画的艺术风格。我们在各种艺术家和风格的众多绘画上展示了我们的方法的质量和效率。代码和模型可在 https://github.com/zyxElsa/InST 找到。

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【CVPR 2022】可控图像合成与编辑的合成生成先验学习，SemanticStyleGAN: Learning Compositonal Generative Priors for Controllable Image Synthesis and Editing

专知会员服务

23+阅读 · 2022年3月3日

康奈尔大学「深度概率与生成模型」2021SP课程

专知会员服务

49+阅读 · 2021年4月24日

【ICLR 2019】双曲注意力网络，Hyperbolic Attention Network

专知会员服务

84+阅读 · 2020年6月21日

【SIGIR2020】学习搜索查询的颜色表示，Learning Colour Representations of Search Queries

专知会员服务

17+阅读 · 2020年6月18日