Photorealistic stylization aims to transfer the style of a reference photo onto a content photo in a natural fashion, such that the stylized image looks like a real photo taken by a camera. State-of-the-art methods stylize the image locally within each matched semantic region and are prone to global color inconsistency across semantic objects/parts, making the stylized image less photorealistic. To tackle the challenging issues, we propose a non-local representation scheme, constrained with a mutual affine-transfer network (NL-MAT). Through a dictionary-based decomposition, NL-MAT is able to successfully decouple matched non-local representations and color information of the image pair, such that the context correspondence between the image pair is incorporated naturally, which largely facilitates local style transfer in a global-consistent fashion. To the best of our knowledge, this is the first attempt to address the photorealistic stylization problem with a non-local representation scheme, such that no additional models or steps for semantic matching are required during stylization. Experimental results demonstrate that the proposed method is able to generate photorealistic results with local style transfer while preserving both the spatial structure and global color consistency of the content image.
翻译:相片现实化的目的是自然地将参考照片的风格转换到内容照片上,使标准化图像看上去像相机拍摄的真照片。 最先进的方法将每个相匹配的语义区域中的图像在本地进行同步,并容易在语义对象/片段之间出现全球色彩不一致,使标准化图像不那么光真化。 为了解决具有挑战性的问题,我们提议采用非本地代表制,同时受相互通缩网络的限制(NL-MAT ) 。 通过基于字典的分解,NL-MAT 能够成功地脱钩匹配非本地的图像表象和彩色信息, 从而将图像配对之间的背景对应自然地整合起来, 这在很大程度上有利于以全球一致的方式进行本地风格的转换。 据我们所知,这是第一次尝试用非本地代表制解决光真化问题, 因此在基于字典的分解法化期间,不需要额外的语义匹配模型或步骤。 实验性结果显示, 既保持了当地图像格式,也能够保持当地图像格式的传输方式。