学习用于图像修饰的多种调调样式 (Learning Diverse Tone Styles for Image Retouching)

Image retouching, aiming to regenerate the visually pleasing renditions of given images, is a subjective task where the users are with different aesthetic sensations. Most existing methods deploy a deterministic model to learn the retouching style from a specific expert, making it less flexible to meet diverse subjective preferences. Besides, the intrinsic diversity of an expert due to the targeted processing on different images is also deficiently described. To circumvent such issues, we propose to learn diverse image retouching with normalizing flow-based architectures. Unlike current flow-based methods which directly generate the output image, we argue that learning in a style domain could (i) disentangle the retouching styles from the image content, (ii) lead to a stable style presentation form, and (iii) avoid the spatial disharmony effects. For obtaining meaningful image tone style representations, a joint-training pipeline is delicately designed, which is composed of a style encoder, a conditional RetouchNet, and the image tone style normalizing flow (TSFlow) module. In particular, the style encoder predicts the target style representation of an input image, which serves as the conditional information in the RetouchNet for retouching, while the TSFlow maps the style representation vector into a Gaussian distribution in the forward pass. After training, the TSFlow can generate diverse image tone style vectors by sampling from the Gaussian distribution. Extensive experiments on MIT-Adobe FiveK and PPR10K datasets show that our proposed method performs favorably against state-of-the-art methods and is effective in generating diverse results to satisfy different human aesthetic preferences. Source code and pre-trained models are publicly available at https://github.com/SSRHeart/TSFlow.

翻译：图像重新触摸, 目的是对给定图像进行视觉上令人愉快的转换, 这是一种主观的任务, 用户在这种任务中有着不同的审美感。多数现有方法都使用一种确定型模式, 以便从特定专家那里学习感知感应风格, 使其更不灵活地适应不同的主观偏好。此外, 以不同图像进行定向处理的专家的内在多样性也不够描述。为了绕过这些问题, 我们提议学习多种图像, 用正常流基结构来重新抚摸。与直接生成输出图像的当前流基方法不同, 我们认为, 在一个样式域中学习可以( i) 将感应式风格的样式与图像内容分解开, (ii) 导致一个稳定的样式演示表格式, (iii) 避免空间上的不协调效应。为了获得有意义的图像色调表达方式, 联合培训管道的设计很微妙, 由风格编码、有条件的 Retouch 网络和图像调控流流( TSF) 模块, 特别, 将目标样式预示着目标样式的Slent- tremal- develyal- lial develop sloveal press 演示图, 将显示一个驱动式的图像在Tre to the tremvidudududududududustral slatedal viductions