In recent years, conditional image synthesis has attracted growing attention due to its controllability in the image generation process. Although recent works have achieved realistic results, most of them fail to handle fine-grained styles with subtle details. To address this problem, a novel normalization module, named DRAN, is proposed. It learns fine-grained style representation, while maintaining the robustness to general styles. Specifically, we first introduce a multi-level structure, Spatiality-Aware Pyramid Pooling, to guide the model to learn coarse-to-fine features. Then, to adaptively fuse different levels of styles, we propose Dynamic Gating, making it possible to choose different styles according to different spatial regions. To evaluate the effectiveness and generalization ability of DRAN, we conduct a set of experiments on makeup transfer and semantic image synthesis. Quantitative and qualitative experiments show that equipped with DRAN, the baseline models are able to achieve significant improvement in complex style transfer and texture details reconstruction.
翻译:近年来,有条件的图像合成因其在图像生成过程中的可控性而引起越来越多的关注。虽然最近的工作已经取得了现实的结果,但大多数都未能处理细细细细细的风格。为了解决这个问题,提出了名为DRAN的新的正常化模块。它学习了细细细的风格代表,同时保持了通样式的稳健性。具体地说,我们首先引入了一个多层次的结构,即空间性软件软件软件库,以指导模型学习粗体到软体特征。然后,为了适应性地结合不同样式,我们建议了动态定位,以便能够根据不同的空间区域选择不同样式。为了评估DRAN的有效性和一般化能力,我们进行了一套关于化妆性传输和语义图像合成的实验。定量和定性实验显示,配有DRAN软件的基线模型能够在复杂风格传输和文字细节重建方面实现重大改进。