Recently, image-to-image translation has made significant progress in achieving both multi-label (\ie, translation conditioned on different labels) and multi-style (\ie, generation with diverse styles) tasks. However, due to the unexplored independence and exclusiveness in the labels, existing endeavors are defeated by involving uncontrolled manipulations to the translation results. In this paper, we propose Hierarchical Style Disentanglement (HiSD) to address this issue. Specifically, we organize the labels into a hierarchical tree structure, in which independent tags, exclusive attributes, and disentangled styles are allocated from top to bottom. Correspondingly, a new translation process is designed to adapt the above structure, in which the styles are identified for controllable translations. Both qualitative and quantitative results on the CelebA-HQ dataset verify the ability of the proposed HiSD. We hope our method will serve as a solid baseline and provide fresh insights with the hierarchically organized annotations for future research in image-to-image translation. The code has been released at https://github.com/imlixinyang/HiSD.
翻译:最近,图像到图像翻译在实现多标签任务(\\\\\\\\根据不同标签的翻译条件)和多样式(\\\\\\\\\\\\\以不同风格生成)任务方面取得了重大进展。 但是,由于标签中未探索的独立性和独一性,现有的努力因对翻译结果进行不受控制的操纵而失败。在本文件中,我们提议将等级风格分解(HISD)来解决这个问题。具体地说,我们将标签组织成一个从上到下分配独立标签、独家属性和分解样式的分级树结构。相应的是,设计一个新的翻译过程来调整上述结构,确定可控翻译的样式。CelebA-HQ数据集的定性和定量结果都验证了拟议的 HISD的能力。我们希望我们的方法能成为坚实的基线,并为今后图像到image翻译的研究提供分级化说明的新见解。代码已在 https://github.com/imlixyang/hiSDD中发布。