While unsupervised domain translation (UDT) has seen a lot of success recently, we argue that mediating its translation via categorical semantic features could broaden its applicability. In particular, we demonstrate that categorical semantics improves the translation between perceptually different domains sharing multiple object categories. We propose a method to learn, in an unsupervised manner, categorical semantic features (such as object labels) that are invariant of the source and target domains. We show that conditioning the style encoder of unsupervised domain translation methods on the learned categorical semantics leads to a translation preserving the digits on MNIST$\leftrightarrow$SVHN and to a more realistic stylization on Sketches$\to$Reals.
翻译:虽然未受监督的域名翻译(UDT)最近取得了许多成功,但我们争辩说,通过直截了当的语义特征来调解翻译,可以扩大其适用性。特别是,我们证明,直截了当的语义可以改善不同概念域之间共享多个对象类别的翻译。我们建议了一种方法,以不受监督的方式学习源域和目标域中不可变的绝对语义特征(如对象标签)。我们表明,在学过直截断语的域名翻译中设置了不受监督域名翻译的风格编码器,可以导致翻译保存MNIST$\leftright$SVHN的数字,并在Sketcheche$\to$Reals上实现更现实的语义化。