Recently, unsupervised learning has made impressive progress on various tasks. Despite the dominance of discriminative models, increasing attention is drawn to representations learned by generative models and in particular, Generative Adversarial Networks (GANs). Previous works on the interpretation of GANs reveal that GANs encode semantics in feature maps in a linearly separable form. In this work, we further find that GAN's features can be well clustered with the linear separability assumption. We propose a novel clustering algorithm, named KLiSH, which leverages the linear separability to cluster GAN's features. KLiSH succeeds in extracting fine-grained semantics of GANs trained on datasets of various objects, e.g., car, portrait, animals, and so on. With KLiSH, we can sample images from GANs along with their segmentation masks and synthesize paired image-segmentation datasets. Using the synthesized datasets, we enable two downstream applications. First, we train semantic segmentation networks on these datasets and test them on real images, realizing unsupervised semantic segmentation. Second, we train image-to-image translation networks on the synthesized datasets, enabling semantic-conditional image synthesis without human annotations.
翻译:最近,未经监督的学习在各种任务上取得了令人印象深刻的进展。尽管歧视模式占主导地位,但人们越来越注意通过基因模型,特别是Generational Aversarial Networks(GANs)所学的表征。以前关于GANs的解释工作显示,GANs以线性分解形式将语义解成地貌地图。在这项工作中,我们进一步发现GAN的特征可以与线性分离假设很好地组合在一起。我们提议了一个叫KLiSH的新型群集算法,它将线性分离作为GAN特性的杠杆。KLiSH成功地提取了GANs精细微的语义结构。首先,我们训练了GANs关于各种物体数据集(例如汽车、肖像、动物等等)的精细精细的语义结构。在KLiSH工作中,我们可以将GANs的图像与它们的分解面面面面面罩和合成的配对式图像拼图解的数据集进行取样。我们用合成数据集使两个下的应用得以进行。首先,我们训练了两个下游应用。我们训练了Smanticretical-sucial comimdeal commact commact commact commactalation sution sution wealction sution subild subilation subalup subild