Deep neural networks (DNNs) have greatly contributed to the performance gains in semantic segmentation. Nevertheless, training DNNs generally requires large amounts of pixel-level labeled data, which is expensive and time-consuming to collect in practice. To mitigate the annotation burden, this paper proposes a self-ensembling generative adversarial network (SE-GAN) exploiting cross-domain data for semantic segmentation. In SE-GAN, a teacher network and a student network constitute a self-ensembling model for generating semantic segmentation maps, which together with a discriminator, forms a GAN. Despite its simplicity, we find SE-GAN can significantly boost the performance of adversarial training and enhance the stability of the model, the latter of which is a common barrier shared by most adversarial training-based methods. We theoretically analyze SE-GAN and provide an $\mathcal O(1/\sqrt{N})$ generalization bound ($N$ is the training sample size), which suggests controlling the discriminator's hypothesis complexity to enhance the generalizability. Accordingly, we choose a simple network as the discriminator. Extensive and systematic experiments in two standard settings demonstrate that the proposed method significantly outperforms current state-of-the-art approaches. The source code of our model is available online (https://github.com/YonghaoXu/SE-GAN).
翻译:深心神经网络(DNNS)极大地促进了语义分割的性能收益,然而,培训DNNS通常需要大量像素级标签数据,而这些数据既昂贵又费时,在实际操作中收集。为减轻批注负担,本文件建议建立一个自我集合的基因对抗网络(SE-GAN),利用跨界域数据进行语义分割。在SE-GAN,一个教师网络和一个学生网络构成了生成语义分割图的自我聚合模型(N$是培训样本大小),这表明要与歧视者一起控制歧视者的假设复杂性,从而形成GAN。尽管数据简单易行,但我们发现SE-GAN能够大大地提高对抗性培训的绩效,并增强模型的稳定性,后者是大多数以对抗性培训为基础的方法共有的一个共同障碍。我们从理论上分析SE-GAN网络并提供$mathcal O(1/ sqrt{N}xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx),这表明控制歧视者假设假设的假设复杂性,以加强当前的常规实验。因此,我们选择了一种简单的网络,以系统化的系统模式。我们现有的系统化方法。我们提出了一种系统化的系统化的系统化的系统化的系统化的系统化的系统化的系统化的系统化的系统化方法。