In unsupervised domain adaptive (UDA) semantic segmentation, the distillation based methods are currently dominant in performance. However, the distillation technique requires complicate multi-stage process and many training tricks. In this paper, we propose a simple yet effective method that can achieve competitive performance to the advanced distillation methods. Our core idea is to fully explore the target-domain information from the views of boundaries and features. First, we propose a novel mix-up strategy to generate high-quality target-domain boundaries with ground-truth labels. Different from the source-domain boundaries in previous works, we select the high-confidence target-domain areas and then paste them to the source-domain images. Such a strategy can generate the object boundaries in target domain (edge of target-domain object areas) with the correct labels. Consequently, the boundary information of target domain can be effectively captured by learning on the mixed-up samples. Second, we design a multi-level contrastive loss to improve the representation of target-domain data, including pixel-level and prototype-level contrastive learning. By combining two proposed methods, more discriminative features can be extracted and hard object boundaries can be better addressed for the target domain. The experimental results on two commonly adopted benchmarks (\textit{i.e.}, GTA5 $\rightarrow$ Cityscapes and SYNTHIA $\rightarrow$ Cityscapes) show that our method achieves competitive performance to complicated distillation methods. Notably, for the SYNTHIA$\rightarrow$ Cityscapes scenario, our method achieves the state-of-the-art performance with $57.8\%$ mIoU and $64.6\%$ mIoU on 16 classes and 13 classes. Code is available at https://github.com/ljjcoder/EHTDI.
翻译:在未监督的域调制(UDA)语义分解中,蒸馏法目前在性能中占主导地位。然而,蒸馏技术需要复杂的多阶段过程和许多训练技巧。在本文件中,我们提出了一个简单而有效的方法,可以达到先进的蒸馏方法的竞争性性能。我们的核心想法是从边界和特性的角度充分探索目标域域信息。首先,我们提出一种新型的混合战略,用地平流标签生成高质量的目标域域域。不同于以往工作的源地平面界限,我们选择高信任目标域域,然后将它们粘贴到源地平面图像中。在目标域域域域(目标域距的顶端点)和正确的标签中,我们可以产生目标域域域域域域域域域域(目标值的顶端点)的边界信息可以通过混合样本的学习来有效捕捉到。第二,我们设计一个多层次的对比损失,用来改进目标域域数据的表述,包括平流值和原位的美元等值域域域域域域域域域。通过两种实验方法,可以更准确地测量G-ral的域域域域成绩。