Data-driven based approaches, in spite of great success in many tasks, have poor generalization when applied to unseen image domains, and require expensive cost of annotation especially for dense pixel prediction tasks such as semantic segmentation. Recently, both unsupervised domain adaptation (UDA) from large amounts of synthetic data and semi-supervised learning (SSL) with small set of labeled data have been studied to alleviate this issue. However, there is still a large gap on performance compared to their supervised counterparts. We focus on a more practical setting of semi-supervised domain adaptation (SSDA) where both a small set of labeled target data and large amounts of labeled source data are available. To address the task of SSDA, a novel framework based on dual-level domain mixing is proposed. The proposed framework consists of three stages. First, two kinds of data mixing methods are proposed to reduce domain gap in both region-level and sample-level respectively. We can obtain two complementary domain-mixed teachers based on dual-level mixed data from holistic and partial views respectively. Then, a student model is learned by distilling knowledge from these two teachers. Finally, pseudo labels of unlabeled data are generated in a self-training manner for another few rounds of teachers training. Extensive experimental results have demonstrated the effectiveness of our proposed framework on synthetic-to-real semantic segmentation benchmarks.
翻译:尽管在许多任务中取得了巨大成功,但以数据为驱动的方法,尽管在许多任务中取得了巨大成功,但在应用到看不见的图像领域时,没有很好地概括化,并且需要昂贵的批注费用,特别是用于密集的像素预测任务,如语义分割等。最近,大量合成数据和带有少量标签数据、半监督学习(SSL)的无监督域适应(UDA)最近,对大量合成数据和半监督学习(SSL)进行了三个阶段的研究,以缓解这一问题。然而,与受监督的对应方相比,在业绩方面仍然存在很大的差距。我们侧重于在半监督域适应(SSDA)方面更切合实际的设置。我们侧重于一套小型的标签目标数据和大量标签源数据。为了应对SCDA的任务,提出了一个基于双层域混合的新框架。拟议中的两种数据混合方法旨在缩小区域层面和抽样层面的域际差距。我们可以分别从整体和局部观点获得两个层次混合数据的基础上获得两个互补的域际混合教师。然后,通过对两轮实验性教师的模拟培训结果进行模拟的模拟培训来学习学生模型。最后,通过模拟培训的模拟培训,将多少个实验性框架的模型用于模拟教师的模拟学习。