In Active Domain Adaptation (ADA), one uses Active Learning (AL) to select a subset of images from the target domain, which are then annotated and used for supervised domain adaptation (DA). Given the large performance gap between supervised and unsupervised DA techniques, ADA allows for an excellent trade-off between annotation cost and performance. Prior art makes use of measures of uncertainty or disagreement of models to identify `regions' to be annotated by the human oracle. However, these regions frequently comprise of pixels at object boundaries which are hard and tedious to annotate. Hence, even if the fraction of image pixels annotated reduces, the overall annotation time and the resulting cost still remain high. In this work, we propose an ADA strategy, which given a frame, identifies a set of classes that are hardest for the model to predict accurately, thereby recommending semantically meaningful regions to be annotated in a selected frame. We show that these set of `hard' classes are context-dependent and typically vary across frames, and when annotated help the model generalize better. We propose two ADA techniques: the Anchor-based and Augmentation-based approaches to select complementary and diverse regions in the context of the current training set. Our approach achieves 66.6 mIoU on GTA to Cityscapes dataset with an annotation budget of 4.7% in comparison to 64.9 mIoU by MADA using 5% of annotations. Our technique can also be used as a decorator for any existing frame-based AL technique, e.g., we report 1.5% performance improvement for CDAL on Cityscapes using our approach.
翻译:在主动域适应(ADA)中,人们使用积极学习(AL)从目标域中选择一组图像,这些图像随后被附加说明,并用于监管域适应(DA)。鉴于受监督和不受监督的DA技术之间的业绩差距很大,ADA允许在批注成本和性能之间实现极佳的权衡。先前的艺术采用不确定或不同模型的计量方法,以确定“区域”应由人类触角加以附加说明。然而,这些区域往往包括目标边界上的像素,这些像素在目标边界上比较困难,对说明性框架有争议。因此,即使附加说明的图像像素部分减少,总体注解时间和由此产生的成本仍然很高。在这项工作中,我们提出了一个ADA战略,根据一个框架,确定了一套模型最难准确预测的班,从而建议具有语义意义的区域在选定的框架中加注解。我们显示,这些“硬性”类是基于背景的,通常以不同框架为基础,而且当附加说明有助于模型的通用。我们提议在目前预算范围内采用两种ADADA技术,即我们采用“ADA技术”的升级方法。我们用了一种技术,即采用“O-ASG-ADADADADADADA方法,在目前的数据区域中采用一种技术,采用一种方法。我们采用“O-AG-ADADAG-AG-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-RO-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-A-