Given multiple labeled source domains and a single target domain, most existing multi-source domain adaptation (MSDA) models are trained on data from all domains jointly in one step. Such an one-step approach limits their ability to adapt to the target domain. This is because the training set is dominated by the more numerous and labeled source domain data. The source-domain-bias can potentially be alleviated by introducing a second training step, where the model is fine-tuned with the unlabeled target domain data only using pseudo labels as supervision. However, the pseudo labels are inevitably noisy and when used unchecked can negatively impact the model performance. To address this problem, we propose a novel Bi-level Optimization based Robust Target Training (BORT$^2$) method for MSDA. Given any existing fully-trained one-step MSDA model, BORT$^2$ turns it to a labeling function to generate pseudo-labels for the target data and trains a target model using pseudo-labeled target data only. Crucially, the target model is a stochastic CNN which is designed to be intrinsically robust against label noise generated by the labeling function. Such a stochastic CNN models each target instance feature as a Gaussian distribution with an entropy maximization regularizer deployed to measure the label uncertainty, which is further exploited to alleviate the negative impact of noisy pseudo labels. Training the labeling function and the target model poses a nested bi-level optimization problem, for which we formulate an elegant solution based on implicit differentiation. Extensive experiments demonstrate that our proposed method achieves the state of the art performance on three MSDA benchmarks, including the large-scale DomainNet dataset. Our code will be available at \url{https://github.com/Zhongying-Deng/BORT2}
翻译:鉴于多标签源域和单一目标域,大多数现有的多源域适应(MSDA)模型都是以所有域的数据来联合培训的。这种一步式方法限制了它们适应目标域的能力。这是因为培训组由更多和标签的源域数据主导。如果采用第二个培训步骤,源域域域可减缓,因为模型只能使用假标签作为监管,对未标签目标域数据进行微调。然而,假的域域域适应(MSDA)模型不可避免地会吵闹,而且当使用时会给模型性能带来负面影响。为了解决这个问题,我们建议为MSDADA推出一种新型双级的Opimization目标培训(BORT$2$2$)方法。鉴于任何现有的经过充分训练的一步MSDDA模型,BO$2$2$将它变成一个标签功能,为目标数据生成假标签的假标签,仅使用假标签目标数据进行培训。CRDASDMSaldality 模型是用来对模型进行彻底的测试,我们每个标签标签的固定目标域域域域域域域域域变的标签上显示一个常规性标签的标签的精确度,通过标签的标签的标签的标签的标签,我们将产生一个最精确的标记的标签的标记的标记的标记的标记的功能的功能的标记, 。