Unsupervised source-free domain adaptation methods aim to train a model to be used in the target domain utilizing the pretrained source-domain model and unlabeled target-domain data, where the source data may not be accessible due to intellectual property or privacy issues. These methods frequently utilize self-training with pseudo-labeling thresholded by prediction confidence. In a source-free scenario, only supervision comes from target data, and thresholding limits the contribution of the self-training. In this study, we utilize self-training with a mean-teacher approach. The student network is trained with all predictions of the teacher network. Instead of thresholding the predictions, the gradients calculated from the pseudo-labels are weighted based on the reliability of the teacher's predictions. We propose a novel method that uses proxy-based metric learning to estimate reliability. We train a metric network on the encoder features of the teacher network. Since the teacher is updated with the moving average, the encoder feature space is slowly changing. Therefore, the metric network can be updated in training time, which enables end-to-end training. We also propose a metric-based online ClassMix method to augment the input of the student network where the patches to be mixed are decided based on the metric reliability. We evaluated our method in synthetic-to-real and cross-city scenarios. The benchmarks show that our method significantly outperforms the existing state-of-the-art methods.
翻译:未经监督的无源域适应方法旨在培训一个模型,供目标领域使用,使用事先培训的源域模型和未贴标签的目标域数据,因为知识产权或隐私问题可能无法获取源数据。这些方法经常使用假标签阈值的自我培训,通过预测信心进行伪标签阈值。在无源假设中,仅监督来自目标数据,限值限制自我培训的贡献。在这项研究中,我们采用中度教师方法进行自我培训。学生网络接受教师网络的所有预测培训。从假标签计算出的梯度不是根据教师预测的可靠性进行加权,而是根据教师预测的可靠性进行加权。我们提出一种新颖的方法,使用基于代名标签的标准学习来估计可靠性。我们只对教师网络的编码特征进行计量网络培训。由于教师采用移动平均数更新,编码特征空间正在慢慢变化。因此,衡量网络可以在培训时更新,从而能够进行端到端培训。我们从假标签上计算出的梯度梯度的梯度,而不是根据教师预测的可靠性。我们还提出了一种使用基于代号的衡量模型的方法来估计可靠性。我们还提议了一种基于以校正基数的在线模型的方法,用以大大地评估我们的校正比标准,从而将校校校校校校校方法的校校校校比。