Existing Domain Adaptation (DA) algorithms train target models and then use the target models to classify all samples in the target dataset. While this approach attempts to address the problem that the source and the target data are from different distributions, it fails to recognize the possibility that, within the target domain, some samples are closer to the distribution of the source domain than the distribution of the target domain. In this paper, we develop a novel DA algorithm, the Enforced Transfer, that deals with this situation. A straightforward but effective idea to deal with this dilemma is to use an out-of-distribution detection algorithm to decide if, during the testing phase, a given sample is closer to the distribution of the source domain, the target domain, or neither. In the first case, this sample is given to a machine learning classifier trained on source samples. In the second case, this sample is given to a machine learning classifier trained on target samples. In the third case, this sample is discarded as neither an ML model trained on source nor an ML model trained on target is suitable to classify it. It is widely known that the first few layers in a neural network extract low-level features, so the aforementioned approach can be extended from classifying samples in three different scenarios to classifying the samples' activations after an empirically determined layer in three different scenarios. The Enforced Transfer implements the idea. On three types of DA tasks, we outperform the state-of-the-art algorithms that we compare against.
翻译:现有域适应算法(DA) 培训目标模型, 然后使用目标模型对目标数据集中的所有样本进行分类。 虽然这一方法试图解决源和目标数据来自不同分布的源和目标数据的问题,但它没有认识到,在目标域内,有些样本更接近源域分布,而不是目标域分布。在本文中,我们开发了一个新的DA算法,即“强制传输”,处理这种情况。处理这一困境的一个简单而有效的想法是,使用一种超出分配的检测算法,以确定在测试阶段,一个特定样本是否更接近源域、目标域或两者的分布。在第一个案例中,这一样本提供给了在源样品样品样品方面受过训练的机器学习分类师。在第二个案例中,这一样本交给了在目标样品方面受过训练的机器学习分类师。在第三个案例中,这一样本被丢弃为既非在源上受过训练的ML模型,在目标模型上受过训练的ML模型,也不适合对其进行分类。 众所周知,一个神经网络中的第一个几层与来源域域域域域域域域域域域域域域域域域域域域域域、目标域域域域域域域域域域域域或两者之间,在进行不同层次的分类分析后,在三个不同的分类中,在进行不同层次的分类,在三个模型中,在进行不同级的分类,在三个级域域域域域域域域域域域域域域域域域域域域域域域域变的模型中,在前,在前,在前,在前,在前,在进行不同级变变变变变变。