Deep learning has become the method of choice to tackle real-world problems in different domains, partly because of its ability to learn from data and achieve impressive performance on a wide range of applications. However, its success usually relies on two assumptions: (i) vast troves of labeled datasets are required for accurate model fitting, and (ii) training and testing data are independent and identically distributed. Its performance on unseen target domains, thus, is not guaranteed, especially when encountering out-of-distribution data at the adaptation stage. The performance drop on data in a target domain is a critical problem in deploying deep neural networks that are successfully trained on data in a source domain. Unsupervised domain adaptation (UDA) is proposed to counter this, by leveraging both labeled source domain data and unlabeled target domain data to carry out various tasks in the target domain. UDA has yielded promising results on natural image processing, video analysis, natural language processing, time-series data analysis, medical image analysis, etc. In this review, as a rapidly evolving topic, we provide a systematic comparison of its methods and applications. In addition, the connection of UDA with its closely related tasks, e.g., domain generalization and out-of-distribution detection, has also been discussed. Furthermore, deficiencies in current methods and possible promising directions are highlighted.
翻译:深层学习已成为解决不同领域现实世界问题的选择方法,部分原因是它能够从数据中学习,并在广泛的应用领域取得令人印象深刻的成绩,但是,它的成功通常取决于两个假设:(一) 精确的模型安装需要大量贴标签的数据集;(二) 培训和测试数据是独立和同样分布的,因此,它在无形目标领域的绩效得不到保障,特别是在适应阶段遇到分配数据时。目标领域的数据性能下降是部署在源领域数据方面经过成功培训的深层神经网络的一个严重问题。建议进行不受监督的域适应(UDA)以对抗这一点,办法是利用标签的源域数据和未贴标签的目标域数据在目标领域执行各种任务。UDA在自然图像处理、视频分析、自然语言处理、时间序列数据分析、医学图像分析等方面取得了令人乐观的结果。在本次审查中,作为一个迅速演变的专题,我们对其方法和应用进行了系统化比较。此外,UDA与当前领域发现的方法之间的联系也充满希望。