Recent advances in NLP have been driven by a range of large-scale pretrained language models (PLMs). These PLMs have brought significant performance gains on many NLP tasks, circumventing the need for complex task-specific designs. However, most current work focuses on fine-tuning PLMs on domain-specific datasets, ignoring the fact that the domain gap can lead to overfitting and even performance degradation. It is therefore practically important to find appropriate methods for effectively adapting PLMs to a target domain of interest, and a range of methods have recently been proposed for this purpose. Early surveys on domain adaptation are not suitable for PLMs, because PLMs behave in more sophisticated ways than traditional models trained from scratch, and domain adaptation of PLMs needs to be redesigned to be effective. This paper aims to survey these newly proposed methods and shed light on how to apply traditional machine learning methods to newly evolved and future technologies. By examining the issues of deploying PLMs for downstream tasks, we propose a taxonomy of domain adaptation approaches from a machine learning system perspective, covering methods for input augmentation, model optimization, and personalization. We discuss and compare these methods and suggest promising future research directions.