Controllable Text Generation (CTG) is emerging area in the field of natural language generation (NLG). It is regarded as crucial for the development of advanced text generation technologies that are more natural and better meet the specific constraints in practical applications. In recent years, methods using large-scale pre-trained language models (PLMs), in particular the widely used transformer-based PLMs, have become a new paradigm of NLG, allowing generation of more diverse and fluent text. However, due to the lower level of interpretability of deep neural networks, the controllability of these methods need to be guaranteed. To this end, controllable text generation using transformer-based PLMs has become a rapidly growing yet challenging new research hotspot. A diverse range of approaches have emerged in the recent 3-4 years, targeting different CTG tasks which may require different types of controlled constraints. In this paper, we present a systematic critical review on the common tasks, main approaches and evaluation methods in this area. Finally, we discuss the challenges that the field is facing, and put forward various promising future directions. To the best of our knowledge, this is the first survey paper to summarize CTG techniques from the perspective of PLMs. We hope it can help researchers in related fields to quickly track the academic frontier, providing them with a landscape of the area and a roadmap for future research.
翻译:在自然语言生成(NLG)领域,可控制的文本生成(CTG)是新兴领域,对开发更自然、更能满足实际应用具体限制的先进文本生成技术至关重要,近年来,使用大规模预先培训语言模型的方法,特别是广泛使用的变压器的PLM,已成为NLG的新范例,允许生成更多样化和更流畅的文本。然而,由于深层神经网络的可解释性较低,这些方法的可控制性需要得到保证。为此,使用变压器的PLM的可控文本生成技术已成为一个迅速增长但又具有挑战性的新研究热点。近3-4年来,出现了各种各样的方法,针对不同的CTG任务,可能需要不同种类的受控限制。在本文件中,我们提出了对这一领域共同任务、主要方法和评估方法的系统批判性审查。我们讨论了该领域面临的挑战,并提出了各种有希望的未来方向。我们最了解的是,利用变压器的PLMS生成的可控文本,这是我们从未来方向上向MRMS提供我们未来研究方向的第一份调查文件。我们很快地为MRPLM的实地提供与G相关的研究工具。