Natural Language Generation (NLG) for task-oriented dialogue systems focuses on communicating specific content accurately, fluently, and coherently. While these attributes are crucial for a successful dialogue, it is also desirable to simultaneously accomplish specific stylistic goals, such as response length, point-of-view, descriptiveness, sentiment, formality, and empathy. In this work, we focus on stylistic control and evaluation for schema-guided NLG, with joint goals of achieving both semantic and stylistic control. We experiment in detail with various controlled generation methods for large pretrained language models: specifically, conditional training, guided fine-tuning, and guided decoding. We discuss their advantages and limitations, and evaluate them with a broad range of automatic and human evaluation metrics. Our results show that while high style accuracy and semantic correctness are easier to achieve for more lexically-defined styles with conditional training, stylistic control is also achievable for more semantically complex styles using discriminator-based guided decoding methods. The results also suggest that methods that are more scalable (with less hyper-parameters tuning) and that disentangle content generation and stylistic variations are more effective at achieving semantic correctness and style accuracy.
翻译:以任务为导向的对话系统自然语言生成(NLG)的自然语言生成(NLG)侧重于准确、流畅和连贯地传达具体内容。这些属性对于成功对话至关重要,但这些属性对于成功对话至关重要,但同时也有必要同时实现特定的立体目标,如反应长度、观点点、描述性、情绪、形式和感知等。在这项工作中,我们侧重于对系统指导型NLG的文体控制和评价,共同目标是既实现语义控制,又实现文体控制。我们详细试验各种受控的生成方法,用于大型预先训练的语言模型:具体而言,有条件培训、引导的微调和引导解码。我们讨论其优势和局限性,并以广泛的自动和人文评价指标来评估它们。我们的结果显示,虽然高风格的准确性和语义正确性更容易在有条件培训的更具有法律定义的风格上实现,但文体控制也能够实现更具有修饰性的风格的风格,使用基于歧视的导导解码方法。结果还表明,更具有可调性的方法(以不那么高的超度的版本)在生成格式上更精确性的内容和不精确性变。