This paper presents an automatic method to evaluate the naturalness of natural language generation in dialogue systems. While this task was previously rendered through expensive and time-consuming human labor, we present this novel task of automatic naturalness evaluation of generated language. By fine-tuning the BERT model, our proposed naturalness evaluation method shows robust results and outperforms the baselines: support vector machines, bi-directional LSTMs, and BLEURT. In addition, the training speed and evaluation performance of naturalness model are improved by transfer learning from quality and informativeness linguistic knowledge.
翻译:本文件介绍了一种自动方法来评价对话系统中自然语言生成的自然性质。 虽然这项任务以前是通过昂贵和耗时的人力劳动完成的,但我们提出了对生成的语言进行自动自然性评估的这一新任务。 通过微调BERT模型,我们提议的自然性评估方法显示了稳健的结果并超过了基线:支持矢量机、双向LSTM和BLEURT。此外,通过从质量和知识语言知识中传授知识,提高了自然性能模型的培训速度和评估性能。