Despite the tremendous success of neural dialogue models in recent years, it suffers a lack of relevance, diversity, and some times coherence in generated responses. Lately, transformer-based models, such as GPT-2, have revolutionized the landscape of dialogue generation by capturing the long-range structures through language modeling. Though these models have exhibited excellent language coherence, they often lack relevance and terms when used for domain-specific response generation. In this paper, we present DSRNet (Domain Specific Response Network), a transformer-based model for dialogue response generation by reinforcing domain-specific attributes. In particular, we extract meta attributes from context and infuse them with the context utterances for better attention over domain-specific key terms and relevance. We study the use of DSRNet in a multi-turn multi-interlocutor environment for domain-specific response generation. In our experiments, we evaluate DSRNet on Ubuntu dialogue datasets, which are mainly composed of various technical domain related dialogues for IT domain issue resolutions and also on CamRest676 dataset, which contains restaurant domain conversations. Trained with maximum likelihood objective, our model shows significant improvement over the state-of-the-art for multi-turn dialogue systems supported by better BLEU and semantic similarity (BertScore) scores. Besides, we also observe that the responses produced by our model carry higher relevance due to the presence of domain-specific key attributes that exhibit better overlap with the attributes of the context. Our analysis shows that the performance improvement is mostly due to the infusion of key terms along with dialogues which result in better attention over domain-relevant terms. Other contributing factors include joint modeling of dialogue context with the domain-specific meta attributes and topics.
翻译:尽管近年来神经对话模式取得了巨大成功,但近年来神经对话模式却缺乏相关性、多样性,有时在生成的响应中缺乏一致性。最近,以变压器为基础的模型,如GPT-2等,通过语言建模捕捉远程结构,使对话生成的景观发生了革命性的变化。虽然这些模型显示了极好的语言一致性,但在生成特定域响应时,这些模型往往缺乏相关性和术语。在本文件中,我们介绍了DSRNet(DSRNet)(Domain 特定响应网络),这是一个以变压器为基础的对话响应生成模式,主要通过强化特定域特性。特别是,我们从上下文中提取元属性,并用上下文表达来更好地关注特定域关键术语和相关性。我们研究DSRNet在多方向多方向的生成中是如何在多方向的生成中应用多方向生成的。我们模型展示了比比前更清晰的内值,并展示了比前方系统更清晰的内脏的内脏内脏内脏内脏内脏内涵。