Persona-based dialogue systems aim to generate consistent responses based on historical context and predefined persona. Unlike conventional dialogue generation, the persona-based dialogue needs to consider both dialogue context and persona, posing a challenge for coherent training. Specifically, this requires a delicate weight balance between context and persona. To achieve that, in this paper, we propose an effective framework with Persona-Adaptive Attention (PAA), which adaptively integrates the weights from the persona and context information via our designed attention. In addition, a dynamic masking mechanism is applied to the PAA to not only drop redundant information in context and persona but also serve as a regularization mechanism to avoid overfitting. Experimental results demonstrate the superiority of the proposed PAA framework compared to the strong baselines in both automatic and human evaluation. Moreover, the proposed PAA approach can perform equivalently well in a low-resource regime compared to models trained in a full-data setting, which achieve a similar result with only 20% to 30% of data compared to the larger models trained in the full-data setting. To fully exploit the effectiveness of our design, we designed several variants for handling the weighted information in different ways, showing the necessity and sufficiency of our weighting and masking designs.
翻译:与传统对话不同,个人对话需要既考虑对话背景又考虑个人,这给连贯培训带来了挑战。具体地说,这要求背景与个人之间的微妙权衡。为了实现这一点,我们在本文件中提议了一个有效的框架,即人与促进关注(PAA)相结合,通过我们设计的关注,将个人与背景信息之间的权重进行适应性整合;此外,对PAA采用动态掩码机制,不仅在背景和人之间传递多余的信息,而且还作为常规机制,以避免过度匹配。实验结果表明,拟议的PAA框架优于自动和人评价的强大基线。此外,提议的PAAA方法在低资源制度下,与在全面数据设置中培训的模式相比,效果相当,与在全面数据设置中培训的较大模型相比,只有20%至30%的数据取得了类似结果。为了充分利用我们设计的有效性,我们设计了以不同方式处理加权信息设计、必要性和充分性设计的若干变式。