Motivated by suggested question generation in conversational news recommendation systems, we propose a model for generating question-answer pairs (QA pairs) with self-contained, summary-centric questions and length-constrained, article-summarizing answers. We begin by collecting a new dataset of news articles with questions as titles and pairing them with summaries of varying length. This dataset is used to learn a QA pair generation model producing summaries as answers that balance brevity with sufficiency jointly with their corresponding questions. We then reinforce the QA pair generation process with a differentiable reward function to mitigate exposure bias, a common problem in natural language generation. Both automatic metrics and human evaluation demonstrate these QA pairs successfully capture the central gists of the articles and achieve high answer accuracy.
翻译:基于在谈话新闻建议系统中建议的问题生成,我们提出了一个模式,用于生成自成一体、以简易为中心的问题和篇幅限制、文章摘要回答的问答配对(QA配对),我们首先收集以标题为题的新新闻文章数据集,并将之与不同长度的摘要配对。这个数据集用来学习QA配对模式,制作摘要,以平衡简便与充分性及其相应问题。然后,我们用不同的报酬功能加强QA配对过程,以减少暴露偏差,这是自然语言生成中常见的一个问题。自动计量和人类评价都表明这些QA配对成功地抓住了文章的中央作者,并实现了高回答准确性。