Recent advancements in transformer-based models have greatly improved the ability of Question Answering (QA) systems to provide correct answers; in particular, answer sentence selection (AS2) models, core components of retrieval-based systems, have achieved impressive results. While generally effective, these models fail to provide a satisfying answer when all retrieved candidates are of poor quality, even if they contain correct information. In AS2, models are trained to select the best answer sentence among a set of candidates retrieved for a given question. In this work, we propose to generate answers from a set of AS2 top candidates. Rather than selecting the best candidate, we train a sequence to sequence transformer model to generate an answer from a candidate set. Our tests on three English AS2 datasets show improvement up to 32 absolute points in accuracy over the state of the art.
翻译:以变压器为基础的模型最近的进展大大提高了问答系统提供正确答案的能力,特别是答案选择(AS2)模型、检索系统的核心组成部分等答案选择(AS2)模型,取得了令人印象深刻的成果。这些模型虽然总体上有效,但未能在所有检索到的候选人质量差的情况下提供令人满意的答案,即使它们包含正确的信息。在AS2中,模型经过培训,能够在为某个问题检索到的一组候选人中选择最佳答案。在这项工作中,我们提议从一组AS2顶级候选人中找到答案。我们没有选择最佳候选人,而是训练一个序列来排列变压器模型的顺序,以便从候选人组中找到答案。我们对三个英文AS2数据集的测试显示,比艺术状态的精确度提高了32个绝对点。