For open-domain conversational question answering (CQA), it is important to retrieve the most relevant passages to answer a question, but this is challenging compared with standard passage retrieval because it requires understanding the full dialogue context rather than a single query. Moreover, it can be expensive to re-train well-established retrievers such as search engines that are originally developed for non-conversational queries. To facilitate their use, we develop a query rewriting model CONQRR that rewrites a conversational question in context into a standalone question. It is trained with a novel reward function to directly optimize towards retrieval and can be adapted to any fixed blackbox retriever using reinforcement learning. We show that CONQRR achieves state-of-the-art results on a recent open-domain CQA dataset, a combination of conversations from three different sources. We also conduct extensive experiments to show the effectiveness of CONQRR for any given fixed retriever.
翻译:对于开放域对话答题(CQA),必须检索最相关的段落,以回答一个问题,但与标准通道检索相比,这具有挑战性,因为它需要理解全面对话背景,而不是单一查询。此外,再培训成熟的检索器,如最初为非对话查询开发的搜索引擎等,费用很高。为了便于使用,我们开发了查询重写模式COQRR,将一个对话问题重新写成一个单独的问题。它经过新颖的奖励功能培训,可以直接优化检索,并可以通过强化学习适应任何固定的黑盒检索器。我们显示,CONQRR在近期的开放域 CQA数据集上取得了最新的结果,这是三个不同来源对话的组合。我们还进行了广泛的实验,以显示CONQRR对任何给定固定检索器的有效性。