Current conversational passage retrieval systems cast conversational search into ad-hoc search by using an intermediate query resolution step that places the user's question in context of the conversation. While the proposed methods have proven effective, they still assume the availability of large-scale question resolution and conversational search datasets. To waive the dependency on the availability of such data, we adapt a pre-trained token-level dense retriever on ad-hoc search data to perform conversational search with no additional fine-tuning. The proposed method allows to contextualize the user question within the conversation history, but restrict the matching only between question and potential answer. Our experiments demonstrate the effectiveness of the proposed approach. We also perform an analysis that provides insights of how contextualization works in the latent space, in essence introducing a bias towards salient terms from the conversation.
翻译:目前的对话通道检索系统通过使用中间查询解答步骤,将用户的问题与对话中的问题放在一起,从而将谈话搜索到特别的搜索中。虽然建议的方法已证明是有效的,但它们仍然假设可以提供大规模解答问题和对话搜索数据集。为避免对这些数据的可用性的依赖,我们调整了事先培训的象征性密集检索器,在不作其他微调的情况下进行谈话搜索。拟议方法允许将用户问题与对话史上的问题相关联,但只限制对问题和潜在答案的匹配。我们的实验展示了拟议方法的有效性。我们还进行了一项分析,对潜在空间背景化如何运作提供了深刻的洞察力,实质上引入了对对话中突出术语的偏向。