Recent developments of dense retrieval rely on quality representations of queries and contexts coming from pre-trained query and context encoders. In this paper, we introduce TouR (test-time optimization of query representations), which further optimizes instance-level query representations guided by signals from test-time retrieval results. We leverage a cross-encoder re-ranker to provide fine-grained pseudo labels over retrieval results and iteratively optimize query representations with the gradient descent method. Our theoretical analysis reveals that TouR can be viewed as a generalization of the classical Rocchio's algorithm for pseudo relevance feedback, and we present two variants leveraging psuedo labels as either hard binary or soft continuous labels. We first apply TouR on phrase retrieval with our proposed phrase re-ranker. On passage retrieval, we demonstrate its effectiveness with an off-the-shelf re-ranker. TouR improves the end-to-end open-domain QA accuracy significantly, as well as passage retrieval performance. Compared to re-ranker, TouR requires a smaller number of candidates, and achieves consistently better performance and runs up to 4x faster with our efficient implementation.
翻译:最近的密集检索发展取决于来自事先经过训练的查询和上下文编码器的查询和背景的高质量描述。在本文中,我们引入了TouR(对查询表示的测试-时间优化),以测试-时间检索结果的信号为指导,进一步优化实例级查询的表示;我们利用跨编码器重新排序器,在检索结果上提供精细的假标签,并迭接地优化与梯度下移法的查询表达方式。我们的理论分析表明,TouR可以被视为古典Rocchio的古典 Rocchio算法的概括化伪相关性反馈,我们提出两种变式,利用psuedo标签作为硬二进制或软连续标签。我们首先应用TouR的短语检索,用我们提议的重新排序器重新排序器进行。在通道检索时,我们用现成的重新排序器展示其有效性。TouR改进了端到端端端端开放QA的准确性,并改进了通过检索的性能。与重新排序器相比,TouR需要较少的候选人,并不断提高执行速度。