Recent advance in Dense Retrieval (DR) techniques has significantly improved the effectiveness of first-stage retrieval. Trained with large-scale supervised data, DR models can encode queries and documents into a low-dimensional dense space and conduct effective semantic matching. However, previous studies have shown that the effectiveness of DR models would drop by a large margin when the trained DR models are adopted in a target domain that is different from the domain of the labeled data. One of the possible reasons is that the DR model has never seen the target corpus and thus might be incapable of mitigating the difference between the training and target domains. In practice, unfortunately, training a DR model for each target domain to avoid domain shift is often a difficult task as it requires additional time, storage, and domain-specific data labeling, which are not always available. To address this problem, in this paper, we propose a novel DR framework named Disentangled Dense Retrieval (DDR) to support effective and flexible domain adaptation for DR models. DDR consists of a Relevance Estimation Module (REM) for modeling domain-invariant matching patterns and several Domain Adaption Modules (DAMs) for modeling domain-specific features of multiple target corpora. By making the REM and DAMs disentangled, DDR enables a flexible training paradigm in which REM is trained with supervision once and DAMs are trained with unsupervised data. Comprehensive experiments in different domains and languages show that DDR significantly improves ranking performance compared to strong DR baselines and substantially outperforms traditional retrieval methods in most scenarios.
翻译:Dense Retreival (DR) 技术的最近进步大大提高了第一阶段检索的有效性。DR模型经过大规模监督数据的培训,DR模型可以将查询和文件编码成一个低维密集的空间,并进行有效的语义匹配。然而,以往的研究显示,如果在与标签数据领域不同的目标领域采用经过培训的DR模型,DR模型的效力将大大下降。其中一个可能的原因是DR模型从未见过目标内容,因此可能无法缩小培训与目标域之间的差异。在实践上,不幸的是,为每个目标域培训DR模型,以避免领域转换为低维密度的密集空间,并开展有效的语义匹配。为了解决这个问题,在本文件中,我们提议一个名为DRD框架,用于支持DR模型的有效和灵活的域域内适应。DR模型包括一个与经过培训的域域域域域域域域域域域间模拟模型(REM),用来对DREM模型进行大幅的匹配模式和多个目标域域域域域间模型进行升级。