Dense Retrieval (DR) has achieved state-of-the-art first-stage ranking effectiveness. However, the efficiency of most existing DR models is limited by the large memory cost of storing dense vectors and the time-consuming nearest neighbor search (NNS) in vector space. Therefore, we present RepCONC, a novel retrieval model that learns discrete Representations via CONstrained Clustering. RepCONC jointly trains dual-encoders and the Product Quantization (PQ) method to learn discrete document representations and enables fast approximate NNS with compact indexes. It models quantization as a constrained clustering process, which requires the document embeddings to be uniformly clustered around the quantization centroids and supports end-to-end optimization of the quantization method and dual-encoders. We theoretically demonstrate the importance of the uniform clustering constraint in RepCONC and derive an efficient approximate solution for constrained clustering by reducing it to an instance of the optimal transport problem. Besides constrained clustering, RepCONC further adopts a vector-based inverted file system (IVF) to support highly efficient vector search on CPUs. Extensive experiments on two popular ad-hoc retrieval benchmarks show that RepCONC achieves better ranking effectiveness than competitive vector quantization baselines under different compression ratio settings. It also substantially outperforms a wide range of existing retrieval models in terms of retrieval effectiveness, memory efficiency, and time efficiency.
翻译:RepCONC,这是一个通过CONCONC 学习离散代表的新型检索模型。RepCONCONC公司联合培训了双向编码器和产品量化(PQ)方法,以学习离散文件表示方式,使NNS能够以紧凑指数快速接近NNS。它模拟了限制组合过程的量化,这要求将文件嵌入在四分制中间统一组合,支持四分制方法的端对端优化和双向相近方。我们从理论上展示了CONC公司统一组合限制的重要性,通过将它降低到最佳运输问题的例子,为限制组合提供了高效的近似值解决方案。除了限制的组合外,ReconC还进一步采用了基于矢量的反向档案系统(IVF),以限制的组合过程为基础,要求将文件嵌入的特性统一集中在四分解式中间体中间体的近距离搜索中,支持四分级制的量化方法的端对QOF标准进行高效率程度的升级,并在CPU公司现有基准下,在高端级标准下进行高端级的矢量级的矢量级的矢量级检索。