Efficient k-nearest neighbor search is a fundamental task, foundational to many problems in NLP. When similarity is measured by the dot-product between dual-encoder vectors or by $\ell_2$-distance, there already exist many scalable and efficient search methods. But not so when similarity is measured by more accurate and expensive black-box neural similarity models, such as cross-encoders, which jointly encode the query and candidate neighbor. The cross-encoders' high computational cost typically limits their use to reranking candidates retrieved by a cheaper model, such as a dual-encoder or TF-IDF. However, the accuracy of such a two-stage approach is upper-bounded by the recall of the initial candidate set, and the approach potentially requires additional training to align the auxiliary retrieval model with the cross-encoder model. In this paper, we present an approach that avoids the use of a dual-encoder for retrieval, relying solely on the cross-encoder. Retrieval is made efficient with CUR decomposition, a matrix decomposition approach that approximates all pairwise cross-encoder distances from a small subset of rows and columns of the distance matrix. Indexing items using our approach is computationally cheaper than training an auxiliary dual-encoder model through distillation. Empirically, for $k > 10$, our approach provides test-time recall-vs-computational-cost trade-offs superior to the current widely used methods that re-rank items retrieved using a dual-encoder or TF-IDF.
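To make the CUR idea concrete, here is a minimal NumPy sketch of CUR-style score approximation. It is an illustration of the generic CUR identity $M \approx C\,W^{+}R$ (approximating a score matrix from a subset of its rows and columns), not the paper's implementation: the helper names (`score_fn`, `build_index`, `approx_scores`, `top_k`) and the choice of anchor queries/items are assumptions made for the example.

```python
import numpy as np

def build_index(score_fn, anchor_queries, items):
    """Offline: score a few anchor queries against every item.

    Returns R, the subset of *rows* of the full score matrix M.
    `score_fn(q, x)` stands in for an expensive black-box cross-encoder.
    """
    return np.array([[score_fn(q, x) for x in items] for q in anchor_queries])

def approx_scores(score_fn, query, items, anchor_item_idx, R):
    """Online: score the test query only against the anchor items (its slice
    of the *columns* C), then extrapolate to all items via the CUR identity
    M[q, :] ~= C[q, anchors] @ pinv(R[:, anchors]) @ R."""
    c = np.array([score_fn(query, items[j]) for j in anchor_item_idx])
    U_pinv = np.linalg.pinv(R[:, anchor_item_idx])  # pseudoinverse of the intersection block W
    return c @ U_pinv @ R                           # approximate scores for every item

def top_k(score_fn, query, items, anchor_item_idx, R, k=10):
    """Retrieve k candidates by approximate score, then re-score them exactly."""
    approx = approx_scores(score_fn, query, items, anchor_item_idx, R)
    cand = np.argsort(-approx)[:k]
    exact = {j: score_fn(query, items[j]) for j in cand}
    return sorted(exact, key=exact.get, reverse=True)
```

Under these assumptions, each test query costs only $|\text{anchor items}| + k$ exact cross-encoder calls rather than one call per corpus item; the remaining scores are recovered from the precomputed rows R by cheap matrix algebra.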