In this paper, we consider the problem of improving the inference latency of language model-based dense retrieval systems by introducing structural compression and model size asymmetry between the context and query encoders. First, we investigate the impact of pre- and post-training compression on MSMARCO, Natural Questions, TriviaQA, SQuAD, and SciFact, finding that asymmetry between the dual encoders in dense retrieval can lead to improved inference efficiency. Building on this finding, we introduce Kullback-Leibler Alignment of Embeddings (KALE), an efficient and accurate method for increasing the inference efficiency of dense retrieval methods by pruning and aligning the query encoder after training. Specifically, KALE extends traditional knowledge distillation to the post-training stage of bi-encoders, allowing for effective query encoder compression without full retraining or index generation. Using KALE and asymmetric training, we can generate models that exceed the performance of DistilBERT despite having 3x faster inference.
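To make the post-training alignment idea concrete, the following is a minimal sketch of the kind of procedure the abstract describes: structurally prune a trained query encoder, then train the pruned copy to match the original encoder's query embeddings with a KL-based objective. The encoder architecture, pooling choice, loss form, and all names (prune_layers, kl_alignment_loss) are illustrative assumptions, not the paper's implementation.

```python
# Hedged sketch of KALE-style post-training query-encoder compression.
# Assumptions: a trained "teacher" query encoder, a pruned "student" copy,
# and a stream of query representations to align on. Toy data is used here.

import copy
import torch
import torch.nn.functional as F
from torch import nn


def prune_layers(encoder: nn.TransformerEncoder, keep: int) -> nn.TransformerEncoder:
    """Structurally compress the encoder by keeping only its first `keep` layers."""
    pruned = copy.deepcopy(encoder)
    pruned.layers = nn.ModuleList(list(pruned.layers)[:keep])
    pruned.num_layers = keep
    return pruned


def kl_alignment_loss(student_emb: torch.Tensor, teacher_emb: torch.Tensor) -> torch.Tensor:
    """One possible KL-based alignment objective (an assumption about the exact
    form): treat each embedding as a softmax-normalized distribution and push
    the pruned encoder's output toward the original encoder's output."""
    log_p = F.log_softmax(student_emb, dim=-1)
    q = F.softmax(teacher_emb, dim=-1)
    return F.kl_div(log_p, q, reduction="batchmean")


# Toy setup: a 12-layer "teacher" query encoder, pruned to 3 layers.
d_model = 128
layer = nn.TransformerEncoderLayer(d_model=d_model, nhead=4, batch_first=True)
teacher = nn.TransformerEncoder(layer, num_layers=12)
student = prune_layers(teacher, keep=3)

optimizer = torch.optim.AdamW(student.parameters(), lr=1e-4)
teacher.eval()

for step in range(100):
    # Stand-in for a batch of embedded query token sequences (batch, seq, dim).
    queries = torch.randn(32, 16, d_model)
    with torch.no_grad():
        t_emb = teacher(queries)[:, 0]  # first-token pooled query embedding
    s_emb = student(queries)[:, 0]
    loss = kl_alignment_loss(s_emb, t_emb)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```

Because only the query encoder is replaced, the context encoder and the document index built from it stay untouched, which is what lets this alignment step avoid full retraining or index generation.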