密集稀疏检索: 利用稀疏语言模型提高密集检索的推断效率 (Dense Sparse Retrieval: Using Sparse Language Models for Inference Efficient Dense Retrieval) - 专知论文

会员服务 ·

0

稀疏 · 密集检索 · 推断 · 语言模型 · 上下文 ·

2023 年 3 月 31 日

Dense Sparse Retrieval: Using Sparse Language Models for Inference Efficient Dense Retrieval

翻译：密集稀疏检索: 利用稀疏语言模型提高密集检索的推断效率

Daniel Campos,ChengXiang Zhai

Vector-based retrieval systems have become a common staple for academic and industrial search applications because they provide a simple and scalable way of extending the search to leverage contextual representations for documents and queries. As these vector-based systems rely on contextual language models, their usage commonly requires GPUs, which can be expensive and difficult to manage. Given recent advances in introducing sparsity into language models for improved inference efficiency, in this paper, we study how sparse language models can be used for dense retrieval to improve inference efficiency. Using the popular retrieval library Tevatron and the MSMARCO, NQ, and TriviaQA datasets, we find that sparse language models can be used as direct replacements with little to no drop in accuracy and up to 4.3x improved inference speeds

翻译：向量检索系统已成为学术和工业搜索应用的常用工具，因为它们提供了一种简单且可扩展的方式，利用上下文表示来扩展搜索范围。由于这些基于向量的系统依赖于上下文语言模型，所以通常需要使用显卡，这可能会很昂贵且难以管理。鉴于最近引入了稀疏性来提高语言模型的推断效率，本文研究了如何利用稀疏语言模型进行密集检索，以提高推断效率。使用流行的检索库Tevatron和MSMARCO、NQ和TriviaQA数据集，我们发现稀疏语言模型可以直接替换常用语言模型，几乎不会降低准确性，并且推断速度提高了多达4.3倍。

0

相关内容

百篇论文纵览大型语言模型最新研究进展

百篇论文纵览大型语言模型最新研究进展

专知会员服务

70+阅读 · 2023年3月31日

如何使用TensorFlow 排序构建推荐系统? How to build a recommendation system using TensorFlow Ranking?

如何使用TensorFlow 排序构建推荐系统? How to build a recommendation system using TensorFlow Ranking?

专知会员服务

19+阅读 · 2022年3月13日

【CVPR 2022】跨模态检索的协同双流视觉-语言前训练模型，COTS: Collaborative Two-Stream Vision-Language Pre-Training Model for Cross-Modal Retrieval

【CVPR 2022】跨模态检索的协同双流视觉-语言前训练模型，COTS: Collaborative Two-Stream Vision-Language Pre-Training Model for Cross-Modal Retrieval

专知会员服务

13+阅读 · 2022年3月12日

知识增强的文本生成研究进展

知识增强的文本生成研究进展

专知会员服务

100+阅读 · 2021年3月6日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

【ICML2020-伯克利】反直觉！大模型重压缩提升Transformer的训练和推理效率，47页ppt

【ICML2020-伯克利】反直觉！大模型重压缩提升Transformer的训练和推理效率，47页ppt

专知会员服务

70+阅读 · 2020年7月1日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【Google ICLR2020论文】嵌入式大规模检索的预训练任务，Pre-training Tasks for Embedding-based Large-scale Retrieval

【Google ICLR2020论文】嵌入式大规模检索的预训练任务，Pre-training Tasks for Embedding-based Large-scale Retrieval

专知会员服务

28+阅读 · 2020年2月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

利用动态深度学习预测金融时间序列基于Python

利用动态深度学习预测金融时间序列基于Python

量化投资与机器学习

18+阅读 · 2018年10月30日

【论文推荐】最新七篇自注意力机制(Self-attention)相关论文—结构化自注意力、相对位置、混合、句子表达、文本向量

【论文推荐】最新七篇自注意力机制(Self-attention)相关论文—结构化自注意力、相对位置、混合、句子表达、文本向量

专知

29+阅读 · 2018年3月12日

【推荐】用TensorFlow实现LSTM社交对话股市情感分析

【推荐】用TensorFlow实现LSTM社交对话股市情感分析

机器学习研究会

11+阅读 · 2018年1月14日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

高维积分波动率矩阵的估计及其在资产投资中的应用

国家自然科学基金

0+阅读 · 2015年12月31日

考虑观测值时空相关性的InSAR三维形变估计方法

国家自然科学基金

0+阅读 · 2013年12月31日

非参数动态混合Copula模型：估计、推断及应用

国家自然科学基金

0+阅读 · 2013年12月31日

Cofilin在Erucin诱导的乳腺癌细胞线粒体分裂和细胞凋亡中的作用及分子机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

纤维素温和水解为葡萄糖的模拟酶催化研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于特征融合的刑侦图像数据库检索算法研究

国家自然科学基金

0+阅读 · 2012年12月31日

掺杂最外层具有6s2电子结构的元素提升Ca5Al2Sb6基材料热电性能的理论研究

国家自然科学基金

0+阅读 · 2012年12月31日

CK2介导NF-kB信号通路在前列腺癌细胞增殖及凋亡作用中的分子机制

国家自然科学基金

0+阅读 · 2011年12月31日

跨语言信息检索中的机器翻译研究

国家自然科学基金

2+阅读 · 2011年12月31日

三维模型语义分析与检索研究

国家自然科学基金

2+阅读 · 2008年12月31日

Candidate Set Re-ranking for Composed Image Retrieval with Dual Multi-modal Encoder

Arxiv

0+阅读 · 2023年5月25日

Enhancing the Ranking Context of Dense Retrieval Methods through Reciprocal Nearest Neighbors

Arxiv

0+阅读 · 2023年5月25日

Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy

Arxiv

0+阅读 · 2023年5月24日

IdealGPT: Iteratively Decomposing Vision and Language Reasoning via Large Language Models

Arxiv

0+阅读 · 2023年5月24日

Text encoders are performance bottlenecks in contrastive vision-language models

Arxiv

0+阅读 · 2023年5月24日

Predicting Token Impact Towards Efficient Vision Transformer

Arxiv

0+阅读 · 2023年5月24日

Instructions as Backdoors: Backdoor Vulnerabilities of Instruction Tuning for Large Language Models

Arxiv

0+阅读 · 2023年5月24日

NAIL: Lexical Retrieval Indices with Efficient Non-Autoregressive Decoders

Arxiv

0+阅读 · 2023年5月23日

Pre-training Methods in Information Retrieval

Arxiv

16+阅读 · 2021年11月27日

Detect-to-Retrieve: Efficient Regional Aggregation for Image Search

Arxiv

15+阅读 · 2018年12月4日

VIP会员

文章信息

相关主题

相关VIP内容

百篇论文纵览大型语言模型最新研究进展

百篇论文纵览大型语言模型最新研究进展

专知会员服务

70+阅读 · 2023年3月31日

如何使用TensorFlow 排序构建推荐系统? How to build a recommendation system using TensorFlow Ranking?

如何使用TensorFlow 排序构建推荐系统? How to build a recommendation system using TensorFlow Ranking?

专知会员服务

19+阅读 · 2022年3月13日

【CVPR 2022】跨模态检索的协同双流视觉-语言前训练模型，COTS: Collaborative Two-Stream Vision-Language Pre-Training Model for Cross-Modal Retrieval

【CVPR 2022】跨模态检索的协同双流视觉-语言前训练模型，COTS: Collaborative Two-Stream Vision-Language Pre-Training Model for Cross-Modal Retrieval

专知会员服务

13+阅读 · 2022年3月12日

知识增强的文本生成研究进展

知识增强的文本生成研究进展

专知会员服务

100+阅读 · 2021年3月6日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

【ICML2020-伯克利】反直觉！大模型重压缩提升Transformer的训练和推理效率，47页ppt

【ICML2020-伯克利】反直觉！大模型重压缩提升Transformer的训练和推理效率，47页ppt

专知会员服务

70+阅读 · 2020年7月1日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【Google ICLR2020论文】嵌入式大规模检索的预训练任务，Pre-training Tasks for Embedding-based Large-scale Retrieval

【Google ICLR2020论文】嵌入式大规模检索的预训练任务，Pre-training Tasks for Embedding-based Large-scale Retrieval

专知会员服务

28+阅读 · 2020年2月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

热门VIP内容

开通专知VIP会员享更多权益服务

操作系统智能体：基于多模态大模型（MLLM）的通用计算设备智能体综述

《美国太空军系统全生命周期建模、仿真与分析效能提升方案》最新84页报告

【博士论文】推进数据高效的深度学习：非参数 Transformer、主动测试与上下文学习

自主人工智能：未来战争是否将是自主化的？

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

利用动态深度学习预测金融时间序列基于Python

利用动态深度学习预测金融时间序列基于Python

量化投资与机器学习

18+阅读 · 2018年10月30日

【论文推荐】最新七篇自注意力机制(Self-attention)相关论文—结构化自注意力、相对位置、混合、句子表达、文本向量

【论文推荐】最新七篇自注意力机制(Self-attention)相关论文—结构化自注意力、相对位置、混合、句子表达、文本向量

专知

29+阅读 · 2018年3月12日

【推荐】用TensorFlow实现LSTM社交对话股市情感分析

【推荐】用TensorFlow实现LSTM社交对话股市情感分析

机器学习研究会

11+阅读 · 2018年1月14日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

Candidate Set Re-ranking for Composed Image Retrieval with Dual Multi-modal Encoder

Arxiv

0+阅读 · 2023年5月25日

Enhancing the Ranking Context of Dense Retrieval Methods through Reciprocal Nearest Neighbors

Arxiv

0+阅读 · 2023年5月25日

Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy

Arxiv

0+阅读 · 2023年5月24日

IdealGPT: Iteratively Decomposing Vision and Language Reasoning via Large Language Models

Arxiv

0+阅读 · 2023年5月24日

Text encoders are performance bottlenecks in contrastive vision-language models

Arxiv

0+阅读 · 2023年5月24日

Predicting Token Impact Towards Efficient Vision Transformer

Arxiv

0+阅读 · 2023年5月24日

Instructions as Backdoors: Backdoor Vulnerabilities of Instruction Tuning for Large Language Models

Arxiv

0+阅读 · 2023年5月24日

NAIL: Lexical Retrieval Indices with Efficient Non-Autoregressive Decoders

Arxiv

0+阅读 · 2023年5月23日

Pre-training Methods in Information Retrieval

Arxiv

16+阅读 · 2021年11月27日

Detect-to-Retrieve: Efficient Regional Aggregation for Image Search

Arxiv

15+阅读 · 2018年12月4日

相关基金

高维积分波动率矩阵的估计及其在资产投资中的应用

国家自然科学基金

0+阅读 · 2015年12月31日

考虑观测值时空相关性的InSAR三维形变估计方法

国家自然科学基金

0+阅读 · 2013年12月31日

非参数动态混合Copula模型：估计、推断及应用

国家自然科学基金

0+阅读 · 2013年12月31日

Cofilin在Erucin诱导的乳腺癌细胞线粒体分裂和细胞凋亡中的作用及分子机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

纤维素温和水解为葡萄糖的模拟酶催化研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于特征融合的刑侦图像数据库检索算法研究

国家自然科学基金

0+阅读 · 2012年12月31日

掺杂最外层具有6s2电子结构的元素提升Ca5Al2Sb6基材料热电性能的理论研究

国家自然科学基金

0+阅读 · 2012年12月31日

CK2介导NF-kB信号通路在前列腺癌细胞增殖及凋亡作用中的分子机制

国家自然科学基金

0+阅读 · 2011年12月31日

跨语言信息检索中的机器翻译研究

国家自然科学基金

2+阅读 · 2011年12月31日

三维模型语义分析与检索研究

国家自然科学基金

2+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员