DPTDR: 高密度通过率检索的深度快速查询 (DPTDR: Deep Prompt Tuning for Dense Passage Retrieval) - 专知论文

会员服务 ·

0

Prompt · tuning · Backbone · MoDELS · 可约的 ·

2022 年 8 月 24 日

DPTDR: Deep Prompt Tuning for Dense Passage Retrieval

翻译：DPTDR: 高密度通过率检索的深度快速查询

Zhengyang Tang,Benyou Wang,Ting Yao

from arxiv, Accepted in COLING 2022

Deep prompt tuning (DPT) has gained great success in most natural language processing~(NLP) tasks. However, it is not well-investigated in dense retrieval where fine-tuning~(FT) still dominates. When deploying multiple retrieval tasks using the same backbone model~(e.g., RoBERTa), FT-based methods are unfriendly in terms of deployment cost: each new retrieval model needs to repeatedly deploy the backbone model without reuse. To reduce the deployment cost in such a scenario, this work investigates applying DPT in dense retrieval. The challenge is that directly applying DPT in dense retrieval largely underperforms FT methods. To compensate for the performance drop, we propose two model-agnostic and task-agnostic strategies for DPT-based retrievers, namely retrieval-oriented intermediate pretraining and unified negative mining, as a general approach that could be compatible with any pre-trained language model and retrieval task. The experimental results show that the proposed method (called DPTDR) outperforms previous state-of-the-art models on both MS-MARCO and Natural Questions. We also conduct ablation studies to examine the effectiveness of each strategy in DPTDR. We believe this work facilitates the industry, as it saves enormous efforts and costs of deployment and increases the utility of computing resources. Our code is available at https://github.com/tangzhy/DPTDR.

翻译：深度快速调试(DPT)在大多数自然语言处理(NLP)任务中取得了巨大成功。但是,在微调((FT)仍然占主导地位的地方,在密集的检索中并没有很好地调查,在微调(FT)仍然占主导地位的地方,在密集的检索中直接应用DPT。在使用同一个主干模型~(如ROBERTA)时,基于FT的方法是不友好的:每个新的检索模型都需要反复部署主干模式而无需再用。在这样的情况下,这项工作调查了在密集的检索中应用DPT的部署费用。挑战在于,在密集的检索中直接应用DPT, 基本上不完善FT方法。为了弥补性下降,我们建议为DPT的检索者采用两种模式-Ancicti和任务-ncial-noral-norstal-screal 战略,即以检索为主干燥的中间培训和统一负采矿方法,作为与任何预先训练的语文模型和检索任务兼容的一般方法。实验结果表明,拟议的方法(称为DPTDPTDDDDDDR)超越了先前在MS-MCO和自然问题中的状态-MT-MDRUPDRPD和自然-IPIPL的部署中所用的标准模式。我们相信每一项工作效率,我们每一个的每一个工作的每一项工作都提高了工作成本。

0

相关内容

Prompt

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

【课程推荐】斯坦福课程：信息检索与网络搜索《CS 276: Information Retrieval and Web Search(Spring quarter 2019)》by Chris Manning, Pandu Nayak

【课程推荐】斯坦福课程：信息检索与网络搜索《CS 276: Information Retrieval and Web Search(Spring quarter 2019)》by Chris Manning, Pandu Nayak

专知会员服务

46+阅读 · 2019年12月2日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

RoBERTa中文预训练模型：RoBERTa for Chinese

RoBERTa中文预训练模型：RoBERTa for Chinese

PaperWeekly

57+阅读 · 2019年9月16日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【论文推荐】最新5篇度量学习（Metric Learning）相关论文—人脸验证、BIER、自适应图卷积、注意力机制、单次学习

【论文推荐】最新5篇度量学习（Metric Learning）相关论文—人脸验证、BIER、自适应图卷积、注意力机制、单次学习

专知

17+阅读 · 2018年2月11日

【推荐】SVM实例教程

【推荐】SVM实例教程

机器学习研究会

17+阅读 · 2017年8月26日

新型高性能阻醇质子交换膜纳米多级质子传输通道构筑及构效关系研究

国家自然科学基金

0+阅读 · 2015年12月31日

水体重金属在螯合秸秆纤维超微结构中的吸附机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

多功能无机杂化功能分子的制备与性能

国家自然科学基金

0+阅读 · 2014年12月31日

含镁Al-Zn镀层中周期层状组织的形成机理及耐蚀性研究

国家自然科学基金

0+阅读 · 2014年12月31日

固体氧化物燃料电池纳米结构阴极的构筑及中低温电化学性能

国家自然科学基金

0+阅读 · 2014年12月31日

基于智能在线虚拟参考反馈整定的控制方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

大丝束碳纤维短切机理及断口形貌控制

国家自然科学基金

0+阅读 · 2012年12月31日

钴合金亚纳米催化相控制生长亚纳米直径的单壁碳纳米管

国家自然科学基金

0+阅读 · 2012年12月31日

树形表面活性剂/石墨烯纳米复合材料的组装及生物大分子检测

国家自然科学基金

0+阅读 · 2012年12月31日

纳米无机杂化发光材料的分子设计与控制制备

国家自然科学基金

0+阅读 · 2009年12月31日

SynBench: Task-Agnostic Benchmarking of Pretrained Representations using Synthetic Data

Arxiv

0+阅读 · 2022年10月6日

Detect, Retrieve, Comprehend: A Flexible Framework for Zero-Shot Document-Level Question Answering

Arxiv

0+阅读 · 2022年10月4日

Context-Tuning: Learning Contextualized Prompts for Natural Language Generation

Arxiv

0+阅读 · 2022年10月3日

Retrieval-based Controllable Molecule Generation

Arxiv

0+阅读 · 2022年9月30日

Pre-training Methods in Information Retrieval

Arxiv

16+阅读 · 2021年11月27日

Dense Contrastive Learning for Self-Supervised Visual Pre-Training

Arxiv

18+阅读 · 2021年4月4日

Deep Image Retrieval: A Survey

Arxiv

16+阅读 · 2021年1月27日

A Decade Survey of Content Based Image Retrieval using Deep Learning

Arxiv

23+阅读 · 2020年11月23日

PROP: Pre-training with Representative Words Prediction for Ad-hoc Retrieval

Arxiv

11+阅读 · 2020年10月20日

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Arxiv

15+阅读 · 2018年10月11日

VIP会员

文章信息

相关主题

相关VIP内容

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

【课程推荐】斯坦福课程：信息检索与网络搜索《CS 276: Information Retrieval and Web Search(Spring quarter 2019)》by Chris Manning, Pandu Nayak

【课程推荐】斯坦福课程：信息检索与网络搜索《CS 276: Information Retrieval and Web Search(Spring quarter 2019)》by Chris Manning, Pandu Nayak

专知会员服务

46+阅读 · 2019年12月2日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【NeurIPS2025】面向时间序列基础模型的合成序列符号数据生成方法

军事通信市场七大趋势概述

【CMU博士论文】深度学习中泛化的量化、理解与改进

面向低光照图像增强的扩散模型

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

RoBERTa中文预训练模型：RoBERTa for Chinese

RoBERTa中文预训练模型：RoBERTa for Chinese

PaperWeekly

57+阅读 · 2019年9月16日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【论文推荐】最新5篇度量学习（Metric Learning）相关论文—人脸验证、BIER、自适应图卷积、注意力机制、单次学习

【论文推荐】最新5篇度量学习（Metric Learning）相关论文—人脸验证、BIER、自适应图卷积、注意力机制、单次学习

专知

17+阅读 · 2018年2月11日

【推荐】SVM实例教程

【推荐】SVM实例教程

机器学习研究会

17+阅读 · 2017年8月26日

相关论文

SynBench: Task-Agnostic Benchmarking of Pretrained Representations using Synthetic Data

Arxiv

0+阅读 · 2022年10月6日

Detect, Retrieve, Comprehend: A Flexible Framework for Zero-Shot Document-Level Question Answering

Arxiv

0+阅读 · 2022年10月4日

Context-Tuning: Learning Contextualized Prompts for Natural Language Generation

Arxiv

0+阅读 · 2022年10月3日

Retrieval-based Controllable Molecule Generation

Arxiv

0+阅读 · 2022年9月30日

Pre-training Methods in Information Retrieval

Arxiv

16+阅读 · 2021年11月27日

Dense Contrastive Learning for Self-Supervised Visual Pre-Training

Arxiv

18+阅读 · 2021年4月4日

Deep Image Retrieval: A Survey

Arxiv

16+阅读 · 2021年1月27日

A Decade Survey of Content Based Image Retrieval using Deep Learning

Arxiv

23+阅读 · 2020年11月23日

PROP: Pre-training with Representative Words Prediction for Ad-hoc Retrieval

Arxiv

11+阅读 · 2020年10月20日

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Arxiv

15+阅读 · 2018年10月11日

相关基金

新型高性能阻醇质子交换膜纳米多级质子传输通道构筑及构效关系研究

国家自然科学基金

0+阅读 · 2015年12月31日

水体重金属在螯合秸秆纤维超微结构中的吸附机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

多功能无机杂化功能分子的制备与性能

国家自然科学基金

0+阅读 · 2014年12月31日

含镁Al-Zn镀层中周期层状组织的形成机理及耐蚀性研究

国家自然科学基金

0+阅读 · 2014年12月31日

固体氧化物燃料电池纳米结构阴极的构筑及中低温电化学性能

国家自然科学基金

0+阅读 · 2014年12月31日

基于智能在线虚拟参考反馈整定的控制方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

大丝束碳纤维短切机理及断口形貌控制

国家自然科学基金

0+阅读 · 2012年12月31日

钴合金亚纳米催化相控制生长亚纳米直径的单壁碳纳米管

国家自然科学基金

0+阅读 · 2012年12月31日

树形表面活性剂/石墨烯纳米复合材料的组装及生物大分子检测

国家自然科学基金

0+阅读 · 2012年12月31日

纳米无机杂化发光材料的分子设计与控制制备

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员