P$3美元评分:通过速学和预调,缩小培训前和评分的微调之间的差距 (P$^3$ Ranker: Mitigating the Gaps between Pre-training and Ranking Fine-tuning with Prompt-based Learning and Pre-finetuning) - 专知论文

会员服务 ·

0

秩 · 知识 (knowledge) · 学成 · 可辨认的 · MoDELS ·

2022 年 5 月 4 日

P$^3$ Ranker: Mitigating the Gaps between Pre-training and Ranking Fine-tuning with Prompt-based Learning and Pre-finetuning

翻译：P$3美元评分:通过速学和预调,缩小培训前和评分的微调之间的差距

Xiaomeng Hu,Shi Yu,Chenyan Xiong,Zhenghao Liu,Zhiyuan Liu,Ge Yu

from arxiv, Accepted by SIGIR 2022

Compared to other language tasks, applying pre-trained language models (PLMs) for search ranking often requires more nuances and training signals. In this paper, we identify and study the two mismatches between pre-training and ranking fine-tuning: the training schema gap regarding the differences in training objectives and model architectures, and the task knowledge gap considering the discrepancy between the knowledge needed in ranking and that learned during pre-training. To mitigate these gaps, we propose Pre-trained, Prompt-learned and Pre-finetuned Neural Ranker (P$^3$ Ranker). P$^3$ Ranker leverages prompt-based learning to convert the ranking task into a pre-training like schema and uses pre-finetuning to initialize the model on intermediate supervised tasks. Experiments on MS MARCO and Robust04 show the superior performances of P$^3$ Ranker in few-shot ranking. Analyses reveal that P$^3$ Ranker is able to better accustom to the ranking task through prompt-based learning and retrieve necessary ranking-oriented knowledge gleaned in pre-finetuning, resulting in data-efficient PLM adaptation. Our code is available at \url{https://github.com/NEUIR/P3Ranker}.

翻译：与其他语言任务相比,应用预先培训的语言模型(PLM)进行搜索排名往往需要更多的细微差别和培训信号。在本文件中,我们确定并研究培训前和排名微调之间的两种不匹配之处:培训目标和模式架构差异的培训计划差距,以及考虑到排名所需知识与培训前知识之间的差异的任务知识差距。为了缩小这些差距,我们提议采用预先培训、迅速学习和事先调整的Neuror Ranger(P$3$ Ranger) 。P$3CRer利用快速学习的杠杆,将排序任务转换成预培训前任务,如Schema,并使用预调整来启动中期监督任务模式。关于MS MARCO和Robust04的实验显示了低调P$3的优异性表现。分析显示,PN3$PNCER能够通过快速学习和检索在PIurth3/NKER校前调整中必要的排序导向知识,从而在数据效率/PLMRMRQ}我们的数据/PLAWADRADRDR/CRQ 中可以使用的数据节调制。

0

相关内容

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

征稿 | CFP：Special Issue of NLP and KG(JCR Q2，IF2.67)

征稿 | CFP：Special Issue of NLP and KG(JCR Q2，IF2.67)

开放知识图谱

1+阅读 · 2022年4月4日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【论文推荐】最新五篇命名实体识别相关论文—深度主动学习、Lattice LSTM、混合马尔可夫CRF

【论文推荐】最新五篇命名实体识别相关论文—深度主动学习、Lattice LSTM、混合马尔可夫CRF

专知

26+阅读 · 2018年5月22日

茶树酚类物质生物合成分支途径调控的分子机理

国家自然科学基金

0+阅读 · 2014年12月31日

单链DNA结合蛋白WHIRLY1转录及表观遗传调控植物衰老和细胞死亡的研究

国家自然科学基金

0+阅读 · 2014年12月31日

长链非编码RNA lncLCSC调控肝癌干细胞自我更新的分子机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于微结构演化的金属材料塑性本构模型研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于博弈激励的分布式自适应异构无线网络拓扑控制

国家自然科学基金

0+阅读 · 2012年12月31日

中红外波段石墨烯的可饱和吸收特性研究

国家自然科学基金

0+阅读 · 2012年12月31日

脊髓损伤后HMGB1蛋白激活炎症因子的释放及作用机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

覆盖曲面理论、随机级数及算子不等式的若干研究

国家自然科学基金

0+阅读 · 2011年12月31日

稀土RE-Fe-Cr三元系相图及其化合物吸波性能研究

国家自然科学基金

0+阅读 · 2009年12月31日

植物胁迫"印记"形成的分子机制及脱落酸和水杨酸的调控作用

国家自然科学基金

0+阅读 · 2008年12月31日

Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data

Arxiv

0+阅读 · 2022年6月20日

Language with Vision: a Study on Grounded Word and Sentence Embeddings

Language with Vision: a Study on Grounded Word and Sentence Embeddings

Arxiv

0+阅读 · 2022年6月17日

Bridge-Tower: Building Bridges Between Encoders in Vision-Language Representation Learning

Arxiv

0+阅读 · 2022年6月17日

Prompt Distribution Learning

Arxiv

14+阅读 · 2022年5月6日

A continual learning survey: Defying forgetting in classification tasks

Arxiv

32+阅读 · 2021年4月16日

Dense Contrastive Learning for Self-Supervised Visual Pre-Training

Arxiv

18+阅读 · 2021年4月4日

Making Pre-trained Language Models Better Few-shot Learners

Arxiv

14+阅读 · 2020年12月31日

PROP: Pre-training with Representative Words Prediction for Ad-hoc Retrieval

Arxiv

11+阅读 · 2020年10月20日

Pre-training Text Representations as Meta Learning

Arxiv

13+阅读 · 2020年4月12日

Approaches for Enriching and Improving Textual Knowledge Bases

Arxiv

15+阅读 · 2018年4月20日

VIP会员

文章信息

相关主题

知识 (knowledge)

相关VIP内容

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《战区安全决策课程体系》最新244页

《"无人机航母"原型平台》

任务规划与地形分析：现代复杂环境作战导航体系

《攻击场景描述形式化模型研究》

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

征稿 | CFP：Special Issue of NLP and KG(JCR Q2，IF2.67)

征稿 | CFP：Special Issue of NLP and KG(JCR Q2，IF2.67)

开放知识图谱

1+阅读 · 2022年4月4日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【论文推荐】最新五篇命名实体识别相关论文—深度主动学习、Lattice LSTM、混合马尔可夫CRF

【论文推荐】最新五篇命名实体识别相关论文—深度主动学习、Lattice LSTM、混合马尔可夫CRF

专知

26+阅读 · 2018年5月22日

相关论文

Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data

Arxiv

0+阅读 · 2022年6月20日

Language with Vision: a Study on Grounded Word and Sentence Embeddings

Language with Vision: a Study on Grounded Word and Sentence Embeddings

Arxiv

0+阅读 · 2022年6月17日

Bridge-Tower: Building Bridges Between Encoders in Vision-Language Representation Learning

Arxiv

0+阅读 · 2022年6月17日

Prompt Distribution Learning

Arxiv

14+阅读 · 2022年5月6日

A continual learning survey: Defying forgetting in classification tasks

Arxiv

32+阅读 · 2021年4月16日

Dense Contrastive Learning for Self-Supervised Visual Pre-Training

Arxiv

18+阅读 · 2021年4月4日

Making Pre-trained Language Models Better Few-shot Learners

Arxiv

14+阅读 · 2020年12月31日

PROP: Pre-training with Representative Words Prediction for Ad-hoc Retrieval

Arxiv

11+阅读 · 2020年10月20日

Pre-training Text Representations as Meta Learning

Arxiv

13+阅读 · 2020年4月12日

Approaches for Enriching and Improving Textual Knowledge Bases

Arxiv

15+阅读 · 2018年4月20日

相关基金

茶树酚类物质生物合成分支途径调控的分子机理

国家自然科学基金

0+阅读 · 2014年12月31日

单链DNA结合蛋白WHIRLY1转录及表观遗传调控植物衰老和细胞死亡的研究

国家自然科学基金

0+阅读 · 2014年12月31日

长链非编码RNA lncLCSC调控肝癌干细胞自我更新的分子机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于微结构演化的金属材料塑性本构模型研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于博弈激励的分布式自适应异构无线网络拓扑控制

国家自然科学基金

0+阅读 · 2012年12月31日

中红外波段石墨烯的可饱和吸收特性研究

国家自然科学基金

0+阅读 · 2012年12月31日

脊髓损伤后HMGB1蛋白激活炎症因子的释放及作用机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

覆盖曲面理论、随机级数及算子不等式的若干研究

国家自然科学基金

0+阅读 · 2011年12月31日

稀土RE-Fe-Cr三元系相图及其化合物吸波性能研究

国家自然科学基金

0+阅读 · 2009年12月31日

植物胁迫"印记"形成的分子机制及脱落酸和水杨酸的调控作用

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员