缩小 " 扩大 " 密集短语检索培训-参考差距 " (Bridging the Training-Inference Gap for Dense Phrase Retrieval) - 专知论文

会员服务 ·

0

推断 · 模型评估 · 可约的 · MoDELS · contrastive ·

2022 年 10 月 25 日

Bridging the Training-Inference Gap for Dense Phrase Retrieval

翻译：缩小 " 扩大 " 密集短语检索培训-参考差距 "

Gyuwan Kim,Jinhyuk Lee,Barlas Oguz,Wenhan Xiong,Yizhe Zhang,Yashar Mehdad,William Yang Wang

from arxiv, Findings of EMNLP 2022; 12 pages, 3 figures

Building dense retrievers requires a series of standard procedures, including training and validating neural models and creating indexes for efficient search. However, these procedures are often misaligned in that training objectives do not exactly reflect the retrieval scenario at inference time. In this paper, we explore how the gap between training and inference in dense retrieval can be reduced, focusing on dense phrase retrieval (Lee et al., 2021) where billions of representations are indexed at inference. Since validating every dense retriever with a large-scale index is practically infeasible, we propose an efficient way of validating dense retrievers using a small subset of the entire corpus. This allows us to validate various training strategies including unifying contrastive loss terms and using hard negatives for phrase retrieval, which largely reduces the training-inference discrepancy. As a result, we improve top-1 phrase retrieval accuracy by 2~3 points and top-20 passage retrieval accuracy by 2~4 points for open-domain question answering. Our work urges modeling dense retrievers with careful consideration of training and inference via efficient validation while advancing phrase retrieval as a general solution for dense retrieval.

翻译：建筑密集的检索器需要一系列标准程序,包括培训和验证神经模型,并为高效搜索创建索引。然而,这些程序往往不正确,因为培训目标并不确切反映推断时间的检索情景。在本文中,我们探讨如何缩小密集检索中的培训与推断之间的差距,重点是密集的短语检索(Lee等人,2021年),其中数十亿个表达方式在推断时进行了索引化。由于用大型指数验证每个密度的检索器,实际上不可行,我们建议了一种有效的方法,用整个体中的一小部分来验证密集检索器。这使我们能够验证各种培训战略,包括统一对比损失术语,以及用硬负值来计算短语检索,这在很大程度上缩小了培训-推断的差异。结果,我们用2~3个点和上20个通道的检索精度提高2~4个点,用于开放式问题回答。我们的工作敦促对密集检索器进行模型化,仔细考虑培训,并通过高效率的验证,同时推进短语检索作为密集检索的一般解决办法。

0

相关内容

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

“CVPR 2021 接受论文列表 1663篇论文都在这了

专知会员服务

32+阅读 · 2021年6月12日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Decorin对急性缺血性卒中后血脑屏障中ZO-1蛋白的作用及机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

空间插值的微分几何方法研究

国家自然科学基金

0+阅读 · 2014年12月31日

γ-Synuclein调控MAPK-ERK-JNK信号通路及细胞周期促进子宫内膜癌恶性进展的机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

磁各向异性纳米结构的尺寸、微结构与磁转变温度内在关联机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

泛素化在棉花曲叶病毒侵染中的作用

国家自然科学基金

0+阅读 · 2014年12月31日

人血管内皮细胞登革病毒候选受体- - 55 kDa蛋白的鉴定

国家自然科学基金

0+阅读 · 2012年12月31日

基于pQDs-CD133mAb的双模态探针对胶质瘤CD133+干细胞靶向成像的实验研究

国家自然科学基金

0+阅读 · 2011年12月31日

调控家蚕发育非编码RNA（non-coding RNA, ncRNA）的功能解析

国家自然科学基金

0+阅读 · 2011年12月31日

MBL对抗原提呈细胞早期分化的调节作用及其机制

国家自然科学基金

0+阅读 · 2009年12月31日

新发现的登革病毒亚基因组非编码RNA的生物学功能研究

国家自然科学基金

0+阅读 · 2009年12月31日

In Defense of Cross-Encoders for Zero-Shot Retrieval

In Defense of Cross-Encoders for Zero-Shot Retrieval

Arxiv

0+阅读 · 2022年12月12日

Information retrieval in single cell chromatin analysis using TF-IDF transformation methods

Arxiv

0+阅读 · 2022年12月10日

On the Ability of Graph Neural Networks to Model Interactions Between Vertices

Arxiv

0+阅读 · 2022年12月9日

VindLU: A Recipe for Effective Video-and-Language Pretraining

Arxiv

0+阅读 · 2022年12月9日

Multi-Label Logo Recognition and Retrieval based on Weighted Fusion of Neural Features

Arxiv

0+阅读 · 2022年12月9日

MobilePTX: Sparse Coding for Pneumothorax Detection Given Limited Training Examples

Arxiv

0+阅读 · 2022年12月8日

Pre-training Methods in Information Retrieval

Arxiv

16+阅读 · 2021年11月27日

Efficient Deep Learning: A Survey on Making Deep Learning Models Smaller, Faster, and Better

Arxiv

27+阅读 · 2021年6月16日

Less is More: ClipBERT for Video-and-Language Learning via Sparse Sampling

Arxiv

10+阅读 · 2021年2月11日

UniLMv2: Pseudo-Masked Language Models for Unified Language Model Pre-Training

Arxiv

15+阅读 · 2020年2月28日

VIP会员

文章信息

相关主题

相关VIP内容

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

“CVPR 2021 接受论文列表 1663篇论文都在这了

专知会员服务

32+阅读 · 2021年6月12日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《巡飞弹药（爆炸性无人机）威胁态势分析》最新24页报告

《军用后勤无人机：破解战场运输挑战的创新方案》

人工智能战争：以色列、伊朗与新型AI战争形态

《俄乌战争：现代战争未来的启示与经验》

相关资讯

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

相关论文

In Defense of Cross-Encoders for Zero-Shot Retrieval

In Defense of Cross-Encoders for Zero-Shot Retrieval

Arxiv

0+阅读 · 2022年12月12日

Information retrieval in single cell chromatin analysis using TF-IDF transformation methods

Arxiv

0+阅读 · 2022年12月10日

On the Ability of Graph Neural Networks to Model Interactions Between Vertices

Arxiv

0+阅读 · 2022年12月9日

VindLU: A Recipe for Effective Video-and-Language Pretraining

Arxiv

0+阅读 · 2022年12月9日

Multi-Label Logo Recognition and Retrieval based on Weighted Fusion of Neural Features

Arxiv

0+阅读 · 2022年12月9日

MobilePTX: Sparse Coding for Pneumothorax Detection Given Limited Training Examples

Arxiv

0+阅读 · 2022年12月8日

Pre-training Methods in Information Retrieval

Arxiv

16+阅读 · 2021年11月27日

Efficient Deep Learning: A Survey on Making Deep Learning Models Smaller, Faster, and Better

Arxiv

27+阅读 · 2021年6月16日

Less is More: ClipBERT for Video-and-Language Learning via Sparse Sampling

Arxiv

10+阅读 · 2021年2月11日

UniLMv2: Pseudo-Masked Language Models for Unified Language Model Pre-Training

Arxiv

15+阅读 · 2020年2月28日

相关基金

Decorin对急性缺血性卒中后血脑屏障中ZO-1蛋白的作用及机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

空间插值的微分几何方法研究

国家自然科学基金

0+阅读 · 2014年12月31日

γ-Synuclein调控MAPK-ERK-JNK信号通路及细胞周期促进子宫内膜癌恶性进展的机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

磁各向异性纳米结构的尺寸、微结构与磁转变温度内在关联机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

泛素化在棉花曲叶病毒侵染中的作用

国家自然科学基金

0+阅读 · 2014年12月31日

人血管内皮细胞登革病毒候选受体- - 55 kDa蛋白的鉴定

国家自然科学基金

0+阅读 · 2012年12月31日

基于pQDs-CD133mAb的双模态探针对胶质瘤CD133+干细胞靶向成像的实验研究

国家自然科学基金

0+阅读 · 2011年12月31日

调控家蚕发育非编码RNA（non-coding RNA, ncRNA）的功能解析

国家自然科学基金

0+阅读 · 2011年12月31日

MBL对抗原提呈细胞早期分化的调节作用及其机制

国家自然科学基金

0+阅读 · 2009年12月31日

新发现的登革病毒亚基因组非编码RNA的生物学功能研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员