I^3 Retriever: Incorporating Implicit Interaction in Pre-trained Language Models for Passage Retrieval - 专知论文

会员服务 ·

0

INTERACT · 语言模型化 · MoDELS · INFORMS · state-of-the-art ·

2023 年 6 月 4 日

I^3 Retriever: Incorporating Implicit Interaction in Pre-trained Language Models for Passage Retrieval

翻译：暂无翻译

Qian Dong,Yiding Liu,Qingyao Ai,Haitao Li,Shuaiqiang Wang,Yiqun Liu,Dawei Yin,Shaoping Ma

from arxiv, 10 pages

Passage retrieval is a fundamental task in many information systems, such as web search and question answering, where both efficiency and effectiveness are critical concerns. In recent years, neural retrievers based on pre-trained language models (PLM), such as dual-encoders, have achieved huge success. Yet, studies have found that the performance of dual-encoders are often limited due to the neglecting of the interaction information between queries and candidate passages. Therefore, various interaction paradigms have been proposed to improve the performance of vanilla dual-encoders. Particularly, recent state-of-the-art methods often introduce late-interaction during the model inference process. However, such late-interaction based methods usually bring extensive computation and storage cost on large corpus. Despite their effectiveness, the concern of efficiency and space footprint is still an important factor that limits the application of interaction-based neural retrieval models. To tackle this issue, we incorporate implicit interaction into dual-encoders, and propose I^3 retriever. In particular, our implicit interaction paradigm leverages generated pseudo-queries to simulate query-passage interaction, which jointly optimizes with query and passage encoders in an end-to-end manner. It can be fully pre-computed and cached, and its inference process only involves simple dot product operation of the query vector and passage vector, which makes it as efficient as the vanilla dual encoders. We conduct comprehensive experiments on MSMARCO and TREC2019 Deep Learning Datasets, demonstrating the I^3 retriever's superiority in terms of both effectiveness and efficiency. Moreover, the proposed implicit interaction is compatible with special pre-training and knowledge distillation for passage retrieval, which brings a new state-of-the-art performance.

翻译：暂无翻译

0

相关内容

INTERACT

IFIP TC13 Conference on Human-Computer Interaction是人机交互领域的研究者和实践者展示其工作的重要平台。多年来，这些会议吸引了来自几个国家和文化的研究人员。官网链接：http://interact2019.org/

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

内质网应激IRE1－XBP1S通路在高糖引起肾脏及系膜细胞发生氧化应激及损伤中的机制研究

国家自然科学基金

1+阅读 · 2014年12月31日

麻黄-杏仁药对减轻气道损伤的配伍机理研究

国家自然科学基金

1+阅读 · 2014年12月31日

Al-In-X(X=Er,Zn)体系相图、相结构及体系富铝合金电化学行为研究

国家自然科学基金

0+阅读 · 2013年12月31日

福氏志贺氏菌蛋白质相互作用组的预测与分析

国家自然科学基金

1+阅读 · 2012年12月31日

石墨烯量子波导结构中的电子输运性质

国家自然科学基金

0+阅读 · 2011年12月31日

Neural-based Cross-modal Search and Retrieval of Artwork

Arxiv

0+阅读 · 2023年7月26日

Federated Split Learning with Only Positive Labels for resource-constrained IoT environment

Arxiv

0+阅读 · 2023年7月25日

A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models

A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models

Arxiv

8+阅读 · 2023年7月24日

Deep Image Retrieval: A Survey

Arxiv

16+阅读 · 2021年1月27日

PROP: Pre-training with Representative Words Prediction for Ad-hoc Retrieval

Arxiv

11+阅读 · 2020年10月20日

VIP会员

文章信息

相关主题

语言模型化

state-of-the-art

相关VIP内容

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【书籍】从零开始构建文本生成图像生成器：基于 Transformers 与扩散模型

人工智能与未来指挥

【伯克利博士论文】将大语言模型绑定至虚拟人格：实现人类行为模拟

稀疏自编码器综述：解释大语言模型的内部机制

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Neural-based Cross-modal Search and Retrieval of Artwork

Arxiv

0+阅读 · 2023年7月26日

Federated Split Learning with Only Positive Labels for resource-constrained IoT environment

Arxiv

0+阅读 · 2023年7月25日

A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models

A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models

Arxiv

8+阅读 · 2023年7月24日

Deep Image Retrieval: A Survey

Arxiv

16+阅读 · 2021年1月27日

PROP: Pre-training with Representative Words Prediction for Ad-hoc Retrieval

Arxiv

11+阅读 · 2020年10月20日

相关基金

内质网应激IRE1－XBP1S通路在高糖引起肾脏及系膜细胞发生氧化应激及损伤中的机制研究

国家自然科学基金

1+阅读 · 2014年12月31日

麻黄-杏仁药对减轻气道损伤的配伍机理研究

国家自然科学基金

1+阅读 · 2014年12月31日

Al-In-X(X=Er,Zn)体系相图、相结构及体系富铝合金电化学行为研究

国家自然科学基金

0+阅读 · 2013年12月31日

福氏志贺氏菌蛋白质相互作用组的预测与分析

国家自然科学基金

1+阅读 · 2012年12月31日

石墨烯量子波导结构中的电子输运性质

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员