【Google AI-Yi Tay】Transformer记忆为可微搜索索引”(DSI) - 专知VIP

会员服务 ·

4

可微搜索索引 (DSI) · Google · Transformer · Yi Tay · 演讲 ·

2022 年 3 月 4 日

【Google AI-Yi Tay】Transformer记忆为可微搜索索引”(DSI)

专知会员服务

专知，提供专业可信的知识分发服务，让认知协作更快更好！

在这次演讲中，我将讨论谷歌AI的最新工作，即“可微搜索索引”(DSI)。DSI表明，信息检索可以通过一个Transformer来完成，其中关于语料库的所有信息都编码在模型的参数中。DSI是一个新的范例，它学习了文本到文本的模型，将字符串查询直接映射到相关的docids;换句话说，DSI模型只使用其参数直接回答查询，极大地简化了整个检索过程。我们研究文档及其标识符如何表示的变化，训练过程的变化，以及模型和语料库大小之间的相互作用。实验证明，给予适当的设计选择，DSI显著优于强大的基线，如双编码器模型。此外，DSI显示了强大的泛化能力，在零样本设置中优于BM25基线。

成为VIP会员查看完整内容

10

相关内容

可微搜索索引 (DSI)

可微搜索索引 (DSI)

Google最新《高效Transformers》2022综述大全，阐述九大类提升Transformers效率方式

Google最新《高效Transformers》2022综述大全，阐述九大类提升Transformers效率方式

专知会员服务

97+阅读 · 2022年3月18日

【Google】高效Transformer综述，Efficient Transformers: A Survey

【Google】高效Transformer综述，Efficient Transformers: A Survey

专知会员服务

66+阅读 · 2022年3月17日

【NeurIPS2021】神经解释器的动态推理

专知会员服务

15+阅读 · 2021年10月16日

【谷歌Kelvin Guu】语言模型可以是知识库吗？，46页ppt

专知会员服务

26+阅读 · 2021年10月12日

【Google】最新《高效Transformers》综述大全，Efficient Transformers: A Survey

【Google】最新《高效Transformers》综述大全，Efficient Transformers: A Survey

专知会员服务

113+阅读 · 2020年9月17日

【Google】多模态Transformer视频检索，Multi-modal Transformer

【Google】多模态Transformer视频检索，Multi-modal Transformer

专知会员服务

103+阅读 · 2020年7月22日

【Google ICLR2020论文】嵌入式大规模检索的预训练任务，Pre-training Tasks for Embedding-based Large-scale Retrieval

【Google ICLR2020论文】嵌入式大规模检索的预训练任务，Pre-training Tasks for Embedding-based Large-scale Retrieval

专知会员服务

28+阅读 · 2020年2月12日

【Google AI新论文】REALM:检索增强语言模型预训练，QA的SOTA提升4-16%准确性

【Google AI新论文】REALM:检索增强语言模型预训练，QA的SOTA提升4-16%准确性

专知会员服务

45+阅读 · 2020年2月12日

Google AI博客解读论文《Reformer: The Efficient Transformer》，百万量级注意力机制

Google AI博客解读论文《Reformer: The Efficient Transformer》，百万量级注意力机制

专知会员服务

70+阅读 · 2020年1月17日

【论文|Google】基于元学习的排序架构，Ranking architectures using meta-learning

【论文|Google】基于元学习的排序架构，Ranking architectures using meta-learning

专知会员服务

18+阅读 · 2019年11月30日

Google最新《高效Transformers》2022综述大全，39页pdf阐述九大类提升Transformers效率方式

Google最新《高效Transformers》2022综述大全，39页pdf阐述九大类提升Transformers效率方式

专知

0+阅读 · 2022年3月18日

单个Transformer完成信息检索，谷歌用可微搜索索引打败双编码器模型

单个Transformer完成信息检索，谷歌用可微搜索索引打败双编码器模型

机器之心

1+阅读 · 2022年3月4日

别再双塔了！谷歌提出DSI索引，检索效果吊打双塔，零样本超BM25！

别再双塔了！谷歌提出DSI索引，检索效果吊打双塔，零样本超BM25！

夕小瑶的卖萌屋

2+阅读 · 2022年2月21日

DeepMind一键三连，强推「地鼠」语言模型！只要2800亿参数就能刷SOTA

DeepMind一键三连，强推「地鼠」语言模型！只要2800亿参数就能刷SOTA

新智元

0+阅读 · 2021年12月9日

图像随便打乱，模型输入不靠「眼睛」看！Google华人一作：强化学习和人类有相同的感知能力

图像随便打乱，模型输入不靠「眼睛」看！Google华人一作：强化学习和人类有相同的感知能力

新智元

0+阅读 · 2021年12月8日

详解微软大规模稀疏模型 MEB：参数高达 1350 亿，可显著提升搜索相关性

详解微软大规模稀疏模型 MEB：参数高达 1350 亿，可显著提升搜索相关性

InfoQ

0+阅读 · 2021年11月26日

Google Research新成果，让表格理解和检索更上一层楼！

Google Research新成果，让表格理解和检索更上一层楼！

夕小瑶的卖萌屋

1+阅读 · 2021年9月28日

NLP研究索引神器，3000+代码库，一键查找论文、GitHub库

NLP研究索引神器，3000+代码库，一键查找论文、GitHub库

机器之心

0+阅读 · 2021年4月28日

Google-EfficientNet v2来了！更快，更小，更强！

Google-EfficientNet v2来了！更快，更小，更强！

专知

0+阅读 · 2021年4月4日

一文详解Google最新NLP模型XLNet

一文详解Google最新NLP模型XLNet

PaperWeekly

18+阅读 · 2019年7月1日

泛素交联酶UbcH7在DNA损伤应答过程中的功能和机制分析

国家自然科学基金

0+阅读 · 2015年12月31日

结构化多项式系统的三角化求解方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

轻度认知障碍患者语义记忆损伤对其情节记忆的影响：认知神经机制探索及干预研究

国家自然科学基金

0+阅读 · 2013年12月31日

SIRT3在硫化氢抗血管内皮氧化应激损伤中的作用机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

POMC神经元在回肠转位术改善非肥胖2型糖尿病中的作用机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

老化相关的alpha-突触核蛋白寡聚体积聚对海马神经元NMDA受体表达和功能的影响及其机制的研究

国家自然科学基金

0+阅读 · 2012年12月31日

补体调节CD8+ T细胞记忆的机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

实时双模态自动图像软标注与多关键词检索

国家自然科学基金

0+阅读 · 2009年12月31日

爆炸流场特征提取及其可视化软件开发

国家自然科学基金

0+阅读 · 2009年12月31日

机器学习中模型选择问题的研究及其在图像理解中的应用

国家自然科学基金

8+阅读 · 2008年12月31日

Transformer Feed-Forward Layers Build Predictions by Promoting Concepts in the Vocabulary Space

Arxiv

0+阅读 · 2022年4月20日

Polling Latent Opinions: A Method for Computational Sociolinguistics Using Transformer Language Models

Polling Latent Opinions: A Method for Computational Sociolinguistics Using Transformer Language Models

Arxiv

0+阅读 · 2022年4月19日

Value Retrieval with Arbitrary Queries for Form-like Documents

Arxiv

0+阅读 · 2022年4月15日

ML_LTU at SemEval-2022 Task 4: T5 Towards Identifying Patronizing and Condescending Language

ML_LTU at SemEval-2022 Task 4: T5 Towards Identifying Patronizing and Condescending Language

Arxiv

0+阅读 · 2022年4月15日

Training Entire-Space Models for Target-oriented Opinion Words Extraction

Arxiv

0+阅读 · 2022年4月15日

Pre-training Methods in Information Retrieval

Arxiv

1+阅读 · 2022年4月15日

How Different are Pre-trained Transformers for Text Ranking?

Arxiv

0+阅读 · 2022年4月5日

Efficient Transformers: A Survey

Arxiv

35+阅读 · 2022年3月14日

PROP: Pre-training with Representative Words Prediction for Ad-hoc Retrieval

Arxiv

11+阅读 · 2020年10月20日

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Arxiv

15+阅读 · 2018年10月11日

VIP会员

相关主题

可微搜索索引 (DSI)

相关VIP内容

Google最新《高效Transformers》2022综述大全，阐述九大类提升Transformers效率方式

Google最新《高效Transformers》2022综述大全，阐述九大类提升Transformers效率方式

专知会员服务

97+阅读 · 2022年3月18日

【Google】高效Transformer综述，Efficient Transformers: A Survey

【Google】高效Transformer综述，Efficient Transformers: A Survey

专知会员服务

66+阅读 · 2022年3月17日

【NeurIPS2021】神经解释器的动态推理

专知会员服务

15+阅读 · 2021年10月16日

【谷歌Kelvin Guu】语言模型可以是知识库吗？，46页ppt

专知会员服务

26+阅读 · 2021年10月12日

【Google】最新《高效Transformers》综述大全，Efficient Transformers: A Survey

【Google】最新《高效Transformers》综述大全，Efficient Transformers: A Survey

专知会员服务

113+阅读 · 2020年9月17日

【Google】多模态Transformer视频检索，Multi-modal Transformer

【Google】多模态Transformer视频检索，Multi-modal Transformer

专知会员服务

103+阅读 · 2020年7月22日

【Google ICLR2020论文】嵌入式大规模检索的预训练任务，Pre-training Tasks for Embedding-based Large-scale Retrieval

【Google ICLR2020论文】嵌入式大规模检索的预训练任务，Pre-training Tasks for Embedding-based Large-scale Retrieval

专知会员服务

28+阅读 · 2020年2月12日

【Google AI新论文】REALM:检索增强语言模型预训练，QA的SOTA提升4-16%准确性

【Google AI新论文】REALM:检索增强语言模型预训练，QA的SOTA提升4-16%准确性

专知会员服务

45+阅读 · 2020年2月12日

Google AI博客解读论文《Reformer: The Efficient Transformer》，百万量级注意力机制

Google AI博客解读论文《Reformer: The Efficient Transformer》，百万量级注意力机制

专知会员服务

70+阅读 · 2020年1月17日

【论文|Google】基于元学习的排序架构，Ranking architectures using meta-learning

【论文|Google】基于元学习的排序架构，Ranking architectures using meta-learning

专知会员服务

18+阅读 · 2019年11月30日

热门VIP内容

开通专知VIP会员享更多权益服务

《复杂工程系统模型驱动设计决策支持系统：早期设计阶段挑战》最新138页

《日本陆上自卫队2040年作战方式与未来作战研究》最新23页slides

人工智能作为战争武器

《后勤保障》最新23页

相关资讯

Google最新《高效Transformers》2022综述大全，39页pdf阐述九大类提升Transformers效率方式

Google最新《高效Transformers》2022综述大全，39页pdf阐述九大类提升Transformers效率方式

专知

0+阅读 · 2022年3月18日

单个Transformer完成信息检索，谷歌用可微搜索索引打败双编码器模型

单个Transformer完成信息检索，谷歌用可微搜索索引打败双编码器模型

机器之心

1+阅读 · 2022年3月4日

别再双塔了！谷歌提出DSI索引，检索效果吊打双塔，零样本超BM25！

别再双塔了！谷歌提出DSI索引，检索效果吊打双塔，零样本超BM25！

夕小瑶的卖萌屋

2+阅读 · 2022年2月21日

DeepMind一键三连，强推「地鼠」语言模型！只要2800亿参数就能刷SOTA

DeepMind一键三连，强推「地鼠」语言模型！只要2800亿参数就能刷SOTA

新智元

0+阅读 · 2021年12月9日

图像随便打乱，模型输入不靠「眼睛」看！Google华人一作：强化学习和人类有相同的感知能力

图像随便打乱，模型输入不靠「眼睛」看！Google华人一作：强化学习和人类有相同的感知能力

新智元

0+阅读 · 2021年12月8日

详解微软大规模稀疏模型 MEB：参数高达 1350 亿，可显著提升搜索相关性

详解微软大规模稀疏模型 MEB：参数高达 1350 亿，可显著提升搜索相关性

InfoQ

0+阅读 · 2021年11月26日

Google Research新成果，让表格理解和检索更上一层楼！

Google Research新成果，让表格理解和检索更上一层楼！

夕小瑶的卖萌屋

1+阅读 · 2021年9月28日

NLP研究索引神器，3000+代码库，一键查找论文、GitHub库

NLP研究索引神器，3000+代码库，一键查找论文、GitHub库

机器之心

0+阅读 · 2021年4月28日

Google-EfficientNet v2来了！更快，更小，更强！

Google-EfficientNet v2来了！更快，更小，更强！

专知

0+阅读 · 2021年4月4日

一文详解Google最新NLP模型XLNet

一文详解Google最新NLP模型XLNet

PaperWeekly

18+阅读 · 2019年7月1日

相关基金

泛素交联酶UbcH7在DNA损伤应答过程中的功能和机制分析

国家自然科学基金

0+阅读 · 2015年12月31日

结构化多项式系统的三角化求解方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

轻度认知障碍患者语义记忆损伤对其情节记忆的影响：认知神经机制探索及干预研究

国家自然科学基金

0+阅读 · 2013年12月31日

SIRT3在硫化氢抗血管内皮氧化应激损伤中的作用机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

POMC神经元在回肠转位术改善非肥胖2型糖尿病中的作用机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

老化相关的alpha-突触核蛋白寡聚体积聚对海马神经元NMDA受体表达和功能的影响及其机制的研究

国家自然科学基金

0+阅读 · 2012年12月31日

补体调节CD8+ T细胞记忆的机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

实时双模态自动图像软标注与多关键词检索

国家自然科学基金

0+阅读 · 2009年12月31日

爆炸流场特征提取及其可视化软件开发

国家自然科学基金

0+阅读 · 2009年12月31日

机器学习中模型选择问题的研究及其在图像理解中的应用

国家自然科学基金

8+阅读 · 2008年12月31日

相关论文

Transformer Feed-Forward Layers Build Predictions by Promoting Concepts in the Vocabulary Space

Arxiv

0+阅读 · 2022年4月20日

Polling Latent Opinions: A Method for Computational Sociolinguistics Using Transformer Language Models

Polling Latent Opinions: A Method for Computational Sociolinguistics Using Transformer Language Models

Arxiv

0+阅读 · 2022年4月19日

Value Retrieval with Arbitrary Queries for Form-like Documents

Arxiv

0+阅读 · 2022年4月15日

ML_LTU at SemEval-2022 Task 4: T5 Towards Identifying Patronizing and Condescending Language

ML_LTU at SemEval-2022 Task 4: T5 Towards Identifying Patronizing and Condescending Language

Arxiv

0+阅读 · 2022年4月15日

Training Entire-Space Models for Target-oriented Opinion Words Extraction

Arxiv

0+阅读 · 2022年4月15日

Pre-training Methods in Information Retrieval

Arxiv

1+阅读 · 2022年4月15日

How Different are Pre-trained Transformers for Text Ranking?

Arxiv

0+阅读 · 2022年4月5日

Efficient Transformers: A Survey

Arxiv

35+阅读 · 2022年3月14日

PROP: Pre-training with Representative Words Prediction for Ad-hoc Retrieval

Arxiv

11+阅读 · 2020年10月20日

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Arxiv

15+阅读 · 2018年10月11日

微信扫码咨询专知VIP会员