MURAL: 多种语言的多模式、多任务检索 (MURAL: Multimodal, Multitask Retrieval Across Languages) - 专知论文

会员服务 ·

0

Performer · 多峰值 · 学成 · 样例 · 查全率/召回率 ·

2021 年 9 月 10 日

MURAL: Multimodal, Multitask Retrieval Across Languages

翻译：MURAL: 多种语言的多模式、多任务检索

Aashi Jain,Mandy Guo,Krishna Srinivasan,Ting Chen,Sneha Kudugunta,Chao Jia,Yinfei Yang,Jason Baldridge

Both image-caption pairs and translation pairs provide the means to learn deep representations of and connections between languages. We use both types of pairs in MURAL (MUltimodal, MUltitask Representations Across Languages), a dual encoder that solves two tasks: 1) image-text matching and 2) translation pair matching. By incorporating billions of translation pairs, MURAL extends ALIGN (Jia et al. PMLR'21)--a state-of-the-art dual encoder learned from 1.8 billion noisy image-text pairs. When using the same encoders, MURAL's performance matches or exceeds ALIGN's cross-modal retrieval performance on well-resourced languages across several datasets. More importantly, it considerably improves performance on under-resourced languages, showing that text-text learning can overcome a paucity of image-caption examples for these languages. On the Wikipedia Image-Text dataset, for example, MURAL-base improves zero-shot mean recall by 8.1% on average for eight under-resourced languages and by 6.8% on average when fine-tuning. We additionally show that MURAL's text representations cluster not only with respect to genealogical connections but also based on areal linguistics, such as the Balkan Sprachbund.

翻译：图像插图配对和翻译配对都提供了学习语言之间深层表达和连接的手段。我们使用MURAL( Multimodal, Multitask Productions over Lebes) 两种配对的两种配对( MURAL, Multitask Guides ), 这两类配对可以解决两个任务:(1) 图像文本匹配和 2 翻译配对。 MURAL 包含数十亿对翻译配对, 扩展了 ALIGIN( Jia et al. PMLR'21) - 从18亿个噪音图像- 文本配对中学习的最先进的双倍编码。当使用相同的编码器时, MURAL的性能匹配或超过 ALIGINT在多个数据集资源充足的语言上的跨模式检索性能。更重要的是, 它大大改进了资源不足的语言的性能, 表明文本学习可以克服这些语言的缺乏性能实例。例如, MURAL- Text数据集平均将8种资源不足的语言的性能提高0.1%, 而平均只有6. 。我们在Balalimalimal 上显示MAL 的图像显示, 也显示MAL- sqalmalbormas

0

相关内容

Performer

【ICCV2021】模态视频表示的跨模态对比学习

专知会员服务

16+阅读 · 2021年10月4日

【KDD2021】检索交互机的表格数据预测

专知会员服务

16+阅读 · 2021年8月13日

【Facebook-Ishan Mishra】计算机视觉自监督学习，92页ppt

专知会员服务

36+阅读 · 2021年7月7日

【干货书】实体搜索，Entity-Oriented Search，358页pdf

【干货书】实体搜索，Entity-Oriented Search，358页pdf

专知会员服务

35+阅读 · 2021年4月9日

【AAAI2021】以事件为中心的自然语言理解，256页ppt

【AAAI2021】以事件为中心的自然语言理解，256页ppt

专知会员服务

74+阅读 · 2021年2月8日

【IJCAI2020南大】上下文在神经机器翻译中的充分利用

【IJCAI2020南大】上下文在神经机器翻译中的充分利用

专知会员服务

16+阅读 · 2020年8月17日

【SIGIR2020】学习搜索查询的颜色表示，Learning Colour Representations of Search Queries

【SIGIR2020】学习搜索查询的颜色表示，Learning Colour Representations of Search Queries

专知会员服务

17+阅读 · 2020年6月18日

从多个自我监督任务中学习问题无关的语音表示，Learning Problem-agnostic Speech Representations from Multiple Self-supervised Tasks

从多个自我监督任务中学习问题无关的语音表示，Learning Problem-agnostic Speech Representations from Multiple Self-supervised Tasks

专知会员服务

17+阅读 · 2020年5月6日

【NLP模型的跨语言/跨领域迁移】《Transferring NLP models across languages and domains》

【NLP模型的跨语言/跨领域迁移】《Transferring NLP models across languages and domains》

专知会员服务

43+阅读 · 2019年11月25日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

已删除

将门创投

5+阅读 · 2019年10月29日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

SIGIR2019 接收论文列表

SIGIR2019 接收论文列表

专知

18+阅读 · 2019年4月20日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

论文浅尝 | EARL: Joint Entity and Relation Linking for QA over KG

论文浅尝 | EARL: Joint Entity and Relation Linking for QA over KG

开放知识图谱

6+阅读 · 2018年10月30日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【推荐】自然语言处理（NLP）指南

【推荐】自然语言处理（NLP）指南

机器学习研究会

35+阅读 · 2017年11月17日

Improving Candidate Generation for Low-resource Cross-lingual Entity Linking

Arxiv

8+阅读 · 2020年3月3日

Jointly Learning Entity and Relation Representations for Entity Alignment

Arxiv

3+阅读 · 2019年9月20日

Multi-Task Deep Neural Networks for Natural Language Understanding

Multi-Task Deep Neural Networks for Natural Language Understanding

Arxiv

3+阅读 · 2019年1月31日

Multi-class Classification without Multi-class Labels

Multi-class Classification without Multi-class Labels

Arxiv

4+阅读 · 2019年1月2日

Multi-Task Neural Models for Translating Between Styles Within and Across Languages

Arxiv

4+阅读 · 2018年6月12日

Mixing Context Granularities for Improved Entity Linking on Question Answering Data across Entity Categories

Arxiv

3+阅读 · 2018年4月23日

3D Pose Estimation and 3D Model Retrieval for Objects in the Wild

Arxiv

7+阅读 · 2018年3月30日

Stacked Cross Attention for Image-Text Matching

Arxiv

3+阅读 · 2018年3月21日

Self-Attention with Relative Position Representations

Arxiv

14+阅读 · 2018年3月6日

Cross-lingual Entity Alignment via Joint Attribute-Preserving Embedding

Arxiv

3+阅读 · 2017年9月26日

VIP会员

文章信息

相关主题

查全率/召回率

相关VIP内容

【ICCV2021】模态视频表示的跨模态对比学习

专知会员服务

16+阅读 · 2021年10月4日

【KDD2021】检索交互机的表格数据预测

专知会员服务

16+阅读 · 2021年8月13日

【Facebook-Ishan Mishra】计算机视觉自监督学习，92页ppt

专知会员服务

36+阅读 · 2021年7月7日

【干货书】实体搜索，Entity-Oriented Search，358页pdf

【干货书】实体搜索，Entity-Oriented Search，358页pdf

专知会员服务

35+阅读 · 2021年4月9日

【AAAI2021】以事件为中心的自然语言理解，256页ppt

【AAAI2021】以事件为中心的自然语言理解，256页ppt

专知会员服务

74+阅读 · 2021年2月8日

【IJCAI2020南大】上下文在神经机器翻译中的充分利用

【IJCAI2020南大】上下文在神经机器翻译中的充分利用

专知会员服务

16+阅读 · 2020年8月17日

【SIGIR2020】学习搜索查询的颜色表示，Learning Colour Representations of Search Queries

【SIGIR2020】学习搜索查询的颜色表示，Learning Colour Representations of Search Queries

专知会员服务

17+阅读 · 2020年6月18日

从多个自我监督任务中学习问题无关的语音表示，Learning Problem-agnostic Speech Representations from Multiple Self-supervised Tasks

从多个自我监督任务中学习问题无关的语音表示，Learning Problem-agnostic Speech Representations from Multiple Self-supervised Tasks

专知会员服务

17+阅读 · 2020年5月6日

【NLP模型的跨语言/跨领域迁移】《Transferring NLP models across languages and domains》

【NLP模型的跨语言/跨领域迁移】《Transferring NLP models across languages and domains》

专知会员服务

43+阅读 · 2019年11月25日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

《复杂工程系统模型驱动设计决策支持系统：早期设计阶段挑战》最新138页

《日本陆上自卫队2040年作战方式与未来作战研究》最新23页slides

人工智能作为战争武器

《后勤保障》最新23页

相关资讯

已删除

将门创投

5+阅读 · 2019年10月29日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

SIGIR2019 接收论文列表

SIGIR2019 接收论文列表

专知

18+阅读 · 2019年4月20日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

论文浅尝 | EARL: Joint Entity and Relation Linking for QA over KG

论文浅尝 | EARL: Joint Entity and Relation Linking for QA over KG

开放知识图谱

6+阅读 · 2018年10月30日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【推荐】自然语言处理（NLP）指南

【推荐】自然语言处理（NLP）指南

机器学习研究会

35+阅读 · 2017年11月17日

相关论文

Improving Candidate Generation for Low-resource Cross-lingual Entity Linking

Arxiv

8+阅读 · 2020年3月3日

Jointly Learning Entity and Relation Representations for Entity Alignment

Arxiv

3+阅读 · 2019年9月20日

Multi-Task Deep Neural Networks for Natural Language Understanding

Multi-Task Deep Neural Networks for Natural Language Understanding

Arxiv

3+阅读 · 2019年1月31日

Multi-class Classification without Multi-class Labels

Multi-class Classification without Multi-class Labels

Arxiv

4+阅读 · 2019年1月2日

Multi-Task Neural Models for Translating Between Styles Within and Across Languages

Arxiv

4+阅读 · 2018年6月12日

Mixing Context Granularities for Improved Entity Linking on Question Answering Data across Entity Categories

Arxiv

3+阅读 · 2018年4月23日

3D Pose Estimation and 3D Model Retrieval for Objects in the Wild

Arxiv

7+阅读 · 2018年3月30日

Stacked Cross Attention for Image-Text Matching

Arxiv

3+阅读 · 2018年3月21日

Self-Attention with Relative Position Representations

Arxiv

14+阅读 · 2018年3月6日

Cross-lingual Entity Alignment via Joint Attribute-Preserving Embedding

Arxiv

3+阅读 · 2017年9月26日

微信扫码咨询专知VIP会员