ERNIE-UniX2: A Unified Cross-lingual Cross-modal Framework for Understanding and Generation

November 9, 2022

Bin Shan, Yaqian Han, Weichong Yin, Shuohuan Wang, Yu Sun, Hao Tian, Hua Wu, Haifeng Wang

From arXiv; 13 pages, 2 figures

Recent cross-lingual cross-modal work extends Vision-Language Pre-training (VLP) models to non-English inputs and achieves impressive performance. However, these models use an encoder-only architecture and therefore focus only on understanding tasks. In this paper, we propose ERNIE-UniX2, a unified cross-lingual cross-modal pre-training framework for both generation and understanding tasks. ERNIE-UniX2 integrates multiple pre-training paradigms (e.g., contrastive learning and language modeling) on an encoder-decoder architecture and attempts to learn a better joint representation across languages and modalities. Furthermore, ERNIE-UniX2 can be seamlessly fine-tuned for a variety of generation and understanding downstream tasks. Pre-trained on both multilingual text-only and image-text datasets, ERNIE-UniX2 achieves state-of-the-art (SOTA) results on various cross-lingual cross-modal generation and understanding tasks, such as multimodal machine translation and multilingual visual question answering.
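The abstract names contrastive learning and language modeling as two of the integrated pre-training paradigms but gives no formulas. As a purely illustrative sketch, and not the authors' implementation, the snippet below shows one common way such objectives are combined: a symmetric InfoNCE-style contrastive loss over paired image and text embeddings, added to a token-level cross-entropy language-modeling loss. Every function name, the temperature of 0.07, and the `lm_weight` parameter are assumptions made for this example.

```python
import math


def log_softmax(row):
    """Numerically stable log-softmax over a list of floats."""
    m = max(row)
    log_z = m + math.log(sum(math.exp(x - m) for x in row))
    return [x - log_z for x in row]


def l2_normalize(vec):
    """Scale a vector to unit length (zero vectors are left unchanged)."""
    norm = math.sqrt(sum(x * x for x in vec)) or 1.0
    return [x / norm for x in vec]


def contrastive_loss(image_embs, text_embs, temperature=0.07):
    """Symmetric InfoNCE loss; the i-th image and i-th text form the positive pair.

    The temperature value is an assumption, not taken from the paper.
    """
    imgs = [l2_normalize(v) for v in image_embs]
    txts = [l2_normalize(v) for v in text_embs]
    n = len(imgs)
    # Cosine similarity of every image with every text, scaled by temperature.
    sims = [[sum(a * b for a, b in zip(i, t)) / temperature for t in txts]
            for i in imgs]
    # Image-to-text direction: each row should peak on the diagonal.
    i2t = -sum(log_softmax(sims[k])[k] for k in range(n)) / n
    # Text-to-image direction: same idea over the columns.
    cols = [[sims[r][c] for r in range(n)] for c in range(n)]
    t2i = -sum(log_softmax(cols[k])[k] for k in range(n)) / n
    return (i2t + t2i) / 2


def lm_loss(token_logits, target_ids):
    """Mean cross-entropy of the target token at each decoding position."""
    total = sum(log_softmax(logits)[t] for logits, t in zip(token_logits, target_ids))
    return -total / len(target_ids)


def joint_loss(image_embs, text_embs, token_logits, target_ids, lm_weight=1.0):
    """Hypothetical combined objective: contrastive term plus weighted LM term."""
    return (contrastive_loss(image_embs, text_embs)
            + lm_weight * lm_loss(token_logits, target_ids))
```

In this sketch the contrastive term is small when matching image and text embeddings point in the same direction, and the language-modeling term is small when the decoder assigns high probability to the reference tokens. How ERNIE-UniX2 actually weights or schedules its objectives is not stated in the abstract.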
