限制密度匹配和建模,以多种语文统一上上上上下调代表情况 (Constrained Density Matching and Modeling for Cross-lingual Alignment of Contextualized Representations) - 专知论文

会员服务 ·

0

基于上下文的表示 · Performer · Subspace · motivation · 无监督 ·

2022 年 9 月 18 日

Constrained Density Matching and Modeling for Cross-lingual Alignment of Contextualized Representations

翻译：限制密度匹配和建模,以多种语文统一上上上上下调代表情况

Wei Zhao,Steffen Eger

from arxiv, ACML2022 Camera Ready

Multilingual representations pre-trained with monolingual data exhibit considerably unequal task performances across languages. Previous studies address this challenge with resource-intensive contextualized alignment, which assumes the availability of large parallel data, thereby leaving under-represented language communities behind. In this work, we attribute the data hungriness of previous alignment techniques to two limitations: (i) the inability to sufficiently leverage data and (ii) these techniques are not trained properly. To address these issues, we introduce supervised and unsupervised density-based approaches named Real-NVP and GAN-Real-NVP, driven by Normalizing Flow, to perform alignment, both dissecting the alignment of multilingual subspaces into density matching and density modeling. We complement these approaches with our validation criteria in order to guide the training process. Our experiments encompass 16 alignments, including our approaches, evaluated across 6 language pairs, synthetic data and 5 NLP tasks. We demonstrate the effectiveness of our approaches in the scenarios of limited and no parallel data. First, our supervised approach trained on 20k parallel data (sentences) mostly surpasses Joint-Align and InfoXLM trained on over 100k parallel sentences. Second, parallel data can be removed without sacrificing performance when integrating our unsupervised approach in our bootstrapping procedure, which is theoretically motivated to enforce equality of multilingual subspaces. Moreover, we demonstrate the advantages of validation criteria over validation data for guiding supervised training.

翻译：在这项工作中,我们把先前的调整技术中的数据缺乏归因于两个限制:(一) 无法充分利用数据,以及(二) 这些技术没有经过适当的培训。为了解决这些问题,我们采用了监督和不受监督的基于密度的方法,即Real-NVP和GAN-Real-NVP。首先,我们在正常化流程的驱动下,在20k平行数据(说明)的驱动下,采用监督和不受监督的方法,以进行统一,既要将多语言子空间的调整分解为密度匹配和密度模型,又要将这些方法与我们的验证标准相配合,以指导培训进程。我们的实验包括16个匹配方法,包括我们的方法,在6对语文、合成数据和5项NLP任务中加以评估。为了解决这些问题,我们展示了我们在有限和没有平行数据的情况下采用的方法的有效性。首先,我们经过监督的关于20k平行数据(说明)的处理方法,大多可以超越联合定位和InfoXLM的匹配方法,在100多语言性指导性测试的平行性数据中,在不牺牲我们具有可持续性的平行性测试性测试性数据时,第二个平行程序可以超越我们的平行数据。

0

相关内容

基于上下文的表示

基于上下文的表示

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

专知会员服务

244+阅读 · 2019年10月21日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

lincRNA-p21调控DNA甲基化介导颞叶内侧癫痫耐药的机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

ADS中检测快中子束的GEM探测器的研制

国家自然科学基金

0+阅读 · 2013年12月31日

量子点耦合系统的输运及耗散动力学

国家自然科学基金

0+阅读 · 2013年12月31日

低维受限胶体量子点/氧化锌管状复合光学微腔耦合效应及光谱调制效应研究

国家自然科学基金

0+阅读 · 2012年12月31日

缺锌胁迫下生长素和活性氧对玉米侧根生长发育的调控机制

国家自然科学基金

0+阅读 · 2012年12月31日

柽柳Dof转录因子的耐盐调控机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

实时安全关键系统的建模、仿真与验证

国家自然科学基金

1+阅读 · 2012年12月31日

大肠杆菌K1外膜蛋白A特异结构在其导致新生儿细菌性脑膜炎中的作用研究

国家自然科学基金

0+阅读 · 2012年12月31日

Witten Laplacian的特征值及与其相关的Ricci Soliton研究

国家自然科学基金

0+阅读 · 2012年12月31日

EB病毒介导细胞DNA应激沉默和端粒酶TCAB1转录激活的效应及信号调控

国家自然科学基金

0+阅读 · 2011年12月31日

Learning a Task-specific Descriptor for Robust Matching of 3D Point Clouds

Arxiv

0+阅读 · 2022年10月26日

Adapters for Enhanced Modeling of Multilingual Knowledge and Text

Arxiv

0+阅读 · 2022年10月26日

Semi-Supervised Learning Based on Reference Model for Low-resource TTS

Arxiv

0+阅读 · 2022年10月25日

Label-aware Multi-level Contrastive Learning for Cross-lingual Spoken Language Understanding

Arxiv

0+阅读 · 2022年10月25日

Geometric multimodal representation learning

Arxiv

69+阅读 · 2022年9月7日

A Survey of Learning on Small Data

Arxiv

19+阅读 · 2022年7月29日

A Simple Framework for Contrastive Learning of Visual Representations

Arxiv

21+阅读 · 2020年2月13日

Cross-lingual Knowledge Graph Alignment via Graph Matching Neural Network

Arxiv

15+阅读 · 2019年5月28日

Deep contextualized word representations

Arxiv

10+阅读 · 2018年3月22日

Matching Networks for One Shot Learning

Arxiv

10+阅读 · 2017年12月29日

VIP会员

文章信息

相关主题

基于上下文的表示

相关VIP内容

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

专知会员服务

244+阅读 · 2019年10月21日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

新书册《几何深度学习的数学基础》

中程单向攻击无人机的战略意义：俄乌战争启示

在无标注条件下适配视觉—语言模型：全面综述

面向视觉语言模型的持续学习：遗忘之外的综述与分类体系

相关资讯

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

相关论文

Learning a Task-specific Descriptor for Robust Matching of 3D Point Clouds

Arxiv

0+阅读 · 2022年10月26日

Adapters for Enhanced Modeling of Multilingual Knowledge and Text

Arxiv

0+阅读 · 2022年10月26日

Semi-Supervised Learning Based on Reference Model for Low-resource TTS

Arxiv

0+阅读 · 2022年10月25日

Label-aware Multi-level Contrastive Learning for Cross-lingual Spoken Language Understanding

Arxiv

0+阅读 · 2022年10月25日

Geometric multimodal representation learning

Arxiv

69+阅读 · 2022年9月7日

A Survey of Learning on Small Data

Arxiv

19+阅读 · 2022年7月29日

A Simple Framework for Contrastive Learning of Visual Representations

Arxiv

21+阅读 · 2020年2月13日

Cross-lingual Knowledge Graph Alignment via Graph Matching Neural Network

Arxiv

15+阅读 · 2019年5月28日

Deep contextualized word representations

Arxiv

10+阅读 · 2018年3月22日

Matching Networks for One Shot Learning

Arxiv

10+阅读 · 2017年12月29日

相关基金

lincRNA-p21调控DNA甲基化介导颞叶内侧癫痫耐药的机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

ADS中检测快中子束的GEM探测器的研制

国家自然科学基金

0+阅读 · 2013年12月31日

量子点耦合系统的输运及耗散动力学

国家自然科学基金

0+阅读 · 2013年12月31日

低维受限胶体量子点/氧化锌管状复合光学微腔耦合效应及光谱调制效应研究

国家自然科学基金

0+阅读 · 2012年12月31日

缺锌胁迫下生长素和活性氧对玉米侧根生长发育的调控机制

国家自然科学基金

0+阅读 · 2012年12月31日

柽柳Dof转录因子的耐盐调控机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

实时安全关键系统的建模、仿真与验证

国家自然科学基金

1+阅读 · 2012年12月31日

大肠杆菌K1外膜蛋白A特异结构在其导致新生儿细菌性脑膜炎中的作用研究

国家自然科学基金

0+阅读 · 2012年12月31日

Witten Laplacian的特征值及与其相关的Ricci Soliton研究

国家自然科学基金

0+阅读 · 2012年12月31日

EB病毒介导细胞DNA应激沉默和端粒酶TCAB1转录激活的效应及信号调控

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员