Adapter modules were recently introduced as an efficient alternative to fine-tuning in NLP. Adapter tuning consists of freezing the pretrained parameters of a model and injecting lightweight modules between layers, resulting in the addition of only a small number of task-specific trainable parameters. While adapter tuning has been investigated for multilingual neural machine translation, this paper proposes a comprehensive analysis of adapters for multilingual speech translation (ST). Starting from different pre-trained models (a multilingual ST model trained on parallel data or a multilingual BART (mBART) trained on non-parallel multilingual data), we show that adapters can be used to: (a) efficiently specialize ST to specific language pairs with a low extra cost in terms of parameters, and (b) transfer from an automatic speech recognition (ASR) task and an mBART pre-trained model to a multilingual ST task. Experiments show that adapter tuning offers competitive results to full fine-tuning, while being much more parameter-efficient.
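To make the adapter-tuning idea concrete, here is a minimal sketch (assuming a PyTorch Transformer backbone) of a bottleneck adapter and of how the pretrained parameters are frozen while only the adapters remain trainable. The class names, the bottleneck dimension, and the helper function are illustrative assumptions, not the authors' exact implementation.

```python
import torch
import torch.nn as nn


class Adapter(nn.Module):
    """Bottleneck adapter: layer norm, down-projection, non-linearity, up-projection, residual."""

    def __init__(self, d_model: int, bottleneck: int = 64):
        super().__init__()
        self.layer_norm = nn.LayerNorm(d_model)
        self.down = nn.Linear(d_model, bottleneck)
        self.up = nn.Linear(bottleneck, d_model)
        self.activation = nn.ReLU()

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Residual connection keeps the pretrained representation intact.
        return x + self.up(self.activation(self.down(self.layer_norm(x))))


def add_adapters(model: nn.Module, layer_ids: list, d_model: int) -> nn.ModuleDict:
    """Freeze the pretrained model and attach one trainable adapter per selected layer."""
    for p in model.parameters():
        p.requires_grad = False  # freeze all pretrained parameters
    # Only these adapter parameters are updated during task-specific training.
    return nn.ModuleDict({str(i): Adapter(d_model) for i in layer_ids})
```

In such a setup, specializing the model to a new language pair only requires training (and storing) the small adapter modules, which is what makes the approach parameter-efficient compared to full fine-tuning.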