Attention Link: An Efficient Attention-Based Low Resource Machine Translation Architecture


February 1, 2023


Transformers have achieved great success in machine translation, but transformer-based NMT models often require millions of bilingual parallel sentence pairs for training. In this paper, we propose a novel architecture named attention link (AL) to improve the performance of transformer models, especially when training resources are scarce. We theoretically demonstrate the superiority of the attention link architecture in low-resource settings. In addition, we conduct extensive experiments, including en-de, de-en, en-fr, en-it, it-en, and en-ro translation tasks on the IWSLT14 dataset, as well as genuinely low-resource bn-gu and gu-ta translation tasks on the CVIT PIB dataset. All experimental results show that attention link leads to significant improvements. Furthermore, by combining attention link with other advanced methods, we achieve a BLEU score of 37.9 on the IWSLT14 de-en task, a new state of the art.

