LEAPT: Learning Adaptive Prefix-to-prefix Translation For Simultaneous Machine Translation

Simultaneous machine translation, which aims at real-time translation, is useful in many live scenarios but is very challenging because of the trade-off between accuracy and latency. To balance the two, the model needs to wait for an appropriate amount of streaming text (the READ policy) and then generate its translation (the WRITE policy). However, the WRITE policies of previous work are either specific to the method itself, because of end-to-end training, or suffer from an input mismatch between training and decoding, in the case of non-end-to-end training. It is therefore essential to learn a generic and better WRITE policy for simultaneous machine translation. Inspired by strategies used by human interpreters and by "wait" policies, we propose a novel adaptive prefix-to-prefix training policy called LEAPT, which allows our machine translation model to learn how to translate source sentence prefixes and make use of the future context. Experiments show that our proposed methods greatly outperform competitive baselines and achieve promising results.
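To make the READ/WRITE terminology concrete, the following is a minimal sketch of the well-known fixed wait-k schedule, the kind of "wait" policy the abstract cites as inspiration. It is not LEAPT's adaptive training policy, whose details the abstract does not give. The function names and the convention of counting tokens by position are assumptions made for illustration only.

```python
from typing import List


def wait_k_actions(src_len: int, tgt_len: int, k: int) -> List[str]:
    """Hypothetical helper: READ/WRITE action sequence of a fixed wait-k schedule.

    Before emitting target token t (0-based), the schedule requires
    min(k + t, src_len) source tokens to have been read.
    """
    actions: List[str] = []
    read = 0
    for t in range(tgt_len):
        needed = min(k + t, src_len)
        # READ until the required source prefix is available.
        while read < needed:
            actions.append("READ")
            read += 1
        # WRITE one target token conditioned on the prefix read so far.
        actions.append("WRITE")
    return actions


def visible_prefix_lengths(src_len: int, tgt_len: int, k: int) -> List[int]:
    """Hypothetical helper: source prefix length visible at each target step."""
    return [min(k + t, src_len) for t in range(tgt_len)]


if __name__ == "__main__":
    print(wait_k_actions(src_len=4, tgt_len=4, k=2))
    print(visible_prefix_lengths(src_len=4, tgt_len=4, k=2))
```

A smaller k lowers latency but gives the model less source context for each target token, which is the accuracy-latency trade-off described above.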


