Simultaneous machine translation, which aims at a real-time translation, is useful in many live scenarios but very challenging due to the trade-off between accuracy and latency. To achieve the balance for both, the model needs to wait for appropriate streaming text (READ policy) and then generates its translation (WRITE policy). However, WRITE policies of previous work either are specific to the method itself due to the end-to-end training or suffer from the input mismatch between training and decoding for the non-end-to-end training. Therefore, it is essential to learn a generic and better WRITE policy for simultaneous machine translation. Inspired by strategies utilized by human interpreters and "wait" policies, we propose a novel adaptive prefix-to-prefix training policy called LEAPT, which allows our machine translation model to learn how to translate source sentence prefixes and make use of the future context. Experiments show that our proposed methods greatly outperform competitive baselines and achieve promising results.
翻译:同时机器翻译旨在实现实时翻译,因精度与延迟之间的平衡而非常具有挑战性。为了实现两者的平衡,模型需要等待适当的流式文本(READ策略),然后生成其翻译(WRITE策略)。然而,以前的WRITE策略由于端到端训练本身的特殊性或由于非端到端训练时训练与解码的输入不匹配而具有特定性。因此,学习适用于同时机器翻译的通用和更好的WRITE策略至关重要。受人类口译员策略和“等待”策略的启发,我们提出了一种新颖的适应性前缀到前缀训练策略,称为LEAPT,允许我们的机器翻译模型学习如何翻译源句前缀并利用未来的语境。实验表明,我们提出的方法大大优于竞争基线,并取得了良好的结果。