Simultaneous machine translation (SimulMT) speeds up translation by beginning to translate before the source sentence is fully available. The task is difficult due to the limited context and the word order differences between languages. Existing methods either increase latency or introduce adaptive read-write policies so that SimulMT models can handle local reordering and improve translation quality. However, long-distance reordering causes SimulMT models to learn translation incorrectly: the model may be forced to predict target tokens before the corresponding source tokens have been read, which leads to aggressive anticipation at inference time and produces hallucinations. To mitigate this problem, we propose a new framework that decomposes the translation process into a monotonic translation step and a reordering step, and we model the latter with an auxiliary sorting network (ASN). The ASN rearranges the hidden states to match the word order of the target language, so that the SimulMT model can learn to translate more reasonably. The entire model is optimized end to end and does not rely on external aligners or data. At inference time, the ASN is removed to achieve streaming translation. Experiments show that the proposed framework outperforms previous methods with lower latency.
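To illustrate the core idea, the sketch below shows what it means to rearrange source-order hidden states into target-language order. This is a toy with a hard-coded permutation; in the proposed framework the ASN learns the reordering end to end, and its actual architecture is not specified in this abstract — the function name, the list-of-vectors representation, and the example sentence are all illustrative assumptions.

```python
def reorder_hidden_states(hidden_states, permutation):
    """Rearrange source-order hidden states into target order.

    hidden_states: list of per-token hidden states, in source order
    permutation:   permutation[i] = index of the source state that should
                   appear at target position i
    """
    # Sanity check: the index list must be a valid permutation.
    assert sorted(permutation) == list(range(len(hidden_states)))
    return [hidden_states[j] for j in permutation]

# Toy example (illustrative only): a verb-final source order rendered
# into English-like subject-verb-object order. Each state is labelled
# by the source token it encodes.
states = ["h(ich)", "h(dich)", "h(liebe)"]   # toy verb-final order
target_order = [0, 2, 1]                     # -> "I love you" order
print(reorder_hidden_states(states, target_order))
# -> ['h(ich)', 'h(liebe)', 'h(dich)']
```

With the states in target order, the monotonic translation step no longer has to anticipate unread source tokens, which is the motivation the abstract gives for the decomposition.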