Standard machine translation systems process sentences in isolation and hence ignore extra-sentential information, even though extended context can both prevent mistakes in ambiguous cases and improve translation coherence. We introduce a context-aware neural machine translation model designed in such way that the flow of information from the extended context to the translation model can be controlled and analyzed. We experiment with an English-Russian subtitles dataset, and observe that much of what is captured by our model deals with improving pronoun translation. We measure correspondences between induced attention distributions and coreference relations and observe that the model implicitly captures anaphora. It is consistent with gains for sentences where pronouns need to be gendered in translation. Beside improvements in anaphoric cases, the model also improves in overall BLEU, both over its context-agnostic version (+0.7) and over simple concatenation of the context and source sentences (+0.6).
翻译:标准机器翻译系统单独处理判决,从而忽视非文件资料,尽管扩展的背景既可以防止在模棱两可的案件中出现错误,也可以提高翻译的一致性。我们引入了一种符合背景的神经机翻译模式,其设计方式是能够控制和分析从扩展背景到翻译模式的信息流动。我们试验了一个英语-俄语字幕数据集,并观察到我们模型所捕捉的大部分内容涉及改进原声翻译。我们测量了引人注意分布和共同参照关系之间的对应关系,并观察到该模式隐含地捕捉了Anaphora。这与在翻译中需要用性别来表示的句子的得益是一致的。在无光谱案例中,该模型还改善了整个BLEU的全局性,不仅改善了其上下文-不可知性版本(+0.7),而且加强了对上下文和源句的简单组合(+0.6)。