In Neural Machine Translation (and, more generally, conditional language modeling), the generation of a target token is influenced by two types of context: the source and the prefix of the target sequence. While many attempts have been made to understand the internal workings of NMT models, none of them explicitly evaluates the relative source and target contributions to a generation decision. We argue that this relative contribution can be evaluated by adopting a variant of Layerwise Relevance Propagation (LRP). Its underlying 'conservation principle' makes relevance propagation unique: differently from other methods, it evaluates not an abstract quantity reflecting token importance, but the proportion of each token's influence. We extend LRP to the Transformer and conduct an analysis of NMT models which explicitly evaluates the relative source and target contributions to the generation process. We analyze changes in these contributions when conditioning on different types of prefixes, when varying the training objective or the amount of training data, and over the course of training. We find that models trained with more data tend to rely on source information more and to have sharper token contributions, and that the training process is non-monotonic, with several stages of different nature.