Analyzing Uncertainty in Neural Machine Translation

Machine translation is a popular test bed for research in neural sequence-to-sequence models but despite much recent research, there is still a lack of understanding of these models. Practitioners report performance degradation with large beams, the under-estimation of rare words and a lack of diversity in the final translations. Our study relates some of these issues to the inherent uncertainty of the task, due to the existence of multiple valid translations for a single source sentence, and to the extrinsic uncertainty caused by noisy training data. We propose tools and metrics to assess how uncertainty in the data is captured by the model distribution and how it affects search strategies that generate translations. Our results show that search works remarkably well but that the models tend to spread too much probability mass over the hypothesis space. Next, we propose tools to assess model calibration and show how to easily fix some shortcomings of current models. We release both code and multiple human reference translations for two popular benchmarks.
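
The claim that models spread too much probability mass over the hypothesis space can be illustrated with a small calculation. The sketch below is not the authors' released tooling. It assumes a hypothetical list of sequence-level log-probabilities for the hypotheses returned by beam search, and reports how much of the model's total probability mass those hypotheses cover. The function name `covered_mass` and the example scores are made up for illustration.

```python
import math


def covered_mass(log_probs):
    """Cumulative probability mass of hypotheses, best-scoring first.

    log_probs: sequence-level log-probabilities, one per distinct hypothesis
    (hypothetical input; any decoder that exposes such scores would do).
    Returns a list whose k-th entry is the mass covered by the top k+1 hypotheses.
    """
    probs = sorted((math.exp(lp) for lp in log_probs), reverse=True)
    cumulative = []
    total = 0.0
    for p in probs:
        total += p
        cumulative.append(total)
    return cumulative


# Made-up scores for a beam of four hypotheses (illustration only).
beam_log_probs = [math.log(0.05), math.log(0.03), math.log(0.02), math.log(0.01)]
mass = covered_mass(beam_log_probs)
print(f"mass covered by the beam: {mass[-1]:.3f}")
```

With these made-up scores the four hypotheses together cover only about 11% of the probability mass, which is the kind of spread the abstract describes. Real measurements would require scores from an actual model.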
