难懂的机器翻译评价 (Difficulty-Aware Machine Translation Evaluation)

The high-quality translation results produced by machine translation (MT) systems still pose a huge challenge for automatic evaluation. Current MT evaluation pays the same attention to each sentence component, while the questions of real-world examinations (e.g., university examinations) have different difficulties and weightings. In this paper, we propose a novel difficulty-aware MT evaluation metric, expanding the evaluation dimension by taking translation difficulty into consideration. A translation that fails to be predicted by most MT systems will be treated as a difficult one and assigned a large weight in the final score function, and conversely. Experimental results on the WMT19 English-German Metrics shared tasks show that our proposed method outperforms commonly used MT metrics in terms of human correlation. In particular, our proposed method performs well even when all the MT systems are very competitive, which is when most existing metrics fail to distinguish between them. The source code is freely available at https://github.com/NLP2CT/Difficulty-Aware-MT-Evaluation.

翻译：机器翻译(MT)系统产生的高质量翻译结果仍对自动评价构成巨大挑战。目前的MT评价对每个句子部分给予同样的重视,而现实世界考试(例如大学考试)的问题则有不同的困难和加权。在本文件中,我们提出一个新的难懂的MT评价指标,通过考虑翻译困难来扩大评价层面。大多数MT系统无法预测的翻译将被视为困难的翻译,在最后分数函数中给予很大分数,反之亦然。WMT19英文-德文的实验结果显示,我们所提议的方法在人际关系方面优于常用的MT衡量标准。特别是,即使所有MT系统都非常具有竞争力,也就是大多数现有指标无法区分它们时,我们提议的方法也表现良好。源代码可免费查阅https://github.com/NLP2CT/Difficolty-Aware-MT-Evale。

相关内容

Machine Translation

关注 209

机器翻译（Machine Translation）涵盖计算语言学和语言工程的所有分支，包含多语言方面。特色论文涵盖理论，描述或计算方面的任何下列主题:双语和多语语料库的编写和使用，计算机辅助语言教学，非罗马字符集的计算含义，连接主义翻译方法，对比语言学等。官网地址：http://dblp.uni-trier.de/db/journals/mt/

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

31+阅读 · 2021年9月29日

机器学习简明导论，62页pdf

专知会员服务

83+阅读 · 2021年7月31日

面向预测数据分析的机器学习，72页pdf

专知会员服务

66+阅读 · 2021年7月18日

【Facebook AI】无监督机器翻译，336页ppt，Unsupervised Machine Translation

专知会员服务

19+阅读 · 2020年11月17日