The FLORES-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation

One of the biggest challenges hindering progress in low-resource and multilingual machine translation is the lack of good evaluation benchmarks. Current evaluation benchmarks either lack good coverage of low-resource languages, consider only restricted domains, or are of low quality because they are constructed using semi-automatic procedures. In this work, we introduce the FLORES-101 evaluation benchmark, consisting of 3001 sentences extracted from English Wikipedia and covering a variety of topics and domains. These sentences have been translated into 101 languages by professional translators through a carefully controlled process. The resulting dataset enables better assessment of model quality on the long tail of low-resource languages, including the evaluation of many-to-many multilingual translation systems, as all translations are multilingually aligned. By publicly releasing such a high-quality and high-coverage dataset, we hope to foster progress in the machine translation community and beyond.
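To make the "multilingually aligned" property concrete, here is a minimal Python sketch of why alignment enables many-to-many evaluation: when every language holds translations of the same sentences in the same order, any ordered pair of languages yields a test set. The language codes, toy sentences, and helper names (`translation_directions`, `evaluation_pairs`) are illustrative assumptions and are not taken from the FLORES-101 release.

```python
from itertools import permutations

# Toy stand-in for a multilingually aligned corpus: every language holds
# translations of the same source sentences, in the same order.
# Codes and sentences are illustrative, not taken from FLORES-101.
aligned = {
    "eng": ["Hello.", "Thank you."],
    "fra": ["Bonjour.", "Merci."],
    "deu": ["Hallo.", "Danke."],
}


def translation_directions(languages):
    """Return every ordered (source, target) pair of distinct languages."""
    return list(permutations(languages, 2))


def evaluation_pairs(corpus, src, tgt):
    """Pair the i-th sentence of src with the i-th sentence of tgt."""
    if len(corpus[src]) != len(corpus[tgt]):
        raise ValueError("corpus is not aligned: sentence counts differ")
    return list(zip(corpus[src], corpus[tgt]))


directions = translation_directions(sorted(aligned))
print(len(directions))  # 3 languages give 3 * 2 = 6 directions
print(evaluation_pairs(aligned, "fra", "deu"))
```

By the same counting, a benchmark aligned across 101 languages supports 101 × 100 = 10,100 translation directions without collecting any additional translations.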

