Given the rise of a new approach to MT, Neural MT (NMT), and its promising performance on different text types, we assess the translation quality it can attain on what is perceived to be the greatest challenge for MT: literary text. Specifically, we target novels, arguably the most popular type of literary text. We build a literary-adapted NMT system for the English-to-Catalan translation direction and evaluate it against a system pertaining to the previous dominant paradigm in MT: statistical phrase-based MT (PBSMT). To this end, for the first time we train MT systems, both NMT and PBSMT, on large amounts of literary text (over 100 million words) and evaluate them on a set of twelve widely known novels spanning from the the 1920s to the present day. According to the BLEU automatic evaluation metric, NMT is significantly better than PBSMT (p < 0.01) on all the novels considered. Overall, NMT results in a 11% relative improvement (3 points absolute) over PBSMT. A complementary human evaluation on three of the books shows that between 17% and 34% of the translations, depending on the book, produced by NMT (versus 8% and 20% with PBSMT) are perceived by native speakers of the target language to be of equivalent quality to translations produced by a professional human translator.
翻译:鉴于对MT、Neal MT(NMT)的新做法的兴起及其在不同文本类型上的有希望的表现,我们评估了它能够达到的翻译质量,认为这对MT(文学文本)来说是最大的挑战:文学文本。具体地说,我们的目标是小说,可以说是最受欢迎的文学文本类型。我们为英语到卡塔兰翻译方向建立了一个经过文学改造的NMT系统,并对照与MT(基于统计的语句的MT(PBSMT)的先前主导模式(PBSMT)有关的制度对其进行评估。为此,我们首次就大量文学文本(超过1亿字)对NMT和PBMMMM(P)系统(PBSMMM)系统进行了培训,对三本专业书籍(NMT)进行了补充性评估,从1920年代到今天这12种广为人所知的小说,根据BLE的自动评价标准,NMT(p 0.01)比所有所考虑的PBSMMM(PBSMMMT)系统(PBM)要好得多。总体来说,比11%(绝对3点)比PBMMTMT(3分)比20页质量翻译翻译的17%和3 %。