摊销噪音频道神经机器翻译 (Amortized Noisy Channel Neural Machine Translation)

Noisy channel models have been especially effective in neural machine translation (NMT). However, recent approaches like "beam search and rerank" (BSR) incur significant computation overhead during inference, making real-world application infeasible. We aim to build an amortized noisy channel NMT model such that greedily decoding from it would generate translations that maximize the same reward as translations generated using BSR. We attempt three approaches: knowledge distillation, 1-step-deviation imitation learning, and Q learning. The first approach obtains the noisy channel signal from a pseudo-corpus, and the latter two approaches aim to optimize toward a noisy-channel MT reward directly. All three approaches speed up inference by 1-2 orders of magnitude. For all three approaches, the generated translations fail to achieve rewards comparable to BSR, but the translation quality approximated by BLEU is similar to the quality of BSR-produced translations.

翻译：在神经机翻译中,噪音频道模型特别有效。然而,最近的一些方法,如“波音搜索和重新排序”(BSR)在推理过程中引起了大量的计算间接费用,使得现实世界应用无法实现。我们的目标是建立一个摊销的噪音频道NMT模型,这样贪婪地解码出它就能产生与使用 BSR 生成的翻译同样的最大回报的翻译。我们尝试了三种方法:知识蒸馏、一步步递仿真学习和Q学习。第一种方法从一个伪体获取噪音频道信号,而后两种方法旨在直接优化对噪音频道MT的奖励。所有三种方法都加速了1-2级的推论。对于所有三种方法,产生的翻译都无法取得与BSR相似的回报,但BLEU所估计的翻译质量与BSR制作的翻译质量相似。

相关内容

Machine Translation

关注 209

机器翻译（Machine Translation）涵盖计算语言学和语言工程的所有分支，包含多语言方面。特色论文涵盖理论，描述或计算方面的任何下列主题:双语和多语语料库的编写和使用，计算机辅助语言教学，非罗马字符集的计算含义，连接主义翻译方法，对比语言学等。官网地址：http://dblp.uni-trier.de/db/journals/mt/

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

专知会员服务

138+阅读 · 2022年2月6日

专知会员服务

30+阅读 · 2021年1月25日

【Facebook AI】无监督机器翻译，336页ppt，Unsupervised Machine Translation

专知会员服务

19+阅读 · 2020年11月17日

近期必读的六篇 ICML 2020【元学习（Meta Learning）】相关论文

专知会员服务

45+阅读 · 2020年9月25日

【伯克利】黑盒机器翻译系统的模仿攻击与防御，Imitation Attacks and Defenses for Black-box Machine Translation Systems