Diverse machine translation aims at generating various target language translations for a given source language sentence. Leveraging the linear relationship in the sentence latent space introduced by the mixup training, we propose a novel method, MixDiversity, to generate different translations for the input sentence by linearly interpolating it with different sentence pairs sampled from the training corpus when decoding. To further improve the faithfulness and diversity of the translations, we propose two simple but effective approaches to select diverse sentence pairs in the training corpus and adjust the interpolation weight for each pair correspondingly. Moreover, by controlling the interpolation weight, our method can achieve the trade-off between faithfulness and diversity without any additional training, which is required in most of the previous methods. Experiments on WMT'16 en-ro, WMT'14 en-de, and WMT'17 zh-en are conducted to show that our method substantially outperforms all previous diverse machine translation methods.
翻译:多种机器翻译旨在为特定源语言句生成各种目标语言翻译。 利用混合培训带来的句子潜在空间中的线性关系,我们提出一种新颖的方法,即混合多样性,通过线性地对输入句进行不同的翻译,在解码时对输入句进行不同句子的抽调;为了进一步提高翻译的忠诚性和多样性,我们建议了两种简单而有效的方法,在培训材料中选择不同的句子,并相应调整每对词的内插权重。 此外,通过控制内插权重,我们的方法可以在不进行任何额外培训的情况下实现忠诚与多样性之间的权衡,而以往方法大多要求这种权衡。 对WMT'16 en-ro、WMT'14 en-de和WMT'17 ZH-en进行了实验,以显示我们的方法大大超越了以往所有不同的机器翻译方法。