Minimum Bayes risk (MBR) decoding generates high-quality translations by maximizing the expected utility of output candidates, but it evaluates all pairwise scores over the candidate set; hence, it takes quadratic time with respect to the number of candidates. To reduce the number of utility function calls, probabilistic MBR (PMBR) decoding partially evaluates quality scores using sampled pairs of candidates and completes the missing scores with a matrix completion algorithm. Nevertheless, it degrades the translation quality as the number of utility function calls is reduced. Therefore, to improve the trade-off between quality and cost, we propose agreement-constrained PMBR (AC-PMBR) decoding, which leverages a knowledge distilled model to guide the completion of the score matrix. Our AC-PMBR decoding improved approximation errors of matrix completion by up to 3 times and achieved higher translation quality compared with PMBR decoding at a comparable computational cost on the WMT'23 En$\leftrightarrow$De translation tasks.
翻译:最小贝叶斯风险(MBR)解码通过最大化输出候选的期望效用生成高质量翻译,但其需评估候选集内所有成对得分,因此计算复杂度随候选数量呈二次方增长。为减少效用函数调用次数,概率最小贝叶斯风险(PMBR)解码通过采样候选对部分评估质量得分,并利用矩阵补全算法填补缺失得分。然而,随着效用函数调用次数的减少,该方法会降低翻译质量。为改善质量与计算成本间的权衡,我们提出基于一致性约束的概率最小贝叶斯风险(AC-PMBR)解码,该方法利用知识蒸馏模型指导得分矩阵的补全过程。在WMT'23英德双向翻译任务中,我们的AC-PMBR解码将矩阵补全的近似误差降低了最高达3倍,并在相近计算成本下实现了比PMBR解码更高的翻译质量。