Crosslingual conditional generation (e.g., machine translation) has long enjoyed the benefits of scaling. Nonetheless, there are still issues that scale alone may not overcome. A source query in one language, for instance, may yield several translation options in another language without any extra context. Only one translation could be acceptable however, depending on the translator's preferences and goals. Choosing the incorrect option might significantly affect translation usefulness and quality. We propose a novel method interactive-chain prompting -- a series of question, answering and generation intermediate steps between a Translator model and a User model -- that reduces translations into a list of subproblems addressing ambiguities and then resolving such subproblems before producing the final text to be translated. To check ambiguity resolution capabilities and evaluate translation quality, we create a dataset exhibiting different linguistic phenomena which leads to ambiguities at inference for four languages. To encourage further exploration in this direction, we release all datasets. We note that interactive-chain prompting, using eight interactions as exemplars, consistently surpasses prompt-based methods with direct access to background information to resolve ambiguities.
翻译:长期以来,跨语言的有条件的跨语言生成(例如,机器翻译)一直享受到推广的好处。然而,仍然有一些问题,单靠规模也许无法克服。例如,一种语言的源查询可以产生几种其他语言的翻译选项,而无需任何额外的背景。然而,根据翻译的偏好和目标,只有一种翻译是可以接受的。选择错误的选项可能会大大影响翻译的效用和质量。我们建议采用新颖的方法交互式链,即一系列问题,在翻译模型和用户模型之间进行回答和生成中间步骤 -- -- 将翻译减少为解决含混问题的一个子问题清单,然后在产生最后文本要翻译之前解决这类子问题。为了检查模糊的解析能力并评估翻译质量,我们创建了一个显示不同语言现象的数据集,从而导致对四种语言的推论含糊不清。为了鼓励进一步探讨这个方向,我们发布所有数据集。我们注意到,互动式链利用作为Explaers的八种互动,不断超过直接获取背景信息以解决模糊性的方法。