In open-ended natural-language generation, existing text decoding methods typically struggle to produce text which is both diverse and high-quality. Greedy and beam search are known to suffer from text degeneration and linguistic diversity issues, while temperature, top-k, and nucleus sampling often yield diverse but low-quality outputs. In this work, we present crowd sampling, a family of decoding methods based on Bayesian risk minimization, to address this diversity-quality trade-off. Inspired by the principle of "the wisdom of the crowd," crowd sampling seeks to select a candidate from a pool of candidates that has the least expected risk (i.e., highest expected reward) under a generative model according to a given utility function. Crowd sampling can be seen as a generalization of numerous existing methods, including majority voting, and in practice, it can be used as a drop-in replacement for existing sampling methods. Extensive experiments show that crowd sampling delivers improvements of 3-7 ROUGE and BLEU points across a wide range of tasks, including summarization, data-to-text, translation, and textual style transfer, while achieving new state-of-the-art results on WebNLG and WMT'16.
翻译:在开放的自然语言中,现有的文字解码方法通常很难产生多样化和高质量的文本。据知,贪婪和梁搜索会受到文本变异和语言多样性问题的影响,而温度、顶点和核取样往往产生多样化但低质量的产出。在这项工作中,我们介绍人群抽样,这是一套基于巴伊西亚风险最小化的解码方法,以解决这种多样性质量权衡问题。受“人群智慧”原则的启发,人群抽样试图从一个具有最不预期风险(即最高预期报酬)的候选人群体中挑选一名候选人,根据一种基因化模式,根据特定公用事业功能,从其中的基因化模型中挑选一名候选人。人群抽样可被视为包括多数投票在内的多种现有方法的概括化,在实践中,可用作现有取样方法的一滴替代。广泛的实验表明,人群抽样在一系列广泛任务中提供了3-7 ROUGE和BLEU点的改进,包括总结、数据到文字、翻译和文本风格风格转移,同时实现WMT-LG的新状态结果。