Beam search is the default decoding strategy for many sequence generation tasks in NLP. The set of approximate K-best items returned by the algorithm is a useful summary of the distribution for many applications; however, the candidates typically exhibit high overlap and may give a highly biased estimate for expectations under our model. These problems can be addressed by instead using stochastic decoding strategies. In this work, we propose a new method for turning beam search into a stochastic process: Conditional Poisson stochastic beam search. Rather than taking the maximizing set at each iteration, we sample K candidates without replacement according to the conditional Poisson sampling design. We view this as a more natural alternative to Kool et. al. 2019's stochastic beam search (SBS). Furthermore, we show how samples generated under the CPSBS design can be used to build consistent estimators and sample diverse sets from sequence models. In our experiments, we observe CPSBS produces lower variance and more efficient estimators than SBS, even showing improvements in high entropy settings.
翻译:光束搜索是NLP中许多序列生成任务的默认解码策略。 通过算法返回的一套近似 K- 最佳项目是许多应用程序分布的有用摘要; 然而, 候选人通常表现出高度重叠, 可能对我们的模型下的期望作出高度偏差的估计。 这些问题可以通过使用随机解码战略来解决。 在这项工作中, 我们提出了将光束搜索转换成一个随机过程的新方法 : 有条件的 Poisson 随机波束搜索。 我们没有在每次循环中采用最大化设置, 我们根据有条件的 Poisson 取样设计对 K 候选人进行抽样, 而没有进行替换。 我们将此视为Kool 等人 2019 的随机光束搜索( SBS) 的更自然的替代方法。 此外, 我们展示了在 CPSBS 设计下生成的样本如何用于从序列模型中构建一致的测算器和样本多样化数据集。 我们的实验中, 我们观察 CPSBSBS 产生比 SBS 更低的差异和更高效的测算器, 甚至显示高温室环境的改进。