Score-based generative models provide state-of-the-art quality for image and audio synthesis. Sampling from these models is performed iteratively, typically employing a discretized series of noise levels and a predefined scheme. In this note, we first overview three common sampling schemes for models trained with denoising score matching. Next, we focus on one of them, consistent annealed sampling, and study its hyper-parameter boundaries. We then highlight a possible formulation of such hyper-parameter that explicitly considers those boundaries and facilitates tuning when using few or a variable number of steps. Finally, we highlight some connections of the formulation with other sampling schemes.
翻译:基于计分的基因化模型为图像和音频合成提供了最先进的质量。从这些模型中进行取样是迭接式的,通常采用一系列离散的噪音水平和预设办法。在本说明中,我们首先概述三种通用的抽样方案,即经过分级比对培训的模型。然后,我们着重其中一种,一致的无线取样,并研究其超参数界限。然后,我们强调一种可能的超参数配方,明确考虑这些边界,并在使用几个或几个不同步骤时便利调适。最后,我们强调该配方与其他取样方案之间的某些联系。