Score-based (denoising diffusion) generative models have recently gained a lot of success in generating realistic and diverse data. These approaches define a forward diffusion process for transforming data to noise and generate data by reversing it (thereby going from noise to data). Unfortunately, current score-based models generate data very slowly due to the sheer number of score network evaluations required by numerical SDE solvers. In this work, we aim to accelerate this process by devising a more efficient SDE solver. Existing approaches rely on the Euler-Maruyama (EM) solver, which uses a fixed step size. We found that naively replacing it with other SDE solvers fares poorly - they either result in low-quality samples or become slower than EM. To get around this issue, we carefully devise an SDE solver with adaptive step sizes tailored to score-based generative models piece by piece. Our solver requires only two score function evaluations, rarely rejects samples, and leads to high-quality samples. Our approach generates data 2 to 10 times faster than EM while achieving better or equal sample quality. For high-resolution images, our method leads to significantly higher quality samples than all other methods tested. Our SDE solver has the benefit of requiring no step size tuning.
翻译:基于分数(减少传播)的基因化模型最近在生成现实和多样化数据方面取得了许多成功。这些方法界定了将数据转化为噪音并通过逆转数据(从噪音到数据)生成数据的一个前方扩散过程。不幸的是,由于数字SDE解答器要求的分数网络评价数量之多,目前的分数模型生成的数据非常缓慢。在这项工作中,我们的目标是通过设计一个效率更高的SDE求解器来加快这一进程。现有方法依赖于使用固定步数大小的Euler-Maruyama(EM)求解器。我们发现,用其他SDE解答器取代数据时,天真地取代数据的过程差,要么导致低质量样本,要么比EM慢。为了绕过这个问题,我们仔细设计了一个适应性步数的SDE求解答器。我们的解答器只需要两个分数函数评价,很少拒绝样本,并导致高质量的样本。我们的方法比EM更快2到10倍的数据,同时取得更好的或同等的样数质量。对于高分辨率图像来说,我们的方法不会导致大大提高质量的样本。