Though denoising diffusion probabilistic models (DDPMs) have achieved remarkable generation results, the low sampling efficiency of DDPMs still limits further applications. Since DDPMs can be formulated as diffusion ordinary differential equations (ODEs), various fast sampling methods can be derived from solving diffusion ODEs. However, we notice that previous sampling methods with fixed analytical form are not robust with the error in the noise estimated from pretrained diffusion models. In this work, we construct an error-robust Adams solver (ERA-Solver), which utilizes the implicit Adams numerical method that consists of a predictor and a corrector. Different from the traditional predictor based on explicit Adams methods, we leverage a Lagrange interpolation function as the predictor, which is further enhanced with an error-robust strategy to adaptively select the Lagrange bases with lower error in the estimated noise. Experiments on Cifar10, LSUN-Church, and LSUN-Bedroom datasets demonstrate that our proposed ERA-Solver achieves 5.14, 9.42, and 9.69 Fenchel Inception Distance (FID) for image generation, with only 10 network evaluations.
翻译:尽管去除扩散概率模型(DDPMs)取得了显著的生成结果,但DDPMs取样效率低仍然进一步限制了应用。由于DDPMs可以作为扩散普通差分方程式(ODEs)来设计,因此可以通过解决扩散数方程式(ODEs)来得出各种快速取样方法。然而,我们注意到,以前采用固定分析形式的取样方法并不健全,因为事先经过训练的传播模型估计的噪音有误差。在这项工作中,我们建造了一个错误-robust Adams解答器(ERA-Solver),该解答器使用了由预测器和校正器组成的隐含的Adams数字方法。与基于明确的Adams方法的传统预测器不同,我们利用一个拉格朗的内插器作为预测器,用错误-robustobet战略进一步加强了这一功能,以适应性地选择在估计噪音中差错的Lagrange基础。在Cifar10、LSUN-church、LSUN-Broom数据集上进行的实验表明,我们提议的ERARS-Soldervision 10 图像网络仅与10的LFIFIS15)仅进行了5.14、9.42和9.69。