InSPECT：扩散模型的不变谱特征保持 (InSPECT: Invariant Spectral Features Preservation of Diffusion Models)

Modern diffusion models (DMs) have achieved state-of-the-art image generation. However, the fundamental design choice of diffusing data all the way to white noise and then reconstructing it leads to an extremely difficult and computationally intractable prediction task. To overcome this limitation, we propose InSPECT (Invariant Spectral Feature-Preserving Diffusion Model), a novel diffusion model that keeps invariant spectral features during both the forward and backward processes. At the end of the forward process, the Fourier coefficients smoothly converge to a specified random noise, enabling features preservation while maintaining diversity and randomness. By preserving invariant features, InSPECT demonstrates enhanced visual diversity, faster convergence rate, and a smoother diffusion process. Experiments on CIFAR-10, Celeb-A, and LSUN demonstrate that InSPECT achieves on average a 39.23% reduction in FID and 45.80% improvement in IS against DDPM for 10K iterations under specified parameter settings, which demonstrates the significant advantages of preserving invariant features: achieving superior generation quality and diversity, while enhancing computational efficiency and enabling faster convergence rate. To the best of our knowledge, this is the first attempt to analyze and preserve invariant spectral features in diffusion models.

翻译：现代扩散模型（DMs）已实现最先进的图像生成。然而，其将数据完全扩散至白噪声再进行重建的基本设计选择，导致了一个极其困难且计算上难以处理的预测任务。为克服这一限制，我们提出了InSPECT（不变谱特征保持扩散模型），这是一种在正向和反向过程中均保持不变谱特征的新型扩散模型。在正向过程结束时，傅里叶系数平滑收敛至指定的随机噪声，从而在保持多样性和随机性的同时实现特征保持。通过保持不变特征，InSPECT展现出增强的视觉多样性、更快的收敛速度以及更平滑的扩散过程。在CIFAR-10、Celeb-A和LSUN数据集上的实验表明，在指定参数设置下经过10K次迭代，InSPECT相较于DDPM平均实现了39.23%的FID降低和45.80%的IS提升，这证明了保持不变特征的显著优势：在实现更优生成质量和多样性的同时，提升了计算效率并实现了更快的收敛速度。据我们所知，这是首次在扩散模型中分析并保持不变谱特征的尝试。