Diffusion Models (DMs) have demonstrated state-of-the-art performance in content generation without requiring adversarial training. These models are trained using a two-step process. First, a forward - diffusion - process gradually adds noise to a datum (usually an image). Then, a backward - reverse diffusion - process gradually removes the noise to turn it into a sample of the target distribution being modelled. DMs are inspired by non-equilibrium thermodynamics and have inherent high computational complexity. Due to the frequent function evaluations and gradient calculations in high-dimensional spaces, these models incur considerable computational overhead during both training and inference stages. This can not only preclude the democratization of diffusion-based modelling, but also hinder the adaption of diffusion models in real-life applications. Not to mention, the efficiency of computational models is fast becoming a significant concern due to excessive energy consumption and environmental scares. These factors have led to multiple contributions in the literature that focus on devising computationally efficient DMs. In this review, we present the most recent advances in diffusion models for vision, specifically focusing on the important design aspects that affect the computational efficiency of DMs. In particular, we emphasize the recently proposed design choices that have led to more efficient DMs. Unlike the other recent reviews, which discuss diffusion models from a broad perspective, this survey is aimed at pushing this research direction forward by highlighting the design strategies in the literature that are resulting in practicable models for the broader research community. We also provide a future outlook of diffusion models in vision from their computational efficiency viewpoint.
翻译:传播模型(DMs)在内容生成方面表现出了最先进的业绩,不需要对抗性培训。这些模型是用两步过程来培训的。首先,前向-扩散过程(Spreform-Sprocess)逐渐将噪音添加到一个表层(通常是图像),然后,后向-反向扩散过程逐渐消除噪音,将其转化为正在模拟的目标分布样本。管理模型受到非平衡热力动力学的启发,具有固有的高计算复杂性。由于高维空间经常进行功能评估和梯度计算,这些模型在培训和推断阶段都需要大量计算间接费用。这不仅可以排除基于传播模型的民主化,而且还会阻碍实际应用中传播模型的适应性。 更不用说,由于能源消耗过量和环境恐慌,计算模型的效率正在迅速成为一个重大关切问题。 这些因素使得侧重于计算高效的MDMs的文献传播具有多重贡献。 在本次审查中,我们展示了视觉传播模型的最新进展,具体侧重于基于基于传播模式的重要设计方面,在近期进行的设计中,从而导致计算效率审查。