Existing Score-based Generative Models (SGMs) can be categorized into constrained SGMs (CSGMs) or unconstrained SGMs (USGMs) according to their parameterization approaches. CSGMs model probability density functions as Boltzmann distributions, and assign their predictions as the negative gradients of some scalar-valued energy functions. On the other hand, USGMs employ flexible architectures capable of directly estimating scores without the need to explicitly model energy functions. In this paper, we demonstrate that the architectural constraints of CSGMs may limit their modeling ability. In addition, we show that USGMs' inability to preserve the property of conservativeness may lead to degraded sampling performance in practice. To address the above issues, we propose Quasi-Conservative Score-based Generative Models (QCSGMs) for keeping the advantages of both CSGMs and USGMs. Our theoretical derivations demonstrate that the training objective of QCSGMs can be efficiently integrated into the training processes by leveraging the Hutchinson trace estimator. In addition, our experimental results on the CIFAR-10, CIFAR-100, ImageNet, and SVHN datasets validate the effectiveness of QCSGMs. Finally, we justify the advantage of QCSGMs using an example of a one-layered autoencoder.
翻译:现有基于分数的生成模型(SGM)可根据其参数化方法分为限制的SGM(CSGMs)或不受限制的SGM(USGMs) 。CSGMs 模拟概率密度功能作为Boltzmann的分布,并将其预测定为某些标价能源功能的负梯度。另一方面,USGMs采用能够直接估计分数的灵活结构,而无需明确示范能源功能。在本文中,我们证明CSGMs的建筑限制可能限制其建模能力。此外,我们表明,USGMs无法保存保守性特性可能导致实际采样业绩的下降。为了解决上述问题,我们提议以基于准的评分模型(QCSGMs) 来保持CSGMs和USGs的优势。我们的理论推断表明,通过利用Hutchinson 跟踪仪,我们关于CIFAR-10的实验结果,CIFAR-CS-CS-CSL-CSLAVS-CSUral-CSUral-AGMSUral-CSUral-CSUIGGMA-SUILA-CSUILOral-CSUIGR 的优势, 我们GM-CRA-CRVAL-CRVAL-A-IGVAL-A-CRVA-A-IGVA-IGVA的一例。