Energy-based models (EBMs) offer flexible distribution parametrization. However, due to the intractable partition function, they are typically trained via contrastive divergence for maximum likelihood estimation. In this paper, we propose pseudo-spherical contrastive divergence (PS-CD) to generalize maximum likelihood learning of EBMs. PS-CD is derived from the maximization of a family of strictly proper homogeneous scoring rules, which avoids the computation of the intractable partition function and provides a generalized family of learning objectives that include contrastive divergence as a special case. Moreover, PS-CD allows us to flexibly choose various learning objectives to train EBMs without additional computational cost or variational minimax optimization. Theoretical analysis on the proposed method and extensive experiments on both synthetic data and commonly used image datasets demonstrate the effectiveness and modeling flexibility of PS-CD, as well as its robustness to data contamination, thus showing its superiority over maximum likelihood and $f$-EBMs.
翻译:以能源为基础的模型(EBMs)提供了灵活的分布分布平衡,然而,由于难以调和的分布功能,这些模型通常通过对比差异加以培训,以便进行最大可能的估计;在本文中,我们提议了假球对比差异(PS-CD),以便尽可能普及对EBMs的学习。PS-CD来自一个严格相同的评分规则的大家庭的最大化,这避免了对棘手分割功能的计算,并提供了一个包括差异差异的通用学习目标大家庭,作为一个特殊案例。此外,PS-CD允许我们灵活选择各种学习目标来培训EBMs,而不增加计算成本或变式微缩缩缩缩缩缩缩胶。关于拟议方法的理论分析和关于合成数据和常用图像数据集的广泛实验表明PS-CD的有效性和建模灵活性,以及其对数据污染的坚固性,从而表明其优于最大可能性和美元-EBMs。