We investigate the cold posterior effect through the lens of PAC-Bayes generalization bounds. We argue that in the non-asymptotic setting, when the number of training samples is (relatively) small, discussions of the cold posterior effect should take into account that approximate Bayesian inference does not readily provide guarantees of performance on out-of-sample data. Instead, out-of-sample error is better described through a generalization bound. In this context, we explore the connections between the ELBO objective from variational inference and the PAC-Bayes objectives. We note that, while the ELBO and PAC-Bayes objectives are similar, the latter objectives naturally contain a temperature parameter $\lambda$ which is not restricted to be $\lambda=1$. For both regression and classification tasks, in the case of isotropic Laplace approximations to the posterior, we show how this PAC-Bayesian interpretation of the temperature parameter captures the cold posterior effect.
翻译:我们通过PAC-Bayes通用界限的透镜来调查冷后继效应。我们争辩说,在非简易环境中,当训练样品的数量(相对而言)很小时,对冷后继效应的讨论应当考虑到,近似巴伊西亚的推论并不能随时保证从抽样数据中产生性能的保证。相反,通过概括性约束来描述抽样外误差更好。在这方面,我们探讨ELBO目标从变相推断与PAC-Bayes目标之间的联系。我们注意到,虽然ELBO和PAC-Bayes目标相似,但后一目标自然含有温度参数$\lambda$,不受限制为$\lambda=1美元。对于回归和分类任务,在后表的Iotropic Laplace近似物中,我们展示PAC-Bayesian对温度参数的这种解释如何捕捉到冷后继效应。