We introduce a modified version of the excess risk, which can be used to obtain tighter, fast-rate PAC-Bayesian generalisation bounds. This modified excess risk leverages information about the relative hardness of data examples to reduce the variance of its empirical counterpart, tightening the bound. We combine this with a new bound for $[-1, 1]$-valued (and potentially non-independent) signed losses, which is more favourable when they empirically have low variance around $0$. The primary new technical tool is a novel result for sequences of interdependent random vectors which may be of independent interest. We empirically evaluate these new bounds on a number of real-world datasets.
翻译:我们引入了经修改的超重风险版本,可用于获取更严格、快速的PAC-Bayesian通用范围。这种经修改的超重风险将数据实例相对硬性的信息用于减少经验对应方的差异,同时收紧约束。我们将此与新的 $[1,1] 价值为$[1,1] 的(和潜在非独立 )经签字的损失约束结合起来,当它们根据经验将差异降低到约0美元左右时,这种约束就更为有利。主要的新技术工具是独立感兴趣的相互依存随机矢量序列的新结果。我们用经验评估了一些真实世界数据集的这些新界限。