We present a framework to derive bounds on the test loss of randomized learning algorithms for the case of bounded loss functions. Drawing from Steinke & Zakynthinou (2020), this framework leads to bounds that depend on the conditional information density between the the output hypothesis and the choice of the training set, given a larger set of data samples from which the training set is formed. Furthermore, the bounds pertain to the average test loss as well as to its tail probability, both for the PAC-Bayesian and the single-draw settings. If the conditional information density is bounded uniformly in the size $n$ of the training set, our bounds decay as $1/n$. This is in contrast with the tail bounds involving conditional information measures available in the literature, which have a less benign $1/\sqrt{n}$ dependence. We demonstrate the usefulness of our tail bounds by showing that they lead to nonvacuous estimates of the test loss achievable with some neural network architectures trained on MNIST and Fashion-MNIST.
翻译:我们提出了一个框架, 用于计算受约束损失函数的随机学习算法测试损失的界限。 根据Steinke & Zakynthinou (202020年), 这个框架导致的界限取决于产出假设和选择培训组之间的有条件信息密度, 取决于构成培训组的较大型数据样本。 此外, 界限涉及平均测试损失及其尾端概率, 包括PAC- Bayesian 和单拖线设置。 如果有条件信息密度以培训组的美元大小统一捆绑, 我们的界限会以1美元/ 美元的速度衰减。 这与文献中包含有条件信息计量的尾端界限形成对照, 后者含有不那么好的1美元/ sqrt{n} $的依赖性。 我们通过显示它们导致对测试损失进行不遗漏的估计, 测试组由在MNIST和Fashion- MNIST中培训的一些神经网络结构来完成。