A classic problem in statistics is the estimation of the expectation of random variables from samples. This gives rise to the tightly connected problems of deriving concentration inequalities and confidence sequences, that is confidence intervals that hold uniformly over time. Jun and Orabona [COLT'19] have shown how to easily convert the regret guarantee of an online betting algorithm into a time-uniform concentration inequality. In this paper, we show that we can go even further: We show that the regret of universal portfolio algorithms give rise to new implicit time-uniform concentrations and state-of-the-art empirically calculated confidence sequences. In particular, our numerically obtained confidence sequences can be never vacuous, even with a single sample, and satisfy the law of iterated logarithm.
翻译:典型的统计问题是估计抽样随机变量的预期值。 这引起了产生集中不平等和信心序列的紧密关联问题,即信任期间隔随时间而异。 Jun 和 Orabona [COLT'19] 已经展示了如何将在线赌注算法的遗憾保证轻易转换为时间一致的集中不平等。 在本文中,我们展示了我们可以更进一步:我们表明,对通用组合算法的遗憾导致了新的隐含时间格式的集中和最先进的经经验计算的信任序列。 特别是,我们从数字上获得的信任序列永远不会消失,即使有单一样本,也不可能满足迭代对数法。