A classic problem in statistics is the estimation of the expectation of random variables from samples. This gives rise to the tightly connected problems of deriving concentration inequalities and confidence sequences, that is confidence intervals that hold uniformly over time. Previous work has shown how to easily convert the regret guarantee of an online betting algorithm into a time-uniform concentration inequality. In this paper, we show that we can go even further: We show that the regret of universal portfolio algorithms give rise to new implicit time-uniform concentrations and state-of-the-art empirically calculated confidence sequences. In particular, our numerically obtained confidence sequences can never be vacuous, even with a single sample, and satisfy the law of iterated logarithm.
翻译:典型的统计问题是估计抽样随机变量的预期值。 这引起了产生集中不平等和信心序列的紧密关联问题,即信任期间隔是长期一致的。 先前的工作已经表明,如何将在线赌博算法的遗憾保证轻易转换成时间一致的集中不平等。 在本文中,我们可以更进一步:我们显示,对通用组合算法的遗憾导致新的隐含时间一致的集中度和最先进的实验性计算信任序列。 特别是,我们从数字上获得的信任序列永远不可能是空洞的,即使有单一样本,也不可能满足迭代对数法则。