通过分步走道的下下弯道 (Stronger Calibration Lower Bounds via Sidestepping) - 专知论文

会员服务 ·

0

相互独立的 · 情景 · 早停 · binary · 状态空间 ·

2020 年 12 月 7 日

Stronger Calibration Lower Bounds via Sidestepping

翻译：通过分步走道的下下弯道

Mingda Qiao,Gregory Valiant

We consider an online binary prediction setting where a forecaster observes a sequence of $T$ bits one by one. Before each bit is revealed, the forecaster predicts the probability that the bit is $1$. The forecaster is called well-calibrated if for each $p \in [0, 1]$, among the $n_p$ bits for which the forecaster predicts probability $p$, the actual number of ones, $m_p$, is indeed equal to $p \cdot n_p$. The calibration error, defined as $\sum_p |m_p - p n_p|$, quantifies the extent to which the forecaster deviates from being well-calibrated. It has long been known that an $O(T^{2/3})$ calibration error is achievable even when the bits are chosen adversarially, and possibly based on the previous predictions. However, little is known on the lower bound side, except an $\Omega(\sqrt{T})$ bound that follows from the trivial example of independent fair coin flips. In this paper, we prove an $\Omega(T^{0.528})$ bound on the calibration error, which is the first super-$\sqrt{T}$ lower bound for this setting to the best of our knowledge. The technical contributions of our work include two lower bound techniques, early stopping and sidestepping, which circumvent the obstacles that have previously hindered strong calibration lower bounds. We also propose an abstraction of the prediction setting, termed the Sign-Preservation game, which may be of independent interest. This game has a much smaller state space than the full prediction setting and allows simpler analyses. The $\Omega(T^{0.528})$ lower bound follows from a general reduction theorem that translates lower bounds on the game value of Sign-Preservation into lower bounds on the calibration error.

翻译：我们考虑一个在线的二进制预测设置, 预测者在其中观察的顺序是 $T$ 的一比一。在每位曝光之前, 预报者预测的概率是 1美元。如果每个 $ p $ [0, 1] 美元, 预测者被称作完全校准。在预测者预测的概率为 $ p 美元的百分位中, 实际的美元( $_ p美元) 等于 $p\ cdot n_ p_ p 美元。校准错误, 定义为 $ sum_ p_ m_ p_ p n_ p $, 预测者预测者被调准为 1美元。预测者早已知道 $O (T%2/3} 美元) 校准错误是可以实现的, 实际数, 实际数, 可能是根据先前的预测值。但是, 下限的一面面的值, 除了 $( sqr) 和下方的更下方的值, 更低的值被绑起来, 。

0

相关内容

相互独立的

相互独立的

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

(普林斯顿讲义)：高维概率论，326页pdf《Probability in High Dimension》

(普林斯顿讲义)：高维概率论，326页pdf《Probability in High Dimension》

专知会员服务

123+阅读 · 2020年5月30日

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

专知会员服务

112+阅读 · 2020年5月15日

来自Fariz Darari博士的一份简明《神经网络与深度学习》的讲义，64页ppt

来自Fariz Darari博士的一份简明《神经网络与深度学习》的讲义，64页ppt

专知会员服务

93+阅读 · 2020年5月5日

元学习与图神经网络逻辑推导，55页ppt

元学习与图神经网络逻辑推导，55页ppt

专知会员服务

129+阅读 · 2020年4月25日

【经典书】数据挖掘：理论、算法与示例，347页pdf，Nong Ye，Arizona State University

【经典书】数据挖掘：理论、算法与示例，347页pdf，Nong Ye，Arizona State University

专知会员服务

82+阅读 · 2020年2月27日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

灾难性遗忘问题新视角：迁移-干扰平衡

灾难性遗忘问题新视角：迁移-干扰平衡

CreateAMind

17+阅读 · 2019年7月6日

IEEE | DSC 2019诚邀稿件 (EI检索)

IEEE | DSC 2019诚邀稿件 (EI检索)

Call4Papers

10+阅读 · 2019年2月25日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【关关的刷题日记47】Leetcode 38. Count and Say

【关关的刷题日记47】Leetcode 38. Count and Say

专知

3+阅读 · 2017年11月25日

【LeetCode 202】关关的刷题日记35 – Leetcode 202. Happy Number

【LeetCode 202】关关的刷题日记35 – Leetcode 202. Happy Number

专知

5+阅读 · 2017年11月13日

【LeetCode 409】关关的刷题日记31Longest Palindrome

【LeetCode 409】关关的刷题日记31Longest Palindrome

专知

4+阅读 · 2017年11月9日

关关的刷题日记13——Leetcode 414. Third Maximum Number

关关的刷题日记13——Leetcode 414. Third Maximum Number

专知

3+阅读 · 2017年10月8日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

RL for Latent MDPs: Regret Guarantees and a Lower Bound

RL for Latent MDPs: Regret Guarantees and a Lower Bound

Arxiv

0+阅读 · 2021年2月9日

Lower Bounds on the Integraliy Ratio of the Subtour LP for the Traveling Salesman Problem

Arxiv

0+阅读 · 2021年2月9日

Fine-Grained Gap-Dependent Bounds for Tabular MDPs via Adaptive Multi-Step Bootstrap

Arxiv

0+阅读 · 2021年2月9日

Higher Strong Order Methods for Itô SDEs on Matrix Lie Groups

Arxiv

0+阅读 · 2021年2月8日

Lower Bounds and Accelerated Algorithms for Bilevel Optimization

Arxiv

0+阅读 · 2021年2月7日

Lie complexity of words

Arxiv

0+阅读 · 2021年2月7日

All Sampling Methods Produce Outliers

Arxiv

0+阅读 · 2021年2月6日

Scalable Inference of Sparsely-changing Markov Random Fields with Strong Statistical Guarantees

Arxiv

0+阅读 · 2021年2月6日

Parameterized Complexity of Immunization in the Threshold Model

Arxiv

0+阅读 · 2021年2月6日

Variance Reduction Methods for Sublinear Reinforcement Learning

Arxiv

4+阅读 · 2018年4月25日

VIP会员

文章信息

相关主题

相互独立的

相关VIP内容

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

(普林斯顿讲义)：高维概率论，326页pdf《Probability in High Dimension》

(普林斯顿讲义)：高维概率论，326页pdf《Probability in High Dimension》

专知会员服务

123+阅读 · 2020年5月30日

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

专知会员服务

112+阅读 · 2020年5月15日

来自Fariz Darari博士的一份简明《神经网络与深度学习》的讲义，64页ppt

来自Fariz Darari博士的一份简明《神经网络与深度学习》的讲义，64页ppt

专知会员服务

93+阅读 · 2020年5月5日

元学习与图神经网络逻辑推导，55页ppt

元学习与图神经网络逻辑推导，55页ppt

专知会员服务

129+阅读 · 2020年4月25日

【经典书】数据挖掘：理论、算法与示例，347页pdf，Nong Ye，Arizona State University

【经典书】数据挖掘：理论、算法与示例，347页pdf，Nong Ye，Arizona State University

专知会员服务

82+阅读 · 2020年2月27日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

大语言模型幻觉：系统综述

《分析与预测陆军战斗体能测试表现：统计与机器学习方法》2025最新137页

【博士论文】数据与任务的物理学：深度学习中的局部性与组合性理论

代理式人工智能时代的决策优势

相关资讯

灾难性遗忘问题新视角：迁移-干扰平衡

灾难性遗忘问题新视角：迁移-干扰平衡

CreateAMind

17+阅读 · 2019年7月6日

IEEE | DSC 2019诚邀稿件 (EI检索)

IEEE | DSC 2019诚邀稿件 (EI检索)

Call4Papers

10+阅读 · 2019年2月25日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【关关的刷题日记47】Leetcode 38. Count and Say

【关关的刷题日记47】Leetcode 38. Count and Say

专知

3+阅读 · 2017年11月25日

【LeetCode 202】关关的刷题日记35 – Leetcode 202. Happy Number

【LeetCode 202】关关的刷题日记35 – Leetcode 202. Happy Number

专知

5+阅读 · 2017年11月13日

【LeetCode 409】关关的刷题日记31Longest Palindrome

【LeetCode 409】关关的刷题日记31Longest Palindrome

专知

4+阅读 · 2017年11月9日

关关的刷题日记13——Leetcode 414. Third Maximum Number

关关的刷题日记13——Leetcode 414. Third Maximum Number

专知

3+阅读 · 2017年10月8日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

相关论文

RL for Latent MDPs: Regret Guarantees and a Lower Bound

RL for Latent MDPs: Regret Guarantees and a Lower Bound

Arxiv

0+阅读 · 2021年2月9日

Lower Bounds on the Integraliy Ratio of the Subtour LP for the Traveling Salesman Problem

Arxiv

0+阅读 · 2021年2月9日

Fine-Grained Gap-Dependent Bounds for Tabular MDPs via Adaptive Multi-Step Bootstrap

Arxiv

0+阅读 · 2021年2月9日

Higher Strong Order Methods for Itô SDEs on Matrix Lie Groups

Arxiv

0+阅读 · 2021年2月8日

Lower Bounds and Accelerated Algorithms for Bilevel Optimization

Arxiv

0+阅读 · 2021年2月7日

Lie complexity of words

Arxiv

0+阅读 · 2021年2月7日

All Sampling Methods Produce Outliers

Arxiv

0+阅读 · 2021年2月6日

Scalable Inference of Sparsely-changing Markov Random Fields with Strong Statistical Guarantees

Arxiv

0+阅读 · 2021年2月6日

Parameterized Complexity of Immunization in the Threshold Model

Arxiv

0+阅读 · 2021年2月6日

Variance Reduction Methods for Sublinear Reinforcement Learning

Arxiv

4+阅读 · 2018年4月25日

微信扫码咨询专知VIP会员