预测校准有顺序有效测试 (Sequentially valid tests for forecast calibration) - 专知论文

会员服务 ·

0

统计量 · 正则化项 · 随机变量 · TOOLS · 周期的 ·

2022 年 7 月 1 日

Sequentially valid tests for forecast calibration

翻译：预测校准有顺序有效测试

Sebastian Arnold,Alexander Henzi,Johanna F. Ziegel

Forecasting and forecast evaluation are inherently sequential tasks. Predictions are often issued on a regular basis, such as every hour, day, or month, and their quality is monitored continuously. However, the classical statistical tools for forecast evaluation are static, in the sense that statistical tests for forecast calibration are only valid if the evaluation period is fixed in advance. Recently, e-values have been introduced as a new, dynamic method for assessing statistical significance. An e-value is a non-negative random variable with expected value at most one under a null hypothesis. Large e-values give evidence against the null hypothesis, and the multiplicative inverse of an e-value is a conservative p-value. E-values are particularly suitable for sequential forecast evaluation, since they naturally lead to statistical tests which are valid under optional stopping. This article proposes e-values for testing probabilistic calibration of forecasts, which is one of the most important notions of calibration. The proposed methods are also more generally applicable for sequential goodness-of-fit testing. We demonstrate that the e-values are competitive in terms of power when compared to extant methods, which do not allow sequential testing. Furthermore, they provide important and useful insights in the evaluation of probabilistic weather forecasts.

翻译：预测和预测评价是必然的相继任务。预测通常定期发布,如每小时、日或月,并不断监测其质量。然而,预测评价的典型统计工具是静态的,因为预测校准的统计测试只有在评价期提前固定的情况下才有效。最近,电子价值被引入为评估统计意义的一种新的动态方法。电子价值是一种非负性随机变量,在完全假设下,其预期值最多为一个。大型电子价值提供证据反对无效假设,电子价值的倍增反面是一种保守的p价值。电子价值特别适合顺序预测评价,因为它们自然导致统计测试,而这种测试在任择性停止的情况下是有效的。这一文章提出了测试预测的概率校准电子价值,这是最重要的校准概念之一。拟议方法也更普遍地适用于连续性良好测试。我们证明,电子价值在能力方面与远端的天气预测方法相比具有竞争性,因此无法进行重要的连续性观测。此外,电子价值提供了重要的连续性预测。此外,电子价值提供其评估在与远端的天气预测中具有竞争性。

0

相关内容

统计量

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

拓扑绝缘体与超导体耦合体系中交叉Andreev反射研究

国家自然科学基金

1+阅读 · 2014年12月31日

马铃薯晚疫病菌Phytophthora infestans中miRNA的研究与分析

国家自然科学基金

0+阅读 · 2013年12月31日

SDIR1互作蛋白ECA1在植物应对干旱胁迫过程中的功能分析

国家自然科学基金

0+阅读 · 2012年12月31日

非磁性元素掺杂稀磁半导体铁磁性机理研究的新方法

国家自然科学基金

0+阅读 · 2012年12月31日

玻璃衬底上p型自掺杂硅纳米线阵列的制备及在径向异质结光伏器件中的应用

国家自然科学基金

0+阅读 · 2012年12月31日

基于太阳电池的新型掺杂C12A7透明导电薄膜制备及其光电特性研究

国家自然科学基金

0+阅读 · 2012年12月31日

雪崩倍增低温砷化镓高功率光电导太赫兹辐射源研究

国家自然科学基金

0+阅读 · 2012年12月31日

Li3V2(PO4)3快离子导体掺杂改性LiMnPO4/C纳米复合材料的基础研究

国家自然科学基金

0+阅读 · 2011年12月31日

DNA复制中Cdc45在染色体上动态行为的新机理研究

国家自然科学基金

0+阅读 · 2009年12月31日

自旋极化电子在磁性半导体及其异质结中的输运研究

国家自然科学基金

0+阅读 · 2009年12月31日

Calibration of P-values for calibration and for deviation of a subpopulation from the full population

Arxiv

0+阅读 · 2022年8月24日

Multivariate Boosted Trees and Applications to Forecasting and Control

Arxiv

0+阅读 · 2022年8月22日

Robust Tests in Online Decision-Making

Arxiv

0+阅读 · 2022年8月21日

An Unsupervised Short- and Long-Term Mask Representation for Multivariate Time Series Anomaly Detection

Arxiv

0+阅读 · 2022年8月19日

Parametric and Multivariate Uncertainty Calibration for Regression and Object Detection

Arxiv

0+阅读 · 2022年8月19日

Semi-Random Impossibilities of Condorcet Criterion

Arxiv

0+阅读 · 2022年8月18日

Optimized Equivalent Linearization for Random Vibration

Arxiv

0+阅读 · 2022年8月18日

Deletion and Insertion Tests in Regression Models

Arxiv

0+阅读 · 2022年8月18日

Forecasting: theory and practice

Arxiv

57+阅读 · 2022年1月5日

Connecting the Dots: Multivariate Time Series Forecasting with Graph Neural Networks

Arxiv

36+阅读 · 2020年5月24日

VIP会员

文章信息

相关主题

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

人工智能赋能自主武器与人类控制第三部分：人类控制与系统操作员 | 35页

人工智能赋能自主武器与人类控制第一部分：人类控制与机器学习的设计和开发 | 46页

军事指挥控制系统：2025年5种用途

人工智能赋能自主武器与人类控制第二部分：人类控制与军事指挥官 | 38页

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

Calibration of P-values for calibration and for deviation of a subpopulation from the full population

Arxiv

0+阅读 · 2022年8月24日

Multivariate Boosted Trees and Applications to Forecasting and Control

Arxiv

0+阅读 · 2022年8月22日

Robust Tests in Online Decision-Making

Arxiv

0+阅读 · 2022年8月21日

An Unsupervised Short- and Long-Term Mask Representation for Multivariate Time Series Anomaly Detection

Arxiv

0+阅读 · 2022年8月19日

Parametric and Multivariate Uncertainty Calibration for Regression and Object Detection

Arxiv

0+阅读 · 2022年8月19日

Semi-Random Impossibilities of Condorcet Criterion

Arxiv

0+阅读 · 2022年8月18日

Optimized Equivalent Linearization for Random Vibration

Arxiv

0+阅读 · 2022年8月18日

Deletion and Insertion Tests in Regression Models

Arxiv

0+阅读 · 2022年8月18日

Forecasting: theory and practice

Arxiv

57+阅读 · 2022年1月5日

Connecting the Dots: Multivariate Time Series Forecasting with Graph Neural Networks

Arxiv

36+阅读 · 2020年5月24日

相关基金

拓扑绝缘体与超导体耦合体系中交叉Andreev反射研究

国家自然科学基金

1+阅读 · 2014年12月31日

马铃薯晚疫病菌Phytophthora infestans中miRNA的研究与分析

国家自然科学基金

0+阅读 · 2013年12月31日

SDIR1互作蛋白ECA1在植物应对干旱胁迫过程中的功能分析

国家自然科学基金

0+阅读 · 2012年12月31日

非磁性元素掺杂稀磁半导体铁磁性机理研究的新方法

国家自然科学基金

0+阅读 · 2012年12月31日

玻璃衬底上p型自掺杂硅纳米线阵列的制备及在径向异质结光伏器件中的应用

国家自然科学基金

0+阅读 · 2012年12月31日

基于太阳电池的新型掺杂C12A7透明导电薄膜制备及其光电特性研究

国家自然科学基金

0+阅读 · 2012年12月31日

雪崩倍增低温砷化镓高功率光电导太赫兹辐射源研究

国家自然科学基金

0+阅读 · 2012年12月31日

Li3V2(PO4)3快离子导体掺杂改性LiMnPO4/C纳米复合材料的基础研究

国家自然科学基金

0+阅读 · 2011年12月31日

DNA复制中Cdc45在染色体上动态行为的新机理研究

国家自然科学基金

0+阅读 · 2009年12月31日

自旋极化电子在磁性半导体及其异质结中的输运研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员