29 November 2022

Scaling-aware rating of count forecasts


Malte C. Tichy,Illia Babounikau,Nikolas Wolke,Stefan Ulbrich,Michael Feindt

From arXiv, 41 pages, 13 figures

Forecasts crave a rating that reflects the forecast's quality in the context of what is possible in theory and what is reasonable to expect in practice. Granular forecasts in the regime of low count rates - as they often occur in retail, for which an intermittent demand of a handful might be observed per product, day, and location - are dominated by the inevitable statistical uncertainty of the Poisson distribution. This makes it hard to judge whether a certain metric value is dominated by Poisson noise or truly indicates a bad prediction model. To make things worse, every evaluation metric suffers from scaling: its value is mostly defined by the predicted selling rate and the resulting rate-dependent Poisson noise, and only secondarily by the quality of the forecast. For any metric, comparing two groups of forecasted products often yields "the slow movers are performing worse than the fast movers" or vice versa - the naïve scaling trap. To distill the intrinsic quality of a forecast, we stratify predictions into buckets of approximately equal rate and evaluate metrics for each bucket separately. By comparing the achieved value per bucket to benchmarks, we obtain a scaling-aware rating of count forecasts. Our procedure avoids the naïve scaling trap, provides an immediate intuitive judgment of forecast quality, and allows comparing forecasts for different products or even industries.
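The bucketing idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `bucketed_rating`, the choice of mean absolute error as the metric, the quantile-based bucket edges, and the simulated ideal-Poisson benchmark are all assumptions made here for illustration. The sketch groups forecasts into buckets of roughly equal predicted rate, computes the metric per bucket, and compares it with the value that would be expected if the observed counts were truly Poisson-distributed around the predicted rates.

```python
import numpy as np


def bucketed_rating(predicted, observed, n_buckets=5, n_sim=200, seed=0):
    """Per-bucket MAE next to a simulated ideal-Poisson benchmark (illustrative)."""
    predicted = np.asarray(predicted, dtype=float)
    observed = np.asarray(observed, dtype=float)
    rng = np.random.default_rng(seed)

    # Quantile edges yield buckets with approximately equal predicted rate.
    edges = np.quantile(predicted, np.linspace(0.0, 1.0, n_buckets + 1))
    # Map every prediction to a bucket index in [0, n_buckets - 1].
    idx = np.clip(
        np.searchsorted(edges, predicted, side="right") - 1, 0, n_buckets - 1
    )

    rows = []
    for b in range(n_buckets):
        mask = idx == b
        if not mask.any():
            continue  # can happen when many predictions are identical
        rates = predicted[mask]
        achieved = float(np.mean(np.abs(observed[mask] - rates)))
        # Benchmark: the MAE obtained if outcomes were exactly Poisson
        # with the predicted rates, estimated by simulation.
        sims = rng.poisson(rates, size=(n_sim, rates.size))
        benchmark = float(np.mean(np.abs(sims - rates)))
        ratio = achieved / benchmark if benchmark > 0 else float("nan")
        rows.append(
            {
                "bucket": b,
                "mean_rate": float(rates.mean()),
                "n": int(mask.sum()),
                "mae": achieved,
                "poisson_mae": benchmark,
                "ratio": ratio,
            }
        )
    return rows


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    true_rates = rng.uniform(0.1, 10.0, size=20_000)
    sales = rng.poisson(true_rates)
    for row in bucketed_rating(true_rates, sales):
        print(row)
```

A ratio close to 1 in a bucket suggests that the metric value is explained by Poisson noise alone, while a clearly larger ratio hints at a forecast problem at that rate. The raw MAE grows with the predicted rate even for a perfect forecast, which is why the per-bucket comparison is more informative than comparing slow and fast movers directly.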

