通过同侪预测汇总预测 (Forecast Aggregation via Peer Prediction) - 专知论文

会员服务 ·

0

模型评估 · 可辨认的 · 预测准确率 · 得分 · 均方误差 ·

2021 年 3 月 4 日

Forecast Aggregation via Peer Prediction

翻译：通过同侪预测汇总预测

Juntao Wang,Yang Liu,Yiling Chen

Crowdsourcing is a popular paradigm for soliciting forecasts on future events. As people may have different forecasts, how to aggregate solicited forecasts into a single accurate prediction remains to be an important challenge, especially when no historical accuracy information is available for identifying experts. In this paper, we borrow ideas from the peer prediction literature and assess the prediction accuracy of participants using solely the collected forecasts. This approach leverages the correlations among peer reports to cross-validate each participant's forecasts and allows us to assign a "peer assessment score (PAS)" for each agent as a proxy for the agent's prediction accuracy. We identify several empirically effective methods to generate PAS and propose an aggregation framework that uses PAS to identify experts and to boost existing aggregators' prediction accuracy. We evaluate our methods over 14 real-world datasets and show that i) PAS generated from peer prediction methods can approximately reflect the prediction accuracy of agents, and ii) our aggregation framework demonstrates consistent and significant improvement in the prediction accuracy over existing aggregators for both binary and multi-choice questions under three popular accuracy measures: Brier score (mean square error), log score (cross-entropy loss) and AUC-ROC.

翻译：由于人们可能有不同的预测,如何将索取的预测汇总成单一准确的预测仍是一项重大挑战,特别是当没有历史准确性信息可供鉴定专家时。在本文中,我们借用同行预测文献中的想法,并评估仅使用所收集的预测的参与者的预测准确性。这种方法利用同行报告之间的相互关系来交叉校验每个参与者的预测,并使我们能够为每个代理商指定一个“同行评估分数(PAS)”作为该代理商预测准确性的代理。我们确定了若干有效的实证方法,以生成考绩制度,并提议一个汇总框架,利用考绩制度确定专家,提高现有的聚合者的预测准确性。我们评估了14多个真实世界数据集的方法,并表明一)同行预测方法产生的考绩制度可以大致反映代理人的预测准确性,二)我们的汇总框架显示,在三种大众精确度措施下,对二分数和多选题的现有聚合器的预测准确性预测准确性都得到了一致和显著的改进:Brier评分(平均误差)、log-Cropy损失(CU)和Acrosty-CU。

0

相关内容

模型评估

机器学习系统设计系统评估标准

深度卷积神经网络图像语义分割研究进展

专知会员服务

87+阅读 · 2021年1月7日

图像分割方法综述

图像分割方法综述

专知会员服务

56+阅读 · 2020年11月22日

ECCV 2020 五项大奖出炉！普林斯顿邓嘉获最佳论文奖

ECCV 2020 五项大奖出炉！普林斯顿邓嘉获最佳论文奖

专知会员服务

14+阅读 · 2020年8月25日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

253+阅读 · 2020年4月19日

【医学图像处理中的因果性】52页ppt，Causality Matters in Medical Imaging

【医学图像处理中的因果性】52页ppt，Causality Matters in Medical Imaging

专知会员服务

60+阅读 · 2020年3月14日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

SIGIR2019 接收论文列表

SIGIR2019 接收论文列表

专知

18+阅读 · 2019年4月20日

【论文推荐】最新十篇目标跟踪相关论文—多帧光流跟踪、动态图学习、MV-YOLO、姿态估计、深度核相关滤波、Benchmark

【论文推荐】最新十篇目标跟踪相关论文—多帧光流跟踪、动态图学习、MV-YOLO、姿态估计、深度核相关滤波、Benchmark

专知

13+阅读 · 2018年5月26日

【论文推荐】最新六篇自动问答相关论文—排序函数、文本摘要评估、信息抽取框架、层次递归编码器、半监督问答

【论文推荐】最新六篇自动问答相关论文—排序函数、文本摘要评估、信息抽取框架、层次递归编码器、半监督问答

专知

9+阅读 · 2018年5月10日

LibRec 精选：推荐的可解释性[综述]

LibRec 精选：推荐的可解释性[综述]

LibRec智能推荐

10+阅读 · 2018年5月4日

【论文推荐】最新八篇目标跟踪相关论文—自适应相关滤波、因果关系图模型、TrackingNet、ClickBAIT、图像矩模型

【论文推荐】最新八篇目标跟踪相关论文—自适应相关滤波、因果关系图模型、TrackingNet、ClickBAIT、图像矩模型

专知

4+阅读 · 2018年4月18日

LibRec 精选：推荐系统9个必备数据集

LibRec 精选：推荐系统9个必备数据集

LibRec智能推荐

6+阅读 · 2018年3月7日

【泡泡一分钟】一种基于光场的快速有效深度图估计方法（3dv-43）

【泡泡一分钟】一种基于光场的快速有效深度图估计方法（3dv-43）

泡泡机器人SLAM

4+阅读 · 2018年2月11日

计算机视觉近一年进展综述

计算机视觉近一年进展综述

机器学习研究会

9+阅读 · 2017年11月25日

可解释的CNN

可解释的CNN

CreateAMind

17+阅读 · 2017年10月5日

【今日新增】IEEE Trans.专刊截稿信息8条

【今日新增】IEEE Trans.专刊截稿信息8条

Call4Papers

7+阅读 · 2017年6月29日

A Kernel-based Consensual Aggregation for Regression

Arxiv

0+阅读 · 2021年4月28日

UniTE -- The Best of Both Worlds: Unifying Function-Fitting and Aggregation-Based Approaches to Travel Time and Travel Speed Estimation

UniTE -- The Best of Both Worlds: Unifying Function-Fitting and Aggregation-Based Approaches to Travel Time and Travel Speed Estimation

Arxiv

0+阅读 · 2021年4月27日

A Human-Centered Interpretability Framework Based on Weight of Evidence

Arxiv

0+阅读 · 2021年4月27日

Novel bivariate autoregressive model for predicting and forecasting irregularly observed time series

Arxiv

0+阅读 · 2021年4月25日

Inductive Relation Prediction by Subgraph Reasoning

Inductive Relation Prediction by Subgraph Reasoning

Arxiv

11+阅读 · 2020年2月12日

STGRAT: A Spatio-Temporal Graph Attention Network for Traffic Forecasting

STGRAT: A Spatio-Temporal Graph Attention Network for Traffic Forecasting

Arxiv

9+阅读 · 2019年11月29日

Multi-Range Attentive Bicomponent Graph Convolutional Network for Traffic Forecasting

Arxiv

3+阅读 · 2019年11月27日

Knowledge Graph Alignment Network with Gated Multi-hop Neighborhood Aggregation

Arxiv

19+阅读 · 2019年11月20日

Foreground-aware Image Inpainting

Foreground-aware Image Inpainting

Arxiv

4+阅读 · 2019年1月17日

Evidence Aggregation for Answer Re-Ranking in Open-Domain Question Answering

Arxiv

8+阅读 · 2018年4月26日

VIP会员

文章信息

相关主题

预测准确率

相关VIP内容

深度卷积神经网络图像语义分割研究进展

专知会员服务

87+阅读 · 2021年1月7日

图像分割方法综述

图像分割方法综述

专知会员服务

56+阅读 · 2020年11月22日

ECCV 2020 五项大奖出炉！普林斯顿邓嘉获最佳论文奖

ECCV 2020 五项大奖出炉！普林斯顿邓嘉获最佳论文奖

专知会员服务

14+阅读 · 2020年8月25日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

253+阅读 · 2020年4月19日

【医学图像处理中的因果性】52页ppt，Causality Matters in Medical Imaging

【医学图像处理中的因果性】52页ppt，Causality Matters in Medical Imaging

专知会员服务

60+阅读 · 2020年3月14日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

最新《扩散模型原理》新书，470页pdf

无人机作战：演进、创新与未来战场

AI 智能体简史

多模态空间推理在大模型时代：综述与基准测试

相关资讯

SIGIR2019 接收论文列表

SIGIR2019 接收论文列表

专知

18+阅读 · 2019年4月20日

【论文推荐】最新十篇目标跟踪相关论文—多帧光流跟踪、动态图学习、MV-YOLO、姿态估计、深度核相关滤波、Benchmark

【论文推荐】最新十篇目标跟踪相关论文—多帧光流跟踪、动态图学习、MV-YOLO、姿态估计、深度核相关滤波、Benchmark

专知

13+阅读 · 2018年5月26日

【论文推荐】最新六篇自动问答相关论文—排序函数、文本摘要评估、信息抽取框架、层次递归编码器、半监督问答

【论文推荐】最新六篇自动问答相关论文—排序函数、文本摘要评估、信息抽取框架、层次递归编码器、半监督问答

专知

9+阅读 · 2018年5月10日

LibRec 精选：推荐的可解释性[综述]

LibRec 精选：推荐的可解释性[综述]

LibRec智能推荐

10+阅读 · 2018年5月4日

【论文推荐】最新八篇目标跟踪相关论文—自适应相关滤波、因果关系图模型、TrackingNet、ClickBAIT、图像矩模型

【论文推荐】最新八篇目标跟踪相关论文—自适应相关滤波、因果关系图模型、TrackingNet、ClickBAIT、图像矩模型

专知

4+阅读 · 2018年4月18日

LibRec 精选：推荐系统9个必备数据集

LibRec 精选：推荐系统9个必备数据集

LibRec智能推荐

6+阅读 · 2018年3月7日

【泡泡一分钟】一种基于光场的快速有效深度图估计方法（3dv-43）

【泡泡一分钟】一种基于光场的快速有效深度图估计方法（3dv-43）

泡泡机器人SLAM

4+阅读 · 2018年2月11日

计算机视觉近一年进展综述

计算机视觉近一年进展综述

机器学习研究会

9+阅读 · 2017年11月25日

可解释的CNN

可解释的CNN

CreateAMind

17+阅读 · 2017年10月5日

【今日新增】IEEE Trans.专刊截稿信息8条

【今日新增】IEEE Trans.专刊截稿信息8条

Call4Papers

7+阅读 · 2017年6月29日

相关论文

A Kernel-based Consensual Aggregation for Regression

Arxiv

0+阅读 · 2021年4月28日

UniTE -- The Best of Both Worlds: Unifying Function-Fitting and Aggregation-Based Approaches to Travel Time and Travel Speed Estimation

UniTE -- The Best of Both Worlds: Unifying Function-Fitting and Aggregation-Based Approaches to Travel Time and Travel Speed Estimation

Arxiv

0+阅读 · 2021年4月27日

A Human-Centered Interpretability Framework Based on Weight of Evidence

Arxiv

0+阅读 · 2021年4月27日

Novel bivariate autoregressive model for predicting and forecasting irregularly observed time series

Arxiv

0+阅读 · 2021年4月25日

Inductive Relation Prediction by Subgraph Reasoning

Inductive Relation Prediction by Subgraph Reasoning

Arxiv

11+阅读 · 2020年2月12日

STGRAT: A Spatio-Temporal Graph Attention Network for Traffic Forecasting

STGRAT: A Spatio-Temporal Graph Attention Network for Traffic Forecasting

Arxiv

9+阅读 · 2019年11月29日

Multi-Range Attentive Bicomponent Graph Convolutional Network for Traffic Forecasting

Arxiv

3+阅读 · 2019年11月27日

Knowledge Graph Alignment Network with Gated Multi-hop Neighborhood Aggregation

Arxiv

19+阅读 · 2019年11月20日

Foreground-aware Image Inpainting

Foreground-aware Image Inpainting

Arxiv

4+阅读 · 2019年1月17日

Evidence Aggregation for Answer Re-Ranking in Open-Domain Question Answering

Arxiv

8+阅读 · 2018年4月26日

微信扫码咨询专知VIP会员