Crowdsourcing enables the solicitation of forecasts on a variety of prediction tasks from distributed groups of people. How to aggregate the solicited forecasts, which may vary in quality, into an accurate final prediction remains a challenging yet critical question. Studies have found that weighing expert forecasts more in aggregation can improve the accuracy of the aggregated prediction. However, this approach usually requires access to the historical performance data of the forecasters, which are often not available. In this paper, we study the problem of aggregating forecasts without having historical performance data. We propose using peer prediction methods, a family of mechanisms initially designed to truthfully elicit private information in the absence of ground truth verification, to assess the expertise of forecasters, and then using this assessment to improve forecast aggregation. We evaluate our peer-prediction-aided aggregators on a diverse collection of 14 human forecast datasets. Compared with a variety of existing aggregators, our aggregators achieve a significant and consistent improvement on aggregation accuracy measured by the Brier score and the log score. Our results reveal the effectiveness of identifying experts to improve aggregation even without historical data.
翻译:众包可以对分布人群的各种预测任务进行预测; 如何将所请求的预测质量可能各不相同的预测汇总为准确的最终预测,仍然是一个具有挑战性但又很关键的问题; 研究发现,将专家预测进行更综合的权衡,可以提高总体预测的准确性; 然而,这种办法通常需要查阅预报员的历史业绩数据,而这些数据往往没有提供; 本文研究在没有历史业绩数据的情况下汇总预测的问题; 我们提议采用同行预测方法,即最初设计一系列机制,以便在没有地面真相核实的情况下真实地获取私人信息,评估预报员的专门知识,然后利用这一评估改进预测汇总; 我们评估了不同收集14个人类预测数据集的同行辅助聚合器; 与现有的各种分类器相比,我们的聚合器在根据布里尔分和日志分测量的汇总准确性方面取得了显著和一致的改进。 我们的结果表明,即使没有历史数据,确定专家来改进汇总的有效性。