Underlying the use of statistical approaches for a wide range of applications is the assumption that the probabilities obtained from a statistical model are representative of the "true" probability that an event, or outcome, will occur. Unfortunately, this is not the case for modern deep neural networks, which are often observed to be poorly calibrated. Additionally, these deep learning approaches make use of large numbers of model parameters, motivating Bayesian, or ensemble approximation, approaches to handle issues with parameter estimation. This paper explores the application of calibration schemes to deep ensembles, both from a theoretical perspective and empirically on a standard image classification task, CIFAR-100. The underlying theoretical requirements for calibration, and the associated calibration criteria, are first described. It is shown that well-calibrated ensemble members will not necessarily yield a well-calibrated ensemble prediction, and that if the ensemble prediction is well calibrated, its performance cannot exceed the average performance of the calibrated ensemble members. On CIFAR-100, the impact of calibration on ensemble prediction, and the associated calibration criteria, is evaluated. Additionally, the situation where multiple different topologies are combined is discussed.
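The claim that calibrated members need not yield a calibrated ensemble can be illustrated with a toy construction (this example is illustrative only, not taken from the paper): four equally likely inputs with deterministic binary labels, and two members whose predicted probabilities each match the empirical label rate within every confidence bin, yet whose average does not.

```python
import numpy as np

# Four equally likely inputs x1..x4 with deterministic binary labels.
labels = np.array([1, 1, 0, 0])

# Member A is confident and correct: within each confidence level,
# the predicted probability equals the empirical positive rate.
member_a = np.array([1.0, 1.0, 0.0, 0.0])

# Member B hedges on x2 and x3 at 0.5; the 0.5 bin contains one
# positive and one negative, so B is also perfectly calibrated.
member_b = np.array([1.0, 0.5, 0.5, 0.0])

def calibration_gaps(probs, labels):
    """For each distinct predicted probability, return the absolute gap
    between that probability and the empirical positive rate among the
    inputs receiving it (zero everywhere means perfectly calibrated)."""
    return {p: abs(p - labels[probs == p].mean()) for p in np.unique(probs)}

print(calibration_gaps(member_a, labels))  # all gaps 0: calibrated
print(calibration_gaps(member_b, labels))  # all gaps 0: calibrated

# The equal-weight ensemble average is [1.0, 0.75, 0.25, 0.0]:
# the 0.75 bin holds only a positive (rate 1) and the 0.25 bin only a
# negative (rate 0), so the ensemble is miscalibrated by 0.25 in each.
ensemble = (member_a + member_b) / 2
print(calibration_gaps(ensemble, labels))
```

The same effect persists under binning-based metrics such as expected calibration error, which is why the paper argues that calibration must be addressed at the ensemble level rather than only per member.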