For many practical, high-risk applications, it is essential to quantify uncertainty in a model's predictions to avoid costly mistakes. While predictive uncertainty is widely studied for neural networks, the topic seems to be under-explored for models based on gradient boosting. However, gradient boosting often achieves state-of-the-art results on tabular data. This work examines a probabilistic ensemble-based framework for deriving uncertainty estimates in the predictions of gradient boosting classification and regression models. We conducted experiments on a range of synthetic and real datasets and investigated the applicability of ensemble approaches to gradient boosting models that are themselves ensembles of decision trees. Our analysis shows that ensembles of gradient boosting models successfully detect anomalous inputs while having limited ability to improve the predicted total uncertainty. Importantly, we also propose a concept of a virtual ensemble to get the benefits of an ensemble via only one gradient boosting model, which significantly reduces complexity.
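To make the ensemble-based idea concrete, the following is a minimal sketch, not the authors' implementation: the synthetic dataset, the use of scikit-learn's GradientBoostingRegressor, the seed-based explicit ensemble, and the staged-prediction stand-in for a virtual ensemble are all illustrative assumptions.

```python
# A minimal sketch (assumed setup, not the paper's exact method): estimate
# predictive uncertainty from (a) an explicit ensemble of independently seeded
# gradient boosting models and (b) a "virtual ensemble"-style trick that reuses
# truncations of a single boosted model via staged predictions.
import numpy as np
from sklearn.datasets import make_regression
from sklearn.ensemble import GradientBoostingRegressor

X, y = make_regression(n_samples=500, n_features=8, noise=10.0, random_state=0)
X_in = X[:5]          # in-distribution query points
X_out = X[:5] * 5.0   # crudely shifted points standing in for anomalous inputs

# (a) Explicit ensemble: several models differing only in their random seed.
members = [
    GradientBoostingRegressor(n_estimators=300, subsample=0.8, random_state=s).fit(X, y)
    for s in range(10)
]

def ensemble_uncertainty(models, X_query):
    """Mean prediction and between-member variance (a knowledge-uncertainty proxy)."""
    preds = np.stack([m.predict(X_query) for m in models])  # (n_models, n_points)
    return preds.mean(axis=0), preds.var(axis=0)

mean_in, var_in = ensemble_uncertainty(members, X_in)
mean_out, var_out = ensemble_uncertainty(members, X_out)
print("explicit ensemble variance, in-distribution vs. shifted:", var_in.mean(), var_out.mean())

# (b) Virtual-ensemble-style sketch from ONE model: treat the model truncated
# at different numbers of trees as separate ensemble members.
single = GradientBoostingRegressor(n_estimators=300, subsample=0.8, random_state=0).fit(X, y)
stages = np.stack(list(single.staged_predict(X_in)))  # (n_estimators, n_points)
virtual_members = stages[150::25]                     # later, spaced-out truncations
print("virtual ensemble variance (in-distribution):", virtual_members.var(axis=0).mean())
```

In both variants the spread between members is what signals unfamiliar inputs; the shifted points would typically yield a larger between-member variance than the in-distribution ones, mirroring the abstract's observation that such ensembles are useful mainly for detecting anomalous inputs, while the virtual variant extracts its members from a single trained model and thus avoids the cost of training many models.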