Quantification of uncertainty is one of the most promising approaches to establishing safe machine learning. Despite its importance, the problem is far from solved in general, especially for neural networks. One of the most commonly used approaches so far is Monte Carlo dropout, which is computationally cheap and easy to apply in practice. However, it can underestimate uncertainty. We propose a new objective, referred to as second-moment loss (SML), to address this issue. While the full network is encouraged to model the mean, the dropout networks are explicitly used to optimize the model variance. We extensively study the performance of the new objective on a variety of UCI regression datasets. Compared to the state-of-the-art deep ensembles, SML yields comparable prediction accuracy and uncertainty estimates while requiring only a single model. Under distribution shift, we observe moderate improvements. As a side result, we introduce an intuitive Wasserstein distance-based uncertainty measure that is non-saturating and thus makes it possible to resolve quality differences between any two uncertainty estimates.
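To make the training signal concrete, the following is a minimal PyTorch-style sketch of a second-moment-style objective, assuming one plausible reading of the abstract: the full network (dropout disabled) fits the mean via MSE, while dropout forward passes are trained so that their spread around the full-network prediction matches the observed squared residual. The weighting `lam`, the number of passes `n_samples`, and the exact matching term are illustrative assumptions, not the paper's precise formulation.

```python
import torch


def second_moment_style_loss(model, x, y, n_samples=5, lam=1.0):
    """Hedged sketch of an SML-like objective for a dropout regression model."""
    # Mean prediction from the full network (dropout disabled via eval mode;
    # gradients still flow to the parameters).
    model.eval()
    mu = model(x)
    mean_loss = torch.mean((y - mu) ** 2)

    # Target second moment: squared residuals, detached so this term
    # does not pull on the mean fit.
    residual_sq = ((y - mu) ** 2).detach()

    # Dropout forward passes: their squared deviation from the (detached)
    # full-network mean is fit to the observed squared residual.
    model.train()
    var_loss = 0.0
    for _ in range(n_samples):
        f_d = model(x)
        var_loss = var_loss + torch.mean(
            ((f_d - mu.detach()) ** 2 - residual_sq) ** 2
        )
    var_loss = var_loss / n_samples

    return mean_loss + lam * var_loss
```

At inference time, the mean would come from the full network and the uncertainty from the spread of dropout samples, as in standard Monte Carlo dropout; the sketch only changes how that spread is trained.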
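To illustrate the kind of evaluation a non-saturating measure enables, here is a hedged sketch that scores Gaussian uncertainty estimates against held-out residuals via the 1-Wasserstein distance from scipy. The comparison protocol (zero-mean Gaussians, simulated residuals) is an assumption for illustration, not the paper's exact definition of the measure.

```python
import numpy as np
from scipy.stats import wasserstein_distance

rng = np.random.default_rng(0)

# Held-out residuals y - mu(x); here simulated with true scale 1.0.
residuals = rng.normal(loc=0.0, scale=1.0, size=10_000)


def uncertainty_score(pred_sigma, residuals, rng, n=10_000):
    """1-Wasserstein distance between the predicted error distribution
    N(0, pred_sigma^2) and the empirical residuals (lower is better).
    Unlike likelihood-based scores, this distance does not saturate,
    so any two estimates remain comparable."""
    predicted = rng.normal(0.0, pred_sigma, size=n)
    return wasserstein_distance(predicted, residuals)


print(uncertainty_score(0.5, residuals, rng))  # overconfident estimate
print(uncertainty_score(1.0, residuals, rng))  # well-calibrated estimate
```

Because the distance grows smoothly as the predicted scale drifts from the empirical one, even two badly calibrated estimates can still be ranked against each other.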