你的回归模型的不确定性在现实世界分配的转变下有多可靠? (How Reliable is Your Regression Model's Uncertainty Under Real-World Distribution Shifts?) - 专知论文

会员服务 ·

0

估计/估计量 · MoDELS · Automator · Extensibility · motivation ·

2023 年 2 月 7 日

How Reliable is Your Regression Model's Uncertainty Under Real-World Distribution Shifts?

翻译：你的回归模型的不确定性在现实世界分配的转变下有多可靠?

Fredrik K. Gustafsson,Martin Danelljan,Thomas B. Schön

from arxiv, Code is available at https://github.com/fregu856/regression_uncertainty

Many important computer vision applications are naturally formulated as regression problems. Within medical imaging, accurate regression models have the potential to automate various tasks, helping to lower costs and improve patient outcomes. Such safety-critical deployment does however require reliable estimation of model uncertainty, also under the wide variety of distribution shifts that might be encountered in practice. Motivated by this, we set out to investigate the reliability of regression uncertainty estimation methods under various real-world distribution shifts. To that end, we propose an extensive benchmark of 8 image-based regression datasets with different types of challenging distribution shifts. We then employ our benchmark to evaluate many of the most common uncertainty estimation methods, as well as two state-of-the-art uncertainty scores from the task of out-of-distribution detection. We find that while methods are well calibrated when there is no distribution shift, they all become highly overconfident on many of the benchmark datasets. This uncovers important limitations of current uncertainty estimation methods, and the proposed benchmark therefore serves as a challenge to the research community. We hope that our benchmark will spur more work on how to develop truly reliable regression uncertainty estimation methods. Code is available at https://github.com/fregu856/regression_uncertainty.

翻译：在医学成像中,准确回归模型有可能使各种任务自动化,帮助降低成本和改善患者结果。然而,这种安全关键部署确实需要可靠地估计模型不确定性,这也是在实际中可能遇到的分布变化的广泛情况下。我们为此开始调查各种真实世界分布变化下的回归不确定性估算方法的可靠性。为此,我们提议了8个基于图像的回归数据集的广泛基准,并有不同种类的具有挑战性的分布变化。我们然后使用我们的基准来评估许多最常见的不确定性估算方法,以及从分配外检测任务中得出的两个最先进的不确定性评分。我们发现,虽然在分配变化不变化时方法已经很好地校准,但所有方法都对许多基准数据集非常不自信。这揭示了当前不确定性估算方法的重大局限性,因此,拟议基准对研究界构成挑战。我们希望,我们的基准将激励更多关于如何制定真正可靠的回归不确定性估算方法的工作。代码可在 https://githth/regregregresaty_regregregrecom查阅 https://gresseral_regrestium_retium_retium_regrestium)。

0

相关内容

估计/估计量

估计/估计量

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

【推荐】深度学习目标检测全面综述

【推荐】深度学习目标检测全面综述

机器学习研究会

21+阅读 · 2017年9月13日

长链非编码RNA CAR intergenic 10在细胞衰老中的作用和机制

国家自然科学基金

1+阅读 · 2013年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

脉冲电流在SiCp/Al多层结构热冲压/TLP连接复合工艺过程中的作用机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

Al-0.2Sc-0.04（Zr,Yb）合金高温蠕变机理

国家自然科学基金

0+阅读 · 2012年12月31日

齿轮传动多尺度参数与轮齿裂纹扩展演变关联规律研究

国家自然科学基金

0+阅读 · 2012年12月31日

环境与荷载耦合作用下预应力混凝土桥梁服役行为演变及寿命预测

国家自然科学基金

0+阅读 · 2012年12月31日

关系的分解与Domain的表示

国家自然科学基金

1+阅读 · 2011年12月31日

Keap1-Nrf2-ARE信号通路在花色苷诱导HO-1mRNA表达及抗氧化损伤中的作用

国家自然科学基金

0+阅读 · 2009年12月31日

GmMADS1在大豆花发育中的调控机理研究

国家自然科学基金

0+阅读 · 2008年12月31日

算子代数上的映射及与群SL(2,R)相关的vN代数

国家自然科学基金

0+阅读 · 2008年12月31日

Packed-Ensembles for Efficient Uncertainty Estimation

Arxiv

0+阅读 · 2023年3月30日

A Method for Emerging Empirical Age Structures in Agent-Based Models with Exogenous Survival Probabilities

Arxiv

0+阅读 · 2023年3月30日

Coskewness under dependence uncertainty

Arxiv

0+阅读 · 2023年3月30日

Model Order Reduction for Deforming Domain Problems in a Time-Continuous Space-Time Setting

Arxiv

0+阅读 · 2023年3月29日

Data inaccuracy quantification and uncertainty propagation for bibliometric indicators

Arxiv

0+阅读 · 2023年3月29日

A reinforced learning approach to optimal design under model uncertainty

Arxiv

0+阅读 · 2023年3月28日

Towards Quantifying Calibrated Uncertainty via Deep Ensembles in Multi-output Regression Task

Arxiv

0+阅读 · 2023年3月28日

A Survey on Generative Diffusion Model

Arxiv

46+阅读 · 2022年9月6日

Adaptive Methods for Real-World Domain Generalization

Arxiv

13+阅读 · 2021年3月29日

Explainable Artificial Intelligence (XAI): Concepts, Taxonomies, Opportunities and Challenges toward Responsible AI

Explainable Artificial Intelligence (XAI): Concepts, Taxonomies, Opportunities and Challenges toward Responsible AI

Arxiv

77+阅读 · 2019年10月22日

VIP会员

文章信息

相关主题

估计/估计量

相关VIP内容

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【NeurIPS2025】迈向鲁棒的零样本强化学习

一种基于视觉算法生成三维场景重建的多任务系统 | 2025最新200页

【普林斯顿博士论文】量化、评估与缓解现代机器学习系统中的风险

遥感中基于深度学习的领域自适应方法：全面综述

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

【推荐】深度学习目标检测全面综述

【推荐】深度学习目标检测全面综述

机器学习研究会

21+阅读 · 2017年9月13日

相关论文

Packed-Ensembles for Efficient Uncertainty Estimation

Arxiv

0+阅读 · 2023年3月30日

A Method for Emerging Empirical Age Structures in Agent-Based Models with Exogenous Survival Probabilities

Arxiv

0+阅读 · 2023年3月30日

Coskewness under dependence uncertainty

Arxiv

0+阅读 · 2023年3月30日

Model Order Reduction for Deforming Domain Problems in a Time-Continuous Space-Time Setting

Arxiv

0+阅读 · 2023年3月29日

Data inaccuracy quantification and uncertainty propagation for bibliometric indicators

Arxiv

0+阅读 · 2023年3月29日

A reinforced learning approach to optimal design under model uncertainty

Arxiv

0+阅读 · 2023年3月28日

Towards Quantifying Calibrated Uncertainty via Deep Ensembles in Multi-output Regression Task

Arxiv

0+阅读 · 2023年3月28日

A Survey on Generative Diffusion Model

Arxiv

46+阅读 · 2022年9月6日

Adaptive Methods for Real-World Domain Generalization

Arxiv

13+阅读 · 2021年3月29日

Explainable Artificial Intelligence (XAI): Concepts, Taxonomies, Opportunities and Challenges toward Responsible AI

Explainable Artificial Intelligence (XAI): Concepts, Taxonomies, Opportunities and Challenges toward Responsible AI

Arxiv

77+阅读 · 2019年10月22日

相关基金

长链非编码RNA CAR intergenic 10在细胞衰老中的作用和机制

国家自然科学基金

1+阅读 · 2013年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

脉冲电流在SiCp/Al多层结构热冲压/TLP连接复合工艺过程中的作用机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

Al-0.2Sc-0.04（Zr,Yb）合金高温蠕变机理

国家自然科学基金

0+阅读 · 2012年12月31日

齿轮传动多尺度参数与轮齿裂纹扩展演变关联规律研究

国家自然科学基金

0+阅读 · 2012年12月31日

环境与荷载耦合作用下预应力混凝土桥梁服役行为演变及寿命预测

国家自然科学基金

0+阅读 · 2012年12月31日

关系的分解与Domain的表示

国家自然科学基金

1+阅读 · 2011年12月31日

Keap1-Nrf2-ARE信号通路在花色苷诱导HO-1mRNA表达及抗氧化损伤中的作用

国家自然科学基金

0+阅读 · 2009年12月31日

GmMADS1在大豆花发育中的调控机理研究

国家自然科学基金

0+阅读 · 2008年12月31日

算子代数上的映射及与群SL(2,R)相关的vN代数

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员