While uncertainty estimation is a well-studied topic in deep learning, most such work focuses on marginal uncertainty estimates, i.e. the predictive mean and variance at individual input locations. But it is often more useful to estimate predictive correlations between the function values at different input locations. In this paper, we consider the problem of benchmarking how accurately Bayesian models can estimate predictive correlations. We first consider a downstream task which depends on posterior predictive correlations: transductive active learning (TAL). We find that TAL makes better use of models' uncertainty estimates than ordinary active learning, and recommend this as a benchmark for evaluating Bayesian models. Since TAL is too expensive and indirect to guide the development of algorithms, we introduce two metrics which more directly evaluate the predictive correlations and which can be computed efficiently: meta-correlations (i.e. the correlations between the models' correlation estimates and the true values), and cross-normalized likelihoods (XLL). We validate these metrics by demonstrating their consistency with TAL performance, and obtain insights about the relative performance of current Bayesian neural net and Gaussian process models.
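The meta-correlation metric described above can be sketched as follows. This is a minimal illustrative implementation, not the paper's code: it assumes we have a model's estimated pairwise predictive correlation matrix and a ground-truth correlation matrix, and it computes the Pearson correlation between their off-diagonal entries. The function name, toy data, and noise level are all hypothetical.

```python
import numpy as np

def meta_correlation(est_corr: np.ndarray, true_corr: np.ndarray) -> float:
    """Pearson correlation between the off-diagonal entries of an estimated
    and a ground-truth correlation matrix (a sketch of the metric)."""
    iu = np.triu_indices_from(true_corr, k=1)  # unique off-diagonal pairs
    return float(np.corrcoef(est_corr[iu], true_corr[iu])[0, 1])

# Toy example (hypothetical data): a noisy estimate of a true correlation
# structure over four function values.
rng = np.random.default_rng(0)
samples = rng.standard_normal((200, 4))
true_corr = np.corrcoef(samples, rowvar=False)
est_corr = np.clip(true_corr + 0.1 * rng.standard_normal(true_corr.shape), -1.0, 1.0)
print(meta_correlation(est_corr, true_corr))
```

A perfect model's correlation estimates would yield a meta-correlation of 1, while estimates unrelated to the true structure would yield values near 0.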