In hybrid human-AI systems, users need to decide whether to trust an algorithmic prediction even though the true error in that prediction is unknown. To accommodate such settings, we introduce RETRO-VIZ, a method for (i) estimating and (ii) explaining the trustworthiness of regression predictions. It consists of RETRO, a quantitative estimate of the trustworthiness of a prediction, and VIZ, a visual explanation that helps users identify the reasons for the (lack of) trustworthiness of a prediction. We find that RETRO-scores correlate negatively with prediction error across 117 experimental settings, indicating that RETRO provides a useful measure for distinguishing trustworthy predictions from untrustworthy ones. In a user study with 41 participants, we find that VIZ-explanations help users identify whether a prediction is trustworthy: given a pair of predictions, on average 95.1% of participants correctly select the more trustworthy one. In addition, an average of 75.6% of participants can accurately describe why a prediction seems (un)trustworthy. Finally, we find that the vast majority of users subjectively experience RETRO-VIZ as a useful tool for assessing the trustworthiness of algorithmic predictions.