Saliency methods -- techniques to identify the importance of input features on a model's output -- are a common first step in understanding neural network behavior. However, interpreting saliency requires tedious manual inspection to identify and aggregate patterns in model behavior, resulting in ad hoc or cherry-picked analysis. To address these concerns, we present Shared Interest: a set of metrics for comparing saliency with human-annotated ground truths. By providing quantitative descriptors, Shared Interest allows ranking, sorting, and aggregation of inputs, thereby facilitating large-scale systematic analysis of model behavior. We use Shared Interest to identify eight recurring patterns in model behavior, including focusing on a sufficient subset of ground-truth features or being distracted by contextual features. Working with representative real-world users, we show how Shared Interest can be used to rapidly develop or lose trust in a model's reliability, uncover issues that are missed in manual analyses, and enable interactive probing of model behavior.
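As a concrete illustration, the comparison between a saliency map and a human annotation can be sketched as set-based agreement scores over binarized masks. The function name and the exact score definitions below are illustrative assumptions under this framing, not the authors' reference implementation.

```python
import numpy as np

def shared_interest_scores(saliency_mask, ground_truth_mask):
    """Set-based agreement between a binarized saliency mask and a
    human-annotated ground-truth mask (boolean arrays of equal shape).

    Note: the score definitions here are an illustrative sketch of
    saliency-vs-annotation comparison, not the paper's exact metrics.
    """
    s = np.asarray(saliency_mask, dtype=bool)
    g = np.asarray(ground_truth_mask, dtype=bool)
    intersection = np.logical_and(s, g).sum()
    union = np.logical_or(s, g).sum()
    return {
        # overall overlap between salient and annotated features
        "iou": intersection / union if union else 0.0,
        # fraction of annotated features the model also finds salient;
        # low values suggest the model uses only a subset of the evidence
        "ground_truth_coverage": intersection / g.sum() if g.sum() else 0.0,
        # fraction of salient features inside the annotation;
        # low values suggest the model is distracted by context
        "saliency_coverage": intersection / s.sum() if s.sum() else 0.0,
    }

# Example: saliency covers half the annotation and half its own mass
# falls outside it.
scores = shared_interest_scores([1, 1, 0, 0], [1, 0, 1, 0])
```

Because each input reduces to a few scalars, inputs can then be ranked or grouped by these scores, which is what enables the large-scale, systematic analysis described above.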