后热后概念概念瓶颈模型 (Post-hoc Concept Bottleneck Models) - 专知论文

会员服务 ·

0

可约的 · MoDELS · Performer · Neural Networks · 训练数据 ·

2022 年 5 月 31 日

Post-hoc Concept Bottleneck Models

翻译：后热后概念概念瓶颈模型

Mert Yuksekgonul,Maggie Wang,James Zou

from arxiv, An earlier version was published in the ICLR 2022 PAIR2Struct Workshop

Concept Bottleneck Models (CBMs) map the inputs onto a set of interpretable concepts (``the bottleneck'') and use the concepts to make predictions. A concept bottleneck enhances interpretability since it can be investigated to understand what concepts the model "sees" in an input and which of these concepts are deemed important. However, CBMs are restrictive in practice as they require concept labels in the training data to learn the bottleneck and do not leverage strong pretrained models. Moreover, CBMs often do not match the accuracy of an unrestricted neural network, reducing the incentive to deploy them in practice. In this work, we address the limitations of CBMs by introducing Post-hoc Concept Bottleneck models (PCBMs). We show that we can turn any neural network into a PCBM without sacrificing model performance while still retaining interpretability benefits. When concept annotation is not available on the training data, we show that PCBM can transfer concepts from other datasets or from natural language descriptions of concepts. PCBM also enables users to quickly debug and update the model to reduce spurious correlations and improve generalization to new (potentially different) data. Through a model-editing user study, we show that editing PCBMs via concept-level feedback can provide significant performance gains without using any data from the target domain or model retraining.

翻译：概念瓶装模型(BBS)将输入的内容映射到一套可解释的概念(“瓶颈”)中,并使用概念来作出预测。概念瓶颈可以提高解释性,因为可以对概念的可解释性进行调查,以了解在输入中“看到”的模式是什么概念,这些概念中哪些被认为重要。然而,建立信任措施在实践中是限制性的,因为它们要求培训数据中的概念标签来学习瓶颈,而不是利用强大的预先培训模型。此外,建立信任措施往往与不受限制的神经网络(“瓶颈” )的准确性不匹配,降低了实际部署这些网络的动力。在这项工作中,我们通过引入后热概念瓶瓶式模型模型模型(PBCS)模型(PBS)模型解决建立信任措施的局限性。我们表明,我们可以在不牺牲模型性能的同时将任何神经网络转化为PCM模型,同时保留可解释性效益。当培训数据没有提供概念说明时,我们表明PCMM可以从其他数据集或自然语言概念描述中转移概念。 PCM还能够快速调试用模型来减少刺激性的相关性,通过新的模型改进模型,并且通过新的域域点化目标反馈,我们可以提供重要的业绩水平的数据。

0

相关内容

可约的

【Facebook-Ishan Mishra】计算机视觉自监督学习，92页ppt

专知会员服务

36+阅读 · 2021年7月7日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文推荐】最新八篇情感分析相关论文—Pair-wise判别器、多模态情感分析、上下文语境、Gated 卷积网络

【论文推荐】最新八篇情感分析相关论文—Pair-wise判别器、多模态情感分析、上下文语境、Gated 卷积网络

专知

20+阅读 · 2018年6月29日

【推荐】MXNet深度情感分析实战

【推荐】MXNet深度情感分析实战

机器学习研究会

16+阅读 · 2017年10月4日

功能导向p-n型MoS2/SnO2异质分级结构的构筑及气敏-光催化性能

国家自然科学基金

0+阅读 · 2015年12月31日

大气中HONO的非均相生成和去除过程的研究

国家自然科学基金

0+阅读 · 2014年12月31日

双相I型情感障碍岛叶参与的情感认知环路的脑成像研究

国家自然科学基金

0+阅读 · 2014年12月31日

低NOx浓度条件下挥发性有机物的大气氧化机理研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于体映射的修复用人体骨骼支架模型生成方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

Shc C蛋白参与罗哌卡因脊髓神经毒性机制的作用

国家自然科学基金

0+阅读 · 2012年12月31日

双语者句子理解过程中句法加工的认知/神经时间动态性

国家自然科学基金

0+阅读 · 2012年12月31日

Doublecortin的动态表达在骨折愈合中的作用与调控机制

国家自然科学基金

0+阅读 · 2012年12月31日

骨骼肌-脊髓逆向传导通路对脊髓损伤后神经生长的作用机制

国家自然科学基金

0+阅读 · 2011年12月31日

镉激活神经细胞mTOR通路诱导凋亡及雷帕霉素靶向调控抗凋亡分子机理

国家自然科学基金

0+阅读 · 2009年12月31日

Multiple-Modality Associative Memory: a framework for Learning

Arxiv

0+阅读 · 2022年7月18日

BayesCap: Bayesian Identity Cap for Calibrated Uncertainty in Frozen Neural Networks

BayesCap: Bayesian Identity Cap for Calibrated Uncertainty in Frozen Neural Networks

Arxiv

0+阅读 · 2022年7月14日

Language with Vision: a Study on Grounded Word and Sentence Embeddings

Language with Vision: a Study on Grounded Word and Sentence Embeddings

Arxiv

0+阅读 · 2022年7月14日

ConCL: Concept Contrastive Learning for Dense Prediction Pre-training in Pathology Images

Arxiv

0+阅读 · 2022年7月14日

Learning Neural Models for Natural Language Processing in the Face of Distributional Shift

Arxiv

11+阅读 · 2021年9月3日

Attention Bottlenecks for Multimodal Fusion

Arxiv

31+阅读 · 2021年6月30日

Disentangled Information Bottleneck

Disentangled Information Bottleneck

Arxiv

12+阅读 · 2020年12月22日

How to train your MAML

Arxiv

26+阅读 · 2019年3月5日

Multi-pseudo Regularized Label for Generated Samples in Person Re-Identification

Arxiv

12+阅读 · 2018年1月29日

HONE: Higher-Order Network Embeddings

Arxiv

12+阅读 · 2018年1月28日

VIP会员

文章信息

相关主题

Neural Networks

相关VIP内容

【Facebook-Ishan Mishra】计算机视觉自监督学习，92页ppt

专知会员服务

36+阅读 · 2021年7月7日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《美空军条令出版物：战略打击》最新条令

《高能激光武器》22页slides

军事前沿模型

《面向小型无人机或无人飞行器的创新雷达探测与人工智能分类技术》263页

相关资讯

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文推荐】最新八篇情感分析相关论文—Pair-wise判别器、多模态情感分析、上下文语境、Gated 卷积网络

【论文推荐】最新八篇情感分析相关论文—Pair-wise判别器、多模态情感分析、上下文语境、Gated 卷积网络

专知

20+阅读 · 2018年6月29日

【推荐】MXNet深度情感分析实战

【推荐】MXNet深度情感分析实战

机器学习研究会

16+阅读 · 2017年10月4日

相关论文

Multiple-Modality Associative Memory: a framework for Learning

Arxiv

0+阅读 · 2022年7月18日

BayesCap: Bayesian Identity Cap for Calibrated Uncertainty in Frozen Neural Networks

BayesCap: Bayesian Identity Cap for Calibrated Uncertainty in Frozen Neural Networks

Arxiv

0+阅读 · 2022年7月14日

Language with Vision: a Study on Grounded Word and Sentence Embeddings

Language with Vision: a Study on Grounded Word and Sentence Embeddings

Arxiv

0+阅读 · 2022年7月14日

ConCL: Concept Contrastive Learning for Dense Prediction Pre-training in Pathology Images

Arxiv

0+阅读 · 2022年7月14日

Learning Neural Models for Natural Language Processing in the Face of Distributional Shift

Arxiv

11+阅读 · 2021年9月3日

Attention Bottlenecks for Multimodal Fusion

Arxiv

31+阅读 · 2021年6月30日

Disentangled Information Bottleneck

Disentangled Information Bottleneck

Arxiv

12+阅读 · 2020年12月22日

How to train your MAML

Arxiv

26+阅读 · 2019年3月5日

Multi-pseudo Regularized Label for Generated Samples in Person Re-Identification

Arxiv

12+阅读 · 2018年1月29日

HONE: Higher-Order Network Embeddings

Arxiv

12+阅读 · 2018年1月28日

相关基金

功能导向p-n型MoS2/SnO2异质分级结构的构筑及气敏-光催化性能

国家自然科学基金

0+阅读 · 2015年12月31日

大气中HONO的非均相生成和去除过程的研究

国家自然科学基金

0+阅读 · 2014年12月31日

双相I型情感障碍岛叶参与的情感认知环路的脑成像研究

国家自然科学基金

0+阅读 · 2014年12月31日

低NOx浓度条件下挥发性有机物的大气氧化机理研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于体映射的修复用人体骨骼支架模型生成方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

Shc C蛋白参与罗哌卡因脊髓神经毒性机制的作用

国家自然科学基金

0+阅读 · 2012年12月31日

双语者句子理解过程中句法加工的认知/神经时间动态性

国家自然科学基金

0+阅读 · 2012年12月31日

Doublecortin的动态表达在骨折愈合中的作用与调控机制

国家自然科学基金

0+阅读 · 2012年12月31日

骨骼肌-脊髓逆向传导通路对脊髓损伤后神经生长的作用机制

国家自然科学基金

0+阅读 · 2011年12月31日

镉激活神经细胞mTOR通路诱导凋亡及雷帕霉素靶向调控抗凋亡分子机理

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员