The Concept Bottleneck Models (CBMs) of Koh et al. [2020] provide a means to ensure that a neural-network-based classifier bases its predictions solely on human-understandable concepts. The concept labels, or rationales as we refer to them, are learned by the concept-labeling component of the CBM. Another component learns to predict the target classification label from these predicted concept labels. Unfortunately, these models are heavily reliant on human-provided concept labels for each datapoint. To enable CBMs to behave robustly when such labels are not readily available, we show how to equip them with the ability to abstain from predicting concepts when the concept-labeling component is uncertain. In other words, our model learns to provide rationales for its predictions, but only when it is confident that the rationale is correct.
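To make the architecture concrete, the following is a minimal sketch of a concept bottleneck model with abstention, not the authors' implementation: the class name `ConceptBottleneck`, the layer sizes, and the confidence threshold `tau` used to trigger abstention are illustrative assumptions, with uncertain concepts simply masked to an uninformative value before the label predictor is applied.

```python
# Minimal sketch (assumed, not the paper's method) of a CBM whose
# concept-labeling component can abstain when it is uncertain.
import torch
import torch.nn as nn


class ConceptBottleneck(nn.Module):
    def __init__(self, in_dim: int, n_concepts: int, n_classes: int, tau: float = 0.9):
        super().__init__()
        # Concept-labeling component: maps inputs to concept logits.
        self.concept_net = nn.Sequential(
            nn.Linear(in_dim, 64), nn.ReLU(), nn.Linear(64, n_concepts)
        )
        # Label predictor: maps (possibly abstained) concepts to class logits.
        self.label_net = nn.Linear(n_concepts, n_classes)
        self.tau = tau  # illustrative confidence threshold for abstention

    def forward(self, x: torch.Tensor):
        concept_probs = torch.sigmoid(self.concept_net(x))
        # Confidence of each binary concept prediction: max(p, 1 - p).
        confidence = torch.maximum(concept_probs, 1.0 - concept_probs)
        # Abstain on concepts whose confidence falls below tau by
        # replacing them with an uninformative value of 0.5.
        abstain = confidence < self.tau
        rationale = torch.where(
            abstain, torch.full_like(concept_probs, 0.5), concept_probs
        )
        logits = self.label_net(rationale)
        return logits, concept_probs, abstain


if __name__ == "__main__":
    model = ConceptBottleneck(in_dim=16, n_concepts=8, n_classes=3)
    x = torch.randn(4, 16)
    logits, concepts, abstain = model(x)
    print(logits.shape, abstain.float().mean().item())
```

The thresholding rule above is only one of many possible abstention mechanisms; the key point it illustrates is that the label predictor sees concept predictions only when the concept-labeling component is confident in them.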