Invariant risk minimization (IRM) has recently emerged as a promising alternative for domain generalization. Nevertheless, its loss function is difficult to optimize for nonlinear classifiers, and the original optimization objective can fail when pseudo-invariant features or geometric skews exist. Inspired by IRM, in this paper we propose a novel formulation for domain generalization, dubbed invariant information bottleneck (IIB). IIB aims to minimize invariant risks for nonlinear classifiers while simultaneously mitigating the impact of pseudo-invariant features and geometric skews. Specifically, we first present a novel formulation of invariant causal prediction via mutual information. We then adopt a variational formulation of the mutual information to develop a tractable loss function for nonlinear classifiers. To overcome the failure modes of IRM, we propose to minimize the mutual information between the inputs and the corresponding representations. IIB significantly outperforms IRM on synthetic datasets where pseudo-invariant features and geometric skews occur, demonstrating the effectiveness of the proposed formulation in overcoming the failure modes of IRM. Furthermore, experiments on DomainBed show that IIB outperforms $13$ baselines by $0.9\%$ on average across $7$ real datasets.
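To make the pipeline above concrete, one minimal schematic of an objective in this spirit, assuming $Z=\phi(X)$ denotes the learned representation, $E$ the environment index, and $\lambda,\beta$ trade-off weights (all notation introduced here for illustration rather than taken from the method itself), is
\[
\min_{\phi}\; -\,I(Y;Z) \;+\; \lambda\, I(Y;E \mid Z) \;+\; \beta\, I(X;Z),
\]
where the first term encourages predictive representations, the second penalizes predictions that depend on the environment, and the third is the information-bottleneck term minimizing the mutual information between inputs and representations; in practice each term would be replaced by a variational bound to obtain a tractable loss.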