A Too-Good-to-be-True Prior to Reduce Shortcut Reliance - 专知论文


October 21, 2021

A Too-Good-to-be-True Prior to Reduce Shortcut Reliance


Nikolay Dagaev, Brett D. Roads, Xiaoliang Luo, Daniel N. Barry, Kaustubh R. Patil, Bradley C. Love

from arxiv, 10 pages, 8 figures

Despite their impressive performance in object recognition and other tasks under standard testing conditions, deep networks often fail to generalize to out-of-distribution (o.o.d.) samples. One cause for this shortcoming is that modern architectures tend to rely on "shortcuts" - superficial features that correlate with categories without capturing deeper invariants that hold across contexts. Real-world concepts often possess a complex structure that can vary superficially across contexts, which can make the most intuitive and promising solutions in one context not generalize to others. One potential way to improve o.o.d. generalization is to assume simple solutions are unlikely to be valid across contexts and avoid them, which we refer to as the too-good-to-be-true prior. A low-capacity network (LCN) with a shallow architecture should only be able to learn surface relationships, including shortcuts. We find that LCNs can serve as shortcut detectors. Furthermore, an LCN's predictions can be used in a two-stage approach to encourage a high-capacity network (HCN) to rely on deeper invariant features that should generalize broadly. In particular, items that the LCN can master are downweighted when training the HCN. Using a modified version of the CIFAR-10 dataset in which we introduced shortcuts, we found that the two-stage LCN-HCN approach reduced reliance on shortcuts and facilitated o.o.d. generalization.
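The two-stage idea in the abstract (items the LCN can master are downweighted when training the HCN) can be illustrated with a minimal sketch. The abstract does not give the exact weighting rule, so the rule used here, weight = 1 minus the LCN's probability for the correct class, is an assumption for illustration only, and the function names `lcn_sample_weights` and `weighted_cross_entropy` are hypothetical.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def lcn_sample_weights(lcn_logits, labels):
    """Stage 1 output -> per-item weights.

    Assumed rule (not taken from the source): weight = 1 - p_LCN(correct class),
    so items the low-capacity network already solves get weights near zero.
    """
    weights = []
    for logits, y in zip(lcn_logits, labels):
        p_correct = softmax(logits)[y]
        weights.append(1.0 - p_correct)
    return weights

def weighted_cross_entropy(hcn_logits, labels, weights):
    """Stage 2 loss: per-item cross-entropy of the HCN scaled by the LCN-derived weights."""
    total = 0.0
    for logits, y, w in zip(hcn_logits, labels, weights):
        total += -w * math.log(softmax(logits)[y])
    return total / len(labels)

# Toy example: item 0 is confidently solved by the LCN (a likely shortcut item),
# item 1 is not, so item 1 dominates the HCN loss.
lcn_logits = [[8.0, 0.0], [0.0, 0.0]]
hcn_logits = [[0.0, 0.0], [0.0, 0.0]]
labels = [0, 1]

weights = lcn_sample_weights(lcn_logits, labels)
loss = weighted_cross_entropy(hcn_logits, labels, weights)
```

In this toy case the first item contributes almost nothing to the loss, which is the intended effect: gradient signal for the HCN comes mainly from items that a shallow model could not solve.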

