Systematic generalization is the ability to combine known parts into novel meanings: an important aspect of efficient human learning, but a weakness of neural network learning. In this work, we investigate how two well-known modeling principles -- modularity and data augmentation -- affect the systematic generalization of neural networks in grounded language learning. We analyze how large the vocabulary needs to be to achieve systematic generalization and how similar the augmented data needs to be to the problem at hand. Our findings show that even in the controlled setting of a synthetic benchmark, achieving systematic generalization remains very difficult. After training on an augmented dataset with almost forty times more adverbs than the original problem, a non-modular baseline is still unable to systematically generalize to a novel combination of a known verb and adverb. When the task is separated into cognitive processes such as perception and navigation, a modular neural network is able to exploit the augmented data and generalize more systematically, achieving 70% and 40% exact-match increases over the state of the art on two gSCAN tests that had not previously been improved. We hope that this work gives insight into the drivers of systematic generalization and into what we still need to improve for neural networks to learn more like humans do.