改进解决数学文字问题中组成的概括化 (Improving Compositional Generalization in Math Word Problem Solving) - 专知论文

会员服务 ·

0

泛化理论 · 数据增强 · Analysis · 数据拆分 · Performer ·

2022 年 9 月 3 日

Improving Compositional Generalization in Math Word Problem Solving

翻译：改进解决数学文字问题中组成的概括化

Yunshi Lan,Lei Wang,Jing Jiang,Ee-Peng Lim

Compositional generalization refers to a model's capability to generalize to newly composed input data based on the data components observed during training. It has triggered a series of compositional generalization analysis on different tasks as generalization is an important aspect of language and problem solving skills. However, the similar discussion on math word problems (MWPs) is limited. In this manuscript, we study compositional generalization in MWP solving. Specifically, we first introduce a data splitting method to create compositional splits from existing MWP datasets. Meanwhile, we synthesize data to isolate the effect of compositions. To improve the compositional generalization in MWP solving, we propose an iterative data augmentation method that includes diverse compositional variation into training data and could collaborate with MWP methods. During the evaluation, we examine a set of methods and find all of them encounter severe performance loss on the evaluated datasets. We also find our data augmentation method could significantly improve the compositional generalization of general MWP methods. Code is available at https://github.com/demoleiwang/CGMWP.

翻译：总体构成是指一种模型能够根据培训期间观察到的数据组成部分对新成的输入数据进行概括分析,它引发了一系列对不同任务进行整体构成分析,因为一般化是语言和解决问题技能的一个重要方面。然而,关于数学词问题的类似讨论是有限的。在这个手稿中,我们在解决MWP时研究整体化。具体地说,我们首先采用数据分解方法,从现有的MWP数据集中产生组成分解。与此同时,我们综合数据,分离组成的影响。为了改进 MWP的构成分解,我们建议一种迭代数据扩增方法,在培训数据中包括多种组成变异,并可以与MWP方法合作。在评估过程中,我们研究一套方法,发现所有方法在经过评估的数据集中都受到严重的性能损失。我们还发现,我们的数据扩增方法可以大大改进一般 MWP方法的构成的概括化。代码见https://github.com/demoleiwang/CGMWP。

0

相关内容

泛化理论

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

2019年机器学习框架回顾

2019年机器学习框架回顾

专知会员服务

36+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium7

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium7

中国图象图形学学会CSIG

0+阅读 · 2021年11月15日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

【ICIG2021】Latest News & Announcements of the Industry Talk2

【ICIG2021】Latest News & Announcements of the Industry Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年7月29日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

柯萨奇病毒B3非结构蛋白3C调控SREBP1促进病毒复制的分子机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

长链非编码RNA HOXD-AS1促进人肝细胞癌增殖的作用及分子机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

微纳尺度多孔介质中气体运移机理研究

国家自然科学基金

0+阅读 · 2015年12月31日

全钛基背投式PIN异质结钙钛矿型太阳电池研究

国家自然科学基金

0+阅读 · 2014年12月31日

多场多相条件下超临界二氧化碳粒子射流破岩和井筒携岩机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

LMP1诱导SATB1表达及磷酸化在鼻咽癌细胞上皮间叶转化中的作用研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于NF-kB/P-gp通路的蒙药枸杞子-7防止子宫内膜异位症复发的分子机制分析

国家自然科学基金

0+阅读 · 2012年12月31日

miRNAs与DNA甲基转移酶1相互作用在同型半胱氨酸致血管平滑肌细胞增殖的分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

受体毛细管电泳对赤芍中扩血管活性成分的筛选研究

国家自然科学基金

0+阅读 · 2012年12月31日

超临界二氧化碳射流破岩机理研究

国家自然科学基金

0+阅读 · 2009年12月31日

SizeShiftReg: a Regularization Method for Improving Size-Generalization in Graph Neural Networks

Arxiv

0+阅读 · 2022年10月20日

Improving Data Quality with Training Dynamics of Gradient Boosting Decision Trees

Improving Data Quality with Training Dynamics of Gradient Boosting Decision Trees

Arxiv

0+阅读 · 2022年10月20日

Attaining Class-level Forgetting in Pretrained Model using Few Samples

Arxiv

0+阅读 · 2022年10月19日

WebtoonMe: A Data-Centric Approach for Full-Body Portrait Stylization

Arxiv

0+阅读 · 2022年10月19日

On effects of Knowledge Distillation on Transfer Learning

Arxiv

0+阅读 · 2022年10月18日

SparseAdapter: An Easy Approach for Improving the Parameter-Efficiency of Adapters

Arxiv

0+阅读 · 2022年10月18日

Neural Networks Base on Power Method and Inverse Power Method for Solving Linear Eigenvalue Problems

Arxiv

0+阅读 · 2022年10月18日

Tight Analysis of Extra-gradient and Optimistic Gradient Methods For Nonconvex Minimax Problems

Arxiv

0+阅读 · 2022年10月17日

Improving Knowledge-aware Dialogue Generation via Knowledge Base Question Answering

Arxiv

16+阅读 · 2019年12月16日

Additive Margin Softmax for Face Verification

Arxiv

11+阅读 · 2018年1月18日

VIP会员

文章信息

相关主题

相关VIP内容

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

2019年机器学习框架回顾

2019年机器学习框架回顾

专知会员服务

36+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

军事战术边缘计算的重要性

《欧洲天空盾牌倡议：应对无人机饱和攻击与高超音速导弹的多层防空演进与挑战》报告

《美军使用大语言模型技术生成领域特定文档》2025最新379页

《代理生成式人工智能与国家安全：提升竞争力的政策建议》

相关资讯

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium7

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium7

中国图象图形学学会CSIG

0+阅读 · 2021年11月15日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

【ICIG2021】Latest News & Announcements of the Industry Talk2

【ICIG2021】Latest News & Announcements of the Industry Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年7月29日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

相关论文

SizeShiftReg: a Regularization Method for Improving Size-Generalization in Graph Neural Networks

Arxiv

0+阅读 · 2022年10月20日

Improving Data Quality with Training Dynamics of Gradient Boosting Decision Trees

Improving Data Quality with Training Dynamics of Gradient Boosting Decision Trees

Arxiv

0+阅读 · 2022年10月20日

Attaining Class-level Forgetting in Pretrained Model using Few Samples

Arxiv

0+阅读 · 2022年10月19日

WebtoonMe: A Data-Centric Approach for Full-Body Portrait Stylization

Arxiv

0+阅读 · 2022年10月19日

On effects of Knowledge Distillation on Transfer Learning

Arxiv

0+阅读 · 2022年10月18日

SparseAdapter: An Easy Approach for Improving the Parameter-Efficiency of Adapters

Arxiv

0+阅读 · 2022年10月18日

Neural Networks Base on Power Method and Inverse Power Method for Solving Linear Eigenvalue Problems

Arxiv

0+阅读 · 2022年10月18日

Tight Analysis of Extra-gradient and Optimistic Gradient Methods For Nonconvex Minimax Problems

Arxiv

0+阅读 · 2022年10月17日

Improving Knowledge-aware Dialogue Generation via Knowledge Base Question Answering

Arxiv

16+阅读 · 2019年12月16日

Additive Margin Softmax for Face Verification

Arxiv

11+阅读 · 2018年1月18日

相关基金

柯萨奇病毒B3非结构蛋白3C调控SREBP1促进病毒复制的分子机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

长链非编码RNA HOXD-AS1促进人肝细胞癌增殖的作用及分子机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

微纳尺度多孔介质中气体运移机理研究

国家自然科学基金

0+阅读 · 2015年12月31日

全钛基背投式PIN异质结钙钛矿型太阳电池研究

国家自然科学基金

0+阅读 · 2014年12月31日

多场多相条件下超临界二氧化碳粒子射流破岩和井筒携岩机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

LMP1诱导SATB1表达及磷酸化在鼻咽癌细胞上皮间叶转化中的作用研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于NF-kB/P-gp通路的蒙药枸杞子-7防止子宫内膜异位症复发的分子机制分析

国家自然科学基金

0+阅读 · 2012年12月31日

miRNAs与DNA甲基转移酶1相互作用在同型半胱氨酸致血管平滑肌细胞增殖的分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

受体毛细管电泳对赤芍中扩血管活性成分的筛选研究

国家自然科学基金

0+阅读 · 2012年12月31日

超临界二氧化碳射流破岩机理研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员