Reasoning over natural language is a long-standing goal of the research community. However, studies have shown that existing language models are inadequate at reasoning. To address this issue, we present POET, a new pre-training paradigm. By pre-training language models on programs paired with their execution results, POET empowers language models to harvest the reasoning knowledge possessed by program executors via a data-driven approach. POET is conceptually simple and can be instantiated with different kinds of programs. In this paper, we present three empirically powerful instances: POET-Math, POET-Logic, and POET-SQL. Experimental results on six benchmarks demonstrate that POET significantly boosts model performance on natural language reasoning tasks, such as numerical reasoning, logical reasoning, and multi-hop reasoning. Taking the DROP benchmark as a representative example, POET improves the F1 score of BART from 69.2% to 80.6%. Furthermore, POET benefits giant language models as well, pushing the F1 score of T5-11B to 87.6% and achieving new state-of-the-art performance on DROP. POET opens a new door to reasoning-enhanced pre-training, and we hope our analysis will shed light on future research on reasoning like program executors.
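To make the paradigm concrete, below is a minimal sketch of how a single POET-SQL-style pre-training pair might be constructed: a program (a SQL query plus its database context) serialized as the model input, and the executor's output serving as the training target. The helper name, serialization format, and toy data are illustrative assumptions, not details taken from the paper.

```python
import sqlite3

def make_poet_sql_example(create_stmts, sql_query):
    """Build one (input, target) pre-training pair in the spirit of POET-SQL:
    the model reads the program plus its data context and learns to predict
    what a real executor outputs. (Hypothetical helper; formats are assumed.)"""
    # Execute the program with a genuine executor (SQLite) on an in-memory DB.
    conn = sqlite3.connect(":memory:")
    cur = conn.cursor()
    for stmt in create_stmts:
        cur.execute(stmt)
    rows = cur.execute(sql_query).fetchall()
    conn.close()
    # Serialize: query and database context form the input sequence;
    # the execution result becomes the target sequence.
    source = sql_query + " | " + " ; ".join(create_stmts)
    target = " , ".join(str(v) for row in rows for v in row)
    return source, target

pair = make_poet_sql_example(
    ["CREATE TABLE scores(name TEXT, pts INT)",
     "INSERT INTO scores VALUES ('a', 3), ('b', 5)"],
    "SELECT SUM(pts) FROM scores",
)
```

A corpus of such pairs can be generated automatically at scale, which is what makes this a data-driven way to transfer an executor's reasoning behavior into a language model.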