We present LogiGAN, an unsupervised adversarial pre-training framework for improving the logical reasoning abilities of language models. After automatically identifying logical reasoning phenomena in a massive text corpus via detection heuristics, we train language models to predict the masked-out logical statements. Inspired by the facilitating effect of reflective thinking in human learning, we analogously simulate the learning-thinking process with an adversarial Generator-Verifier architecture to assist logic learning. LogiGAN implements a novel sequential GAN approach that (a) circumvents the non-differentiability challenge of sequential GANs by employing the Generator as a sentence-level generative likelihood scorer whose learning objective is to reach scoring consensus with the Verifier, and (b) remains computationally feasible for large-scale pre-training with arbitrary target lengths. Both base- and large-size language models pre-trained with LogiGAN show clear performance improvements on 12 datasets requiring general reasoning abilities, revealing the fundamental role of logic in broad reasoning as well as the effectiveness of LogiGAN. Ablation studies of LogiGAN components indicate that linguistic and logical abilities are relatively orthogonal, and suggest that the facilitating effect of reflective thinking may also generalize to machine learning.
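To make point (a) concrete, the following is a minimal sketch, not the authors' implementation, of how a Generator can act as a differentiable sentence-level likelihood scorer trained toward consensus with a Verifier. It assumes a Hugging Face-style seq2seq Generator (called with input_ids and labels, returning logits) and a frozen Verifier that produces one scalar score per candidate statement; all function and variable names here are hypothetical. Because the Generator's length-normalized log-likelihood of a candidate is differentiable in its parameters, matching the Verifier's score distribution avoids sampling discrete tokens entirely.

```python
# Sketch of the scoring-consensus objective (assumptions: HF-style seq2seq
# Generator, precomputed Verifier scores; names are illustrative only).
import torch
import torch.nn.functional as F

def generator_scores(gen_lm, context_ids, candidate_ids, pad_id=0):
    """Length-normalized generative log-likelihood of each candidate statement.

    context_ids: (1, S) encoder input; candidate_ids: list of (T,) tensors.
    """
    scores = []
    for cand in candidate_ids:
        out = gen_lm(input_ids=context_ids, labels=cand.unsqueeze(0))
        logp = F.log_softmax(out.logits, dim=-1)            # (1, T, V)
        tok_logp = logp.gather(-1, cand.unsqueeze(0).unsqueeze(-1)).squeeze(-1)
        mask = (cand != pad_id).float()
        scores.append((tok_logp * mask).sum() / mask.sum())  # per-candidate scalar
    return torch.stack(scores)                               # (num_candidates,)

def consensus_loss(gen_lm, verifier_scores, context_ids, candidate_ids):
    """KL divergence pushing the Generator's score distribution over the
    candidates toward the (frozen) Verifier's; differentiable end to end."""
    g = F.log_softmax(generator_scores(gen_lm, context_ids, candidate_ids), dim=-1)
    v = F.softmax(verifier_scores.detach(), dim=-1)          # Verifier as target
    return F.kl_div(g, v, reduction="batchmean")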