逻辑感应预感语言学习语言代表 (Learning Language Representations with Logical Inductive Bias) - 专知论文

会员服务 ·

0

归纳偏好 · 语言表示 · 有偏 · Learning · 表示 ·

2023 年 2 月 19 日

Learning Language Representations with Logical Inductive Bias

翻译：逻辑感应预感语言学习语言代表

from arxiv, Published as a conference paper at ICLR 2023

Transformer architectures have achieved great success in solving natural language tasks, which learn strong language representations from large-scale unlabeled texts. In this paper, we seek to go further beyond and explore a new logical inductive bias for better language representation learning. Logic reasoning is known as a formal methodology to reach answers from given knowledge and facts. Inspired by such a view, we develop a novel neural architecture named FOLNet (First-Order Logic Network), to encode this new inductive bias. We construct a set of neural logic operators as learnable Horn clauses, which are further forward-chained into a fully differentiable neural architecture (FOLNet). Interestingly, we find that the self-attention module in transformers can be composed by two of our neural logic operators, which probably explains their strong reasoning performance. Our proposed FOLNet has the same input and output interfaces as other pretrained models and thus could be pretrained/finetuned by using similar losses. It also allows FOLNet to be used in a plug-and-play manner when replacing other pretrained models. With our logical inductive bias, the same set of ``logic deduction skills'' learned through pretraining are expected to be equally capable of solving diverse downstream tasks. For this reason, FOLNet learns language representations that have much stronger transfer capabilities. Experimental results on several language understanding tasks show that our pretrained FOLNet model outperforms the existing strong transformer-based approaches.

翻译：变换器架构在解决自然语言任务方面取得了巨大成功, 学习了大规模无标签文本的强烈语言表现。在本文中, 我们试图超越并探索新的逻辑导导偏向, 以更好地进行语言代表学习。逻辑推理被称为一种正式的方法, 以获得来自特定知识和事实的答案。在这种观点的启发下, 我们开发了一个名为 FOLNet( FOLNet) 的新型神经结构( FOLNet) (FOL- Oder 逻辑网络), 以破解这种新的感化偏差。我们建造了一套神经逻辑操作操作器, 作为可学习的 Horn 条款, 这些操作器被进一步提前连接到完全不同的神经结构( FOLNet 网络 ) 。有趣的是, 我们发现变换器中的自我注意模块可以由两个神经逻辑逻辑逻辑逻辑逻辑逻辑逻辑逻辑操作者组成, 来解释其强的推理性性表现。我们提议的 FOL Net 与其他预训练模型具有相同的输入和输出界面界面界面界面, 从而使用类似的模范模式进行预训练/ 。它还允许 FOL 取代其他强的变压式变换模型, 我们的变压式变压式变压式变压式演法的演化模型具有相同的演法的演法的功能, 。

0

相关内容

归纳偏好

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

【深度学习架构、模型和技巧集合(TensorFlow/PyTorch)】’Deep Learning Models - A collection of various deep learning architectures, models, and tips'

【深度学习架构、模型和技巧集合(TensorFlow/PyTorch)】’Deep Learning Models - A collection of various deep learning architectures, models, and tips'

专知会员服务

58+阅读 · 2020年1月25日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

MARVELD1基因调控肝细胞癌介入治疗的机制研究

国家自然科学基金

0+阅读 · 2016年12月31日

BTG1调控乳腺组织电离辐射敏感性机制的研究

国家自然科学基金

0+阅读 · 2015年12月31日

仿牙周组织结构的多相支架构建及其诱导牙周再生的机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

β-catenin/Ets1复合体在胶质母细胞瘤中对hTERT表达调控机制的研究

国家自然科学基金

0+阅读 · 2013年12月31日

关于Lp多调和边值问题的若干研究

国家自然科学基金

0+阅读 · 2013年12月31日

胶质母细胞瘤干性起源的分子生物学研究

国家自然科学基金

0+阅读 · 2012年12月31日

人肝细胞特异性分泌III型干扰素的调控机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

Wnt通路在硒诱导胰腺癌程序性死亡中的作用机制

国家自然科学基金

0+阅读 · 2012年12月31日

p进表示的伽罗瓦上同调

国家自然科学基金

0+阅读 · 2008年12月31日

具有a-glucosidase抑制活性的新型穿心莲内酯衍生物抗细胞黏附和血管生成作用及机制研究

国家自然科学基金

0+阅读 · 2008年12月31日

Selecting Robust Features for Machine Learning Applications using Multidata Causal Discovery

Arxiv

0+阅读 · 2023年4月12日

A Comprehensive Survey on Deep Graph Representation Learning

Arxiv

103+阅读 · 2023年4月11日

Characterizing personalized effects of family information on disease risk using graph representation learning

Arxiv

0+阅读 · 2023年4月11日

Incorporating Structured Sentences with Time-enhanced BERT for Fully-inductive Temporal Relation Prediction

Arxiv

0+阅读 · 2023年4月10日

Statistical Hardware Design With Multi-model Active Learning

Arxiv

0+阅读 · 2023年4月9日

A Theoretical Study of Inductive Biases in Contrastive Learning

Arxiv

0+阅读 · 2023年4月8日

Multi-Task Learning with Multi-Query Transformer for Dense Prediction

Arxiv

0+阅读 · 2023年4月7日

Disentangled Representation Learning

Arxiv

16+阅读 · 2022年11月21日

Spatially Consistent Representation Learning

Arxiv

14+阅读 · 2021年3月10日

Learning to Propagate for Graph Meta-Learning

Arxiv

14+阅读 · 2019年9月11日

VIP会员

文章信息

相关主题

相关VIP内容

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

【深度学习架构、模型和技巧集合(TensorFlow/PyTorch)】’Deep Learning Models - A collection of various deep learning architectures, models, and tips'

【深度学习架构、模型和技巧集合(TensorFlow/PyTorch)】’Deep Learning Models - A collection of various deep learning architectures, models, and tips'

专知会员服务

58+阅读 · 2020年1月25日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《美国海军陆战队软件定义网络应用案例：分布式防火墙自动化系统》148页

《多体环境下定位导航授时（PNT）系统研究》228页

软件定义无线电（SDR）：商业与军事领域的技术、应用及未来趋势

《攻势防空作战中无人追击者/规避者最优轨迹研究（含动态交战区建模）》95页

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

相关论文

Selecting Robust Features for Machine Learning Applications using Multidata Causal Discovery

Arxiv

0+阅读 · 2023年4月12日

A Comprehensive Survey on Deep Graph Representation Learning

Arxiv

103+阅读 · 2023年4月11日

Characterizing personalized effects of family information on disease risk using graph representation learning

Arxiv

0+阅读 · 2023年4月11日

Incorporating Structured Sentences with Time-enhanced BERT for Fully-inductive Temporal Relation Prediction

Arxiv

0+阅读 · 2023年4月10日

Statistical Hardware Design With Multi-model Active Learning

Arxiv

0+阅读 · 2023年4月9日

A Theoretical Study of Inductive Biases in Contrastive Learning

Arxiv

0+阅读 · 2023年4月8日

Multi-Task Learning with Multi-Query Transformer for Dense Prediction

Arxiv

0+阅读 · 2023年4月7日

Disentangled Representation Learning

Arxiv

16+阅读 · 2022年11月21日

Spatially Consistent Representation Learning

Arxiv

14+阅读 · 2021年3月10日

Learning to Propagate for Graph Meta-Learning

Arxiv

14+阅读 · 2019年9月11日

相关基金

MARVELD1基因调控肝细胞癌介入治疗的机制研究

国家自然科学基金

0+阅读 · 2016年12月31日

BTG1调控乳腺组织电离辐射敏感性机制的研究

国家自然科学基金

0+阅读 · 2015年12月31日

仿牙周组织结构的多相支架构建及其诱导牙周再生的机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

β-catenin/Ets1复合体在胶质母细胞瘤中对hTERT表达调控机制的研究

国家自然科学基金

0+阅读 · 2013年12月31日

关于Lp多调和边值问题的若干研究

国家自然科学基金

0+阅读 · 2013年12月31日

胶质母细胞瘤干性起源的分子生物学研究

国家自然科学基金

0+阅读 · 2012年12月31日

人肝细胞特异性分泌III型干扰素的调控机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

Wnt通路在硒诱导胰腺癌程序性死亡中的作用机制

国家自然科学基金

0+阅读 · 2012年12月31日

p进表示的伽罗瓦上同调

国家自然科学基金

0+阅读 · 2008年12月31日

具有a-glucosidase抑制活性的新型穿心莲内酯衍生物抗细胞黏附和血管生成作用及机制研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员