在变换器中利用感性偏见,使语法和语义与 VAE 的无监督分解 (Exploiting Inductive Bias in Transformers for Unsupervised Disentanglement of Syntax and Semantics with VAEs) - 专知论文

会员服务 ·

0

INFORMS · 归纳偏好 · 潜变量/隐变量 · MoDELS · 监督模型 ·

2022 年 5 月 19 日

Exploiting Inductive Bias in Transformers for Unsupervised Disentanglement of Syntax and Semantics with VAEs

翻译：在变换器中利用感性偏见,使语法和语义与 VAE 的无监督分解

Ghazi Felhi,Joseph Le Roux,Djamé Seddah

from arxiv, Accepted @ NAACL 2022

We propose a generative model for text generation, which exhibits disentangled latent representations of syntax and semantics. Contrary to previous work, this model does not need syntactic information such as constituency parses, or semantic information such as paraphrase pairs. Our model relies solely on the inductive bias found in attention-based architectures such as Transformers. In the attention of Transformers, keys handle information selection while values specify what information is conveyed. Our model, dubbed QKVAE, uses Attention in its decoder to read latent variables where one latent variable infers keys while another infers values. We run experiments on latent representations and experiments on syntax/semantics transfer which show that QKVAE displays clear signs of disentangled syntax and semantics. We also show that our model displays competitive syntax transfer capabilities when compared to supervised models and that comparable supervised models need a fairly large amount of data (more than 50K samples) to outperform it on both syntactic and semantic transfer. The code for our experiments is publicly available.

翻译：我们为文本生成建议了一个基因模型, 它显示了语法和语义学的分解潜在代表。与先前的工作相反, 这个模型不需要合成信息, 如选区剖面, 或语义配对等语义信息。我们的模型完全依赖于在以关注为基础的结构( 如变换器)中发现的诱导偏差。在变换器的注意下, 键会处理信息选择, 而值会指定传递的信息。我们的模型, 被称为 QKVAE, 在其解码器中, 使用“ 注意” 来阅读潜在变量, 在其中, 一个潜在变量推断关键值, 而另一个推断值。我们进行关于语法/ 语义转换的实验和实验, 显示 QKVAE 显示分解语法和语义转换的清晰迹象。我们还显示, 我们的模型显示, 当与受监督的模式相比, 具有竞争性的语法转移能力, 并且可比的受监督模型需要相当大量的数据( 超过 50K 样), 以在合成和语义传输上超越它。我们的代码是公开的。

0

相关内容

INFORMS

《计算机信息》杂志发表高质量的论文，扩大了运筹学和计算的范围，寻求有关理论、方法、实验、系统和应用方面的原创研究论文、新颖的调查和教程论文，以及描述新的和有用的软件工具的论文。官网链接：https://pubsonline.informs.org/journal/ijoc

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

ExBert — 可视化分析Transformer学到的表示

ExBert — 可视化分析Transformer学到的表示

专知会员服务

32+阅读 · 2019年10月16日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

【论文推荐】最新九篇自动问答相关论文—可解释推理网络、上下文知识图谱嵌入、注意力RNN、Multi-Cast注意力网络

【论文推荐】最新九篇自动问答相关论文—可解释推理网络、上下文知识图谱嵌入、注意力RNN、Multi-Cast注意力网络

专知

15+阅读 · 2018年6月29日

【论文推荐】最新5篇图像描述生成（Image Caption）相关论文—情感、注意力机制、遥感图像、序列到序列、深度神经结构

【论文推荐】最新5篇图像描述生成（Image Caption）相关论文—情感、注意力机制、遥感图像、序列到序列、深度神经结构

专知

66+阅读 · 2018年1月31日

可解释的CNN

可解释的CNN

CreateAMind

17+阅读 · 2017年10月5日

调控SOX9介导的Müller细胞过度活化对大鼠光损伤视网膜变性的保护作用及其机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

Mir-134/487b/668基因簇与PTEN介导的信号通路在恶性纤维组织细胞瘤中的作用及机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

Hippo通路在高糖诱导的肾小球系膜细胞增殖中的作用及机制

国家自然科学基金

0+阅读 · 2013年12月31日

杏仁核-海马CA1区-前额叶皮层环路异常在抑郁发生中的作用

国家自然科学基金

0+阅读 · 2013年12月31日

Wnt/β-catenin和 Hedgehog信号通路互作在骨关节中的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

线粒体蛋白SIRT5对氧化/硝化应激诱导胰岛beta细胞损伤的调控作用及机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

组蛋白去乙酰化酶抑制剂对骨关节炎中Notch-NFAT信号通路调控的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于MeCP2甲基化调控枯否细胞分泌炎症因子的栀子苷抗肝纤维化机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

SuFu 蛋白在细胞信号调控中的作用及机制

国家自然科学基金

0+阅读 · 2011年12月31日

转移相关基因FMNL2参与大肠癌Rho细胞运动信号通路的机制探讨

国家自然科学基金

0+阅读 · 2008年12月31日

A Medical Information Extraction Workbench to Process German Clinical Text

A Medical Information Extraction Workbench to Process German Clinical Text

Arxiv

0+阅读 · 2022年7月8日

TFCNs: A CNN-Transformer Hybrid Network for Medical Image Segmentation

TFCNs: A CNN-Transformer Hybrid Network for Medical Image Segmentation

Arxiv

0+阅读 · 2022年7月7日

VecGAN: Image-to-Image Translation with Interpretable Latent Directions

VecGAN: Image-to-Image Translation with Interpretable Latent Directions

Arxiv

0+阅读 · 2022年7月7日

Efficient Self-supervised Vision Transformers for Representation Learning

Arxiv

0+阅读 · 2022年7月6日

Don't Pay Attention to the Noise: Learning Self-supervised Representations of Light Curves with a Denoising Time Series Transformer

Arxiv

0+阅读 · 2022年7月6日

A Survey of Visual Transformers

Arxiv

39+阅读 · 2021年11月11日

Generative Models as a Data Source for Multiview Representation Learning

Arxiv

16+阅读 · 2021年6月9日

CausalVAE: Disentangled Representation Learning via Neural Structural Causal Models

Arxiv

17+阅读 · 2021年3月23日

Disentangled Information Bottleneck

Disentangled Information Bottleneck

Arxiv

12+阅读 · 2020年12月22日

Dissecting Contextual Word Embeddings: Architecture and Representation

Dissecting Contextual Word Embeddings: Architecture and Representation

Arxiv

22+阅读 · 2018年8月27日

VIP会员

文章信息

相关主题

潜变量/隐变量

相关VIP内容

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

ExBert — 可视化分析Transformer学到的表示

ExBert — 可视化分析Transformer学到的表示

专知会员服务

32+阅读 · 2019年10月16日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【牛津博士论文】零样本强化学习综述

《美军条令：陆军指挥官与规划人员地理空间指南》60页

战术边缘指挥控制：防务面临的核心挑战

迈向开放世界检测：综述

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

【论文推荐】最新九篇自动问答相关论文—可解释推理网络、上下文知识图谱嵌入、注意力RNN、Multi-Cast注意力网络

【论文推荐】最新九篇自动问答相关论文—可解释推理网络、上下文知识图谱嵌入、注意力RNN、Multi-Cast注意力网络

专知

15+阅读 · 2018年6月29日

【论文推荐】最新5篇图像描述生成（Image Caption）相关论文—情感、注意力机制、遥感图像、序列到序列、深度神经结构

【论文推荐】最新5篇图像描述生成（Image Caption）相关论文—情感、注意力机制、遥感图像、序列到序列、深度神经结构

专知

66+阅读 · 2018年1月31日

可解释的CNN

可解释的CNN

CreateAMind

17+阅读 · 2017年10月5日

相关论文

A Medical Information Extraction Workbench to Process German Clinical Text

A Medical Information Extraction Workbench to Process German Clinical Text

Arxiv

0+阅读 · 2022年7月8日

TFCNs: A CNN-Transformer Hybrid Network for Medical Image Segmentation

TFCNs: A CNN-Transformer Hybrid Network for Medical Image Segmentation

Arxiv

0+阅读 · 2022年7月7日

VecGAN: Image-to-Image Translation with Interpretable Latent Directions

VecGAN: Image-to-Image Translation with Interpretable Latent Directions

Arxiv

0+阅读 · 2022年7月7日

Efficient Self-supervised Vision Transformers for Representation Learning

Arxiv

0+阅读 · 2022年7月6日

Don't Pay Attention to the Noise: Learning Self-supervised Representations of Light Curves with a Denoising Time Series Transformer

Arxiv

0+阅读 · 2022年7月6日

A Survey of Visual Transformers

Arxiv

39+阅读 · 2021年11月11日

Generative Models as a Data Source for Multiview Representation Learning

Arxiv

16+阅读 · 2021年6月9日

CausalVAE: Disentangled Representation Learning via Neural Structural Causal Models

Arxiv

17+阅读 · 2021年3月23日

Disentangled Information Bottleneck

Disentangled Information Bottleneck

Arxiv

12+阅读 · 2020年12月22日

Dissecting Contextual Word Embeddings: Architecture and Representation

Dissecting Contextual Word Embeddings: Architecture and Representation

Arxiv

22+阅读 · 2018年8月27日

相关基金

调控SOX9介导的Müller细胞过度活化对大鼠光损伤视网膜变性的保护作用及其机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

Mir-134/487b/668基因簇与PTEN介导的信号通路在恶性纤维组织细胞瘤中的作用及机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

Hippo通路在高糖诱导的肾小球系膜细胞增殖中的作用及机制

国家自然科学基金

0+阅读 · 2013年12月31日

杏仁核-海马CA1区-前额叶皮层环路异常在抑郁发生中的作用

国家自然科学基金

0+阅读 · 2013年12月31日

Wnt/β-catenin和 Hedgehog信号通路互作在骨关节中的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

线粒体蛋白SIRT5对氧化/硝化应激诱导胰岛beta细胞损伤的调控作用及机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

组蛋白去乙酰化酶抑制剂对骨关节炎中Notch-NFAT信号通路调控的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于MeCP2甲基化调控枯否细胞分泌炎症因子的栀子苷抗肝纤维化机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

SuFu 蛋白在细胞信号调控中的作用及机制

国家自然科学基金

0+阅读 · 2011年12月31日

转移相关基因FMNL2参与大肠癌Rho细胞运动信号通路的机制探讨

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员