Latent Diffusion Energy-Based Model for Interpretable Text Modeling - Zhuanzhi Papers



July 4, 2022

Latent Diffusion Energy-Based Model for Interpretable Text Modeling


Peiyu Yu, Sirui Xie, Xiaojian Ma, Baoxiong Jia, Bo Pang, Ruiqi Gao, Yixin Zhu, Song-Chun Zhu, Ying Nian Wu

From arXiv, ICML 2022

Latent space Energy-Based Models (EBMs), also known as energy-based priors, have drawn growing interest in generative modeling. Fueled by their flexible formulation and the strong modeling power of the latent space, recent works built upon them have made interesting attempts at interpretable text modeling. However, latent space EBMs also inherit some flaws from EBMs in data space: degenerate MCMC sampling quality in practice can lead to poor generation quality and unstable training, especially on data with complex latent structures. Inspired by recent efforts that leverage diffusion recovery likelihood learning as a cure for the sampling issue, we introduce a novel symbiosis between diffusion models and latent space EBMs in a variational learning framework, coined the latent diffusion energy-based model. We develop a geometric clustering-based regularization jointly with the information bottleneck to further improve the quality of the learned latent space. Experiments on several challenging tasks demonstrate the superior performance of our model on interpretable text modeling over strong counterparts.
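To make the MCMC sampling step mentioned in the abstract more concrete, here is a minimal sketch of Langevin dynamics drawing samples from an energy-based prior of the form p(z) ∝ exp(-E(z)) N(z; 0, I). This is not the authors' implementation. The abstract does not specify the sampler, so Langevin dynamics is an assumption, and the quadratic toy energy, the function names, and the parameters `mu`, `k`, `step`, and `n_steps` are all hypothetical choices made only so the result can be checked by hand.

```python
import numpy as np


def grad_energy(z, mu, k):
    """Gradient of the toy energy E(z) = (k / 2) * ||z - mu||^2 (an assumed form)."""
    return k * (z - mu)


def langevin_prior_sample(n_chains, dim, mu, k, step=0.2, n_steps=200, seed=0):
    """Unadjusted Langevin dynamics targeting p(z) ∝ exp(-E(z)) * N(z; 0, I)."""
    rng = np.random.default_rng(seed)
    # Initialise every chain from the Gaussian base distribution N(0, I).
    z = rng.standard_normal((n_chains, dim))
    for _ in range(n_steps):
        # Negative log-density is U(z) = E(z) + ||z||^2 / 2, so grad U = grad E + z.
        grad_u = grad_energy(z, mu, k) + z
        # Langevin update: half a squared step down the gradient, plus Gaussian noise.
        z = z - 0.5 * step**2 * grad_u + step * rng.standard_normal(z.shape)
    return z


mu = np.array([2.0, 2.0])
samples = langevin_prior_sample(n_chains=2000, dim=2, mu=mu, k=1.0)
print("sample mean:", samples.mean(axis=0))
print("sample variance:", samples.var(axis=0))
```

With this toy energy the target is itself Gaussian, with mean k·mu/(k+1) and per-coordinate variance 1/(k+1), so the sample statistics should land near (1, 1) and 0.5. For a learned, multimodal energy no such closed form exists, and a short chain like this one may fail to mix between modes, which is the kind of degenerate sampling the abstract refers to.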

