Improving Variational Autoencoders with Density Gap-based Regularization - Zhuanzhi Papers


Regularization term · Autoencoder · Variational autoencoder · Optimizer · Learning

November 1, 2022

Improving Variational Autoencoders with Density Gap-based Regularization


Jianfei Zhang,Jun Bai,Chenghua Lin,Yanmeng Wang,Wenge Rong

From arXiv; accepted to NeurIPS 2022

Variational autoencoders (VAEs) are powerful unsupervised learning frameworks in NLP for latent representation learning and latent-directed generation. The classic optimization goal of VAEs is to maximize the Evidence Lower Bound (ELBo), which consists of a conditional likelihood for generation and a negative Kullback-Leibler (KL) divergence for regularization. In practice, optimizing the ELBo often leads the posterior distributions of all samples to converge to the same degenerate local optimum, a failure known as posterior collapse or KL vanishing. Effective methods have been proposed to prevent posterior collapse in VAEs, but we observe that they in essence trade posterior collapse off against the hole problem, i.e., the mismatch between the aggregated posterior distribution and the prior distribution. To this end, we introduce new training objectives that tackle both problems through a novel regularization based on the probabilistic density gap between the aggregated posterior distribution and the prior distribution. Through experiments on language modeling, latent space visualization, and interpolation, we show that our proposed method solves both problems effectively and thus outperforms existing methods in latent-directed generation. To the best of our knowledge, we are the first to jointly solve the hole problem and posterior collapse.
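The two failure modes named in the abstract can be made concrete with a small numerical sketch. It is not the paper's training objective, which the abstract does not spell out. It only evaluates the two quantities the abstract refers to, under the assumption of diagonal Gaussian posteriors and a standard normal prior. All function names (`gaussian_kl_to_standard_normal`, `log_aggregated_posterior`, `density_gap`) are hypothetical and chosen for illustration.

```python
import math

def gaussian_kl_to_standard_normal(mu, sigma):
    """KL( N(mu, diag(sigma^2)) || N(0, I) ), summed over latent dimensions."""
    return sum(0.5 * (m * m + s * s - 1.0) - math.log(s) for m, s in zip(mu, sigma))

def log_normal_pdf(z, mu, sigma):
    """Log density of a diagonal Gaussian evaluated at the point z."""
    return sum(
        -0.5 * math.log(2.0 * math.pi) - math.log(s) - 0.5 * ((x - m) / s) ** 2
        for x, m, s in zip(z, mu, sigma)
    )

def log_aggregated_posterior(z, posteriors):
    """Log of q(z) = (1/N) * sum_i q(z | x_i), computed with log-sum-exp."""
    logs = [log_normal_pdf(z, mu, sigma) for mu, sigma in posteriors]
    top = max(logs)
    return top + math.log(sum(math.exp(v - top) for v in logs)) - math.log(len(logs))

def density_gap(z, posteriors):
    """log q(z) - log p(z) at z, assuming the prior p(z) = N(0, I)."""
    log_prior = log_normal_pdf(z, [0.0] * len(z), [1.0] * len(z))
    return log_aggregated_posterior(z, posteriors) - log_prior

# Posterior collapse: every sample's posterior equals the prior, so each KL term is 0.
collapsed = [([0.0], [1.0])] * 4
collapsed_kls = [gaussian_kl_to_standard_normal(mu, sigma) for mu, sigma in collapsed]

# Hole problem: narrow posteriors far from the origin leave the high-density
# region of the prior (around z = 0) almost uncovered by the aggregated posterior.
scattered = [([-3.0], [0.1]), ([3.0], [0.1])]
scattered_kls = [gaussian_kl_to_standard_normal(mu, sigma) for mu, sigma in scattered]
gap_at_origin = density_gap([0.0], scattered)

print("collapsed KL terms:", collapsed_kls)
print("scattered KL terms:", scattered_kls)
print("density gap at z = 0 (scattered):", gap_at_origin)
```

In the collapsed case the KL terms vanish and the density gap is zero everywhere, so the latent code carries no information about the input. In the scattered case the KL terms are large, but the density gap at the origin is strongly negative: a latent drawn from the prior there lands in a region the encoder never used. This is the trade-off the abstract describes between the two problems.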

