SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization - Zhuanzhi (专知) Papers


VQ-VAE · Discretization · Variational autoencoder · Autoencoder · Representation

May 16, 2022


Yuhta Takida,Takashi Shibuya,WeiHsiang Liao,Chieh-Hsin Lai,Junki Ohmura,Toshimitsu Uesaka,Naoki Murata,Shusuke Takahashi,Toshiyuki Kumakura,Yuki Mitsufuji

From arXiv: 25 pages with 10 figures; accepted for publication at ICML 2022.

One noted issue of vector-quantized variational autoencoder (VQ-VAE) is that the learned discrete representation uses only a fraction of the full capacity of the codebook, also known as codebook collapse. We hypothesize that the training scheme of VQ-VAE, which involves some carefully designed heuristics, underlies this issue. In this paper, we propose a new training scheme that extends the standard VAE via novel stochastic dequantization and quantization, called stochastically quantized variational autoencoder (SQ-VAE). In SQ-VAE, we observe a trend that the quantization is stochastic at the initial stage of the training but gradually converges toward a deterministic quantization, which we call self-annealing. Our experiments show that SQ-VAE improves codebook utilization without using common heuristics. Furthermore, we empirically show that SQ-VAE is superior to VAE and VQ-VAE in vision- and speech-related tasks.
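The self-annealing behaviour described in the abstract can be illustrated with a small NumPy sketch. This is not the authors' implementation: it assumes a categorical distribution over codebook entries whose probabilities are proportional to exp(-||z - b_k||^2 / (2 * variance)), and the names `stochastic_quantize`, `codebook_perplexity`, and `variance` are hypothetical. Under that assumption, a large variance gives nearly uniform sampling over the codebook, while a variance close to zero reduces to the deterministic nearest-neighbour lookup used in VQ-VAE. Perplexity of the selected indices is used here as one common proxy for codebook utilization.

```python
import numpy as np


def stochastic_quantize(z, codebook, variance, rng):
    """Sample one codebook index per row of z (illustrative assumption only).

    Assumed form: P(k | z) is proportional to exp(-||z - b_k||^2 / (2 * variance)).
    Returns the sampled indices and the full probability matrix.
    """
    # Squared Euclidean distance from every latent to every code: shape (N, K).
    d2 = ((z[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=-1)
    logits = -d2 / (2.0 * variance)
    logits -= logits.max(axis=1, keepdims=True)  # stabilise the softmax
    probs = np.exp(logits)
    probs /= probs.sum(axis=1, keepdims=True)
    # Inverse-CDF sampling: one categorical draw per row.
    u = rng.random((z.shape[0], 1))
    idx = (probs.cumsum(axis=1) < u).sum(axis=1)
    idx = np.minimum(idx, codebook.shape[0] - 1)  # guard against round-off
    return idx, probs


def codebook_perplexity(indices, num_codes):
    """Exponential of the entropy of the empirical code-usage distribution."""
    counts = np.bincount(indices, minlength=num_codes)
    p = counts / counts.sum()
    nonzero = p[p > 0]
    return float(np.exp(-(nonzero * np.log(nonzero)).sum()))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    latents = rng.normal(size=(2000, 4))   # toy encoder outputs
    codes = rng.normal(size=(16, 4))       # toy codebook with 16 entries
    # Shrinking the variance mimics the stochastic-to-deterministic trend.
    for variance in (100.0, 1.0, 0.01, 1e-6):
        idx, _ = stochastic_quantize(latents, codes, variance, rng)
        print(variance, codebook_perplexity(idx, len(codes)))
```

In this toy setup the variance is swept by hand; in the paper the abstract reports that the quantization converges toward a deterministic one during training, which is what the authors call self-annealing.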

