Discrete 后端变量的渐变动动画 (Coupled Gradient Estimators for Discrete Latent Variables) - 专知论文

会员服务 ·

0

估计/估计量 · 潜变量/隐变量 · Continuity · 再参数化/重参数化 · 离散化 ·

2021 年 6 月 15 日

Coupled Gradient Estimators for Discrete Latent Variables

翻译：Discrete 后端变量的渐变动动画

Zhe Dong,Andriy Mnih,George Tucker

from arxiv, Under Review

Training models with discrete latent variables is challenging due to the high variance of unbiased gradient estimators. While low-variance reparameterization gradients of a continuous relaxation can provide an effective solution, a continuous relaxation is not always available or tractable. Dong et al. (2020) and Yin et al. (2020) introduced a performant estimator that does not rely on continuous relaxations; however, it is limited to binary random variables. We introduce a novel derivation of their estimator based on importance sampling and statistical couplings, which we extend to the categorical setting. Motivated by the construction of a stick-breaking coupling, we introduce gradient estimators based on reparameterizing categorical variables as sequences of binary variables and Rao-Blackwellization. In systematic experiments, we show that our proposed categorical gradient estimators provide state-of-the-art performance, whereas even with additional Rao-Blackwellization, previous estimators (Yin et al., 2019) underperform a simpler REINFORCE with a leave-one-out-baseline estimator (Kool et al., 2019).

翻译：由于不偏向梯度估计值差异很大,具有离散潜伏变量的培训模式具有挑战性。虽然持续放松的低差再量化梯度可以提供有效的解决方案,但持续放松并不总是可用或可移动的。Dong等人(2020年)和Yin等人(202020年)引入了不依赖连续放松的性能估计仪;然而,它仅限于二进制随机变量。我们引入了基于重要抽样和统计组合的新颖估计器,我们将其扩展到绝对环境。我们借助于建造一个破碎的组合,我们引入了基于对绝对变量进行再量化的梯度估计器,作为二进制变量和彩虹-黑化的序列。在系统实验中,我们显示我们提议的绝对梯度估计器提供了最新性性性能,而即使增加了Rao-Blackwelliz,以前的估计器(Yin等人,2019年)也低于一个简单的REINFORCE,以离线标值为基础的基线(Kol等人,19年)。

0

相关内容

估计/估计量

估计/估计量

《算法凸几何》简明书，Algorithmic Convex Geometry，50页pdf

专知会员服务

42+阅读 · 2021年4月2日

【CVPR2020-CUHK】探索和利用GANs中的可解释语义，60页ppt，Exploring and Exploiting Interpretable Semantics in GANs

【CVPR2020-CUHK】探索和利用GANs中的可解释语义，60页ppt，Exploring and Exploiting Interpretable Semantics in GANs

专知会员服务

13+阅读 · 2020年6月18日

图机器学习-图拉普拉斯算子的离散正则性，141页ppt，Discrete regularity graph Laplacians

专知会员服务

29+阅读 · 2020年6月4日

(普林斯顿讲义)：高维概率论，326页pdf《Probability in High Dimension》

(普林斯顿讲义)：高维概率论，326页pdf《Probability in High Dimension》

专知会员服务

122+阅读 · 2020年5月30日

【北京大学】探索提取跨模态信息进行图像caption，Exploring and Distilling Cross-Modal Information for Image Captioning

【北京大学】探索提取跨模态信息进行图像caption，Exploring and Distilling Cross-Modal Information for Image Captioning

专知会员服务

54+阅读 · 2020年3月3日

【北京智源大会2019】神经网络的优化Optimization for Overparametrized Deep Neural Networks，北京大学 | 王立威

【北京智源大会2019】神经网络的优化Optimization for Overparametrized Deep Neural Networks，北京大学 | 王立威

专知会员服务

23+阅读 · 2019年11月21日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

【OpenAI】深度强化学习关键论文列表

【OpenAI】深度强化学习关键论文列表

专知

11+阅读 · 2018年11月10日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

条件GAN重大改进！cGANs with Projection Discriminator

条件GAN重大改进！cGANs with Projection Discriminator

CreateAMind

8+阅读 · 2018年2月7日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

Implicitly Regularized RL with Implicit Q-Values

Arxiv

0+阅读 · 2021年8月16日

MINE: Mutual Information Neural Estimation

Arxiv

0+阅读 · 2021年8月14日

Adaptive optimal estimation of irregular mean and covariance functions

Arxiv

0+阅读 · 2021年8月14日

A Generalization of the Ornstein-Uhlenbeck Process: Theoretical Results, Simulations and Parameter Estimation

Arxiv

0+阅读 · 2021年8月13日

Distributional Depth-Based Estimation of Object Articulation Models

Distributional Depth-Based Estimation of Object Articulation Models

Arxiv

0+阅读 · 2021年8月12日

Optimal error estimates of a second-order projection finite element method for magnetohydrodynamic equations

Arxiv

0+阅读 · 2021年8月12日

A posteriori error analysis of a local adaptive discontinuous Galerkin method for convection-diffusion-reaction equations

Arxiv

0+阅读 · 2021年8月12日

Learning Discrete Structures for Graph Neural Networks

Arxiv

6+阅读 · 2019年5月17日

PPO-CMA: Proximal Policy Optimization with Covariance Matrix Adaptation

PPO-CMA: Proximal Policy Optimization with Covariance Matrix Adaptation

Arxiv

8+阅读 · 2018年12月18日

Implicit Maximum Likelihood Estimation

Implicit Maximum Likelihood Estimation

Arxiv

7+阅读 · 2018年9月24日

VIP会员

文章信息

相关主题

估计/估计量

潜变量/隐变量

再参数化/重参数化

相关VIP内容

《算法凸几何》简明书，Algorithmic Convex Geometry，50页pdf

专知会员服务

42+阅读 · 2021年4月2日

【CVPR2020-CUHK】探索和利用GANs中的可解释语义，60页ppt，Exploring and Exploiting Interpretable Semantics in GANs

【CVPR2020-CUHK】探索和利用GANs中的可解释语义，60页ppt，Exploring and Exploiting Interpretable Semantics in GANs

专知会员服务

13+阅读 · 2020年6月18日

图机器学习-图拉普拉斯算子的离散正则性，141页ppt，Discrete regularity graph Laplacians

专知会员服务

29+阅读 · 2020年6月4日

(普林斯顿讲义)：高维概率论，326页pdf《Probability in High Dimension》

(普林斯顿讲义)：高维概率论，326页pdf《Probability in High Dimension》

专知会员服务

122+阅读 · 2020年5月30日

【北京大学】探索提取跨模态信息进行图像caption，Exploring and Distilling Cross-Modal Information for Image Captioning

【北京大学】探索提取跨模态信息进行图像caption，Exploring and Distilling Cross-Modal Information for Image Captioning

专知会员服务

54+阅读 · 2020年3月3日

【北京智源大会2019】神经网络的优化Optimization for Overparametrized Deep Neural Networks，北京大学 | 王立威

【北京智源大会2019】神经网络的优化Optimization for Overparametrized Deep Neural Networks，北京大学 | 王立威

专知会员服务

23+阅读 · 2019年11月21日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【CMU博士论文】数据驱动决策中的激励、信息与不确定性

DGP双粒度提示框架：图增强大模型助力欺诈检测

【ICCV2025】ESSENTIAL：用于视频类增量学习的情景记忆与语义记忆整合

唯快不破：大型语言模型高效架构综述

相关资讯

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

【OpenAI】深度强化学习关键论文列表

【OpenAI】深度强化学习关键论文列表

专知

11+阅读 · 2018年11月10日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

条件GAN重大改进！cGANs with Projection Discriminator

条件GAN重大改进！cGANs with Projection Discriminator

CreateAMind

8+阅读 · 2018年2月7日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

Implicitly Regularized RL with Implicit Q-Values

Arxiv

0+阅读 · 2021年8月16日

MINE: Mutual Information Neural Estimation

Arxiv

0+阅读 · 2021年8月14日

Adaptive optimal estimation of irregular mean and covariance functions

Arxiv

0+阅读 · 2021年8月14日

A Generalization of the Ornstein-Uhlenbeck Process: Theoretical Results, Simulations and Parameter Estimation

Arxiv

0+阅读 · 2021年8月13日

Distributional Depth-Based Estimation of Object Articulation Models

Distributional Depth-Based Estimation of Object Articulation Models

Arxiv

0+阅读 · 2021年8月12日

Optimal error estimates of a second-order projection finite element method for magnetohydrodynamic equations

Arxiv

0+阅读 · 2021年8月12日

A posteriori error analysis of a local adaptive discontinuous Galerkin method for convection-diffusion-reaction equations

Arxiv

0+阅读 · 2021年8月12日

Learning Discrete Structures for Graph Neural Networks

Arxiv

6+阅读 · 2019年5月17日

PPO-CMA: Proximal Policy Optimization with Covariance Matrix Adaptation

PPO-CMA: Proximal Policy Optimization with Covariance Matrix Adaptation

Arxiv

8+阅读 · 2018年12月18日

Implicit Maximum Likelihood Estimation

Implicit Maximum Likelihood Estimation

Arxiv

7+阅读 · 2018年9月24日

微信扫码咨询专知VIP会员