与分异石 Stesteste 运算符一起的渐变估计 (Gradient Estimation with Discrete Stein Operators) - 专知论文

会员服务 ·

0

估计/估计量 · 离散化 · 方差 · 泛函 · 控制器 ·

2022 年 11 月 30 日

Gradient Estimation with Discrete Stein Operators

翻译：与分异石 Stesteste 运算符一起的渐变估计

Jiaxin Shi,Yuhao Zhou,Jessica Hwang,Michalis K. Titsias,Lester Mackey

from arxiv, NeurIPS 2022

Gradient estimation -- approximating the gradient of an expectation with respect to the parameters of a distribution -- is central to the solution of many machine learning problems. However, when the distribution is discrete, most common gradient estimators suffer from excessive variance. To improve the quality of gradient estimation, we introduce a variance reduction technique based on Stein operators for discrete distributions. We then use this technique to build flexible control variates for the REINFORCE leave-one-out estimator. Our control variates can be adapted online to minimize variance and do not require extra evaluations of the target function. In benchmark generative modeling tasks such as training binary variational autoencoders, our gradient estimator achieves substantially lower variance than state-of-the-art estimators with the same number of function evaluations.

翻译：渐变估计 -- -- 接近分配参数预期值的梯度 -- -- 是解决许多机器学习问题的核心。然而,如果分布是离散的,大多数常见的梯度估计者会受到差异过大的影响。为了提高梯度估计的质量,我们采用了基于斯坦因操作员的差异减少技术,用于分散分布。然后我们使用这一技术为REINFORCE休假单出估计器建立灵活的控制变量。我们的控制变量可以在网上调整,以尽量减少差异,而不需要对目标功能进行额外评价。在将培训二进制自动变异器等基因模型任务作为基准时,我们的梯度估测器的变差大大低于具有相同数量功能评价的、最先进的估量器。

0

相关内容

估计/估计量

估计/估计量

不可错过！杜克大学《因果推断》课程，全面讲述因果推理

不可错过！杜克大学《因果推断》课程，全面讲述因果推理

专知会员服务

52+阅读 · 2022年10月22日

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

纽约大学/Facebook人工智能研究院--2022春季最新课程【认知计算建模】Computational cognitive modeling（含ppt）

纽约大学/Facebook人工智能研究院--2022春季最新课程【认知计算建模】Computational cognitive modeling（含ppt）

专知会员服务

58+阅读 · 2022年2月28日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【ICCV 2019 Toturial】Global Optimization for Geometric Understanding with Provable Guarantees（具有可证明保证的几何理解的全局优化）

【ICCV 2019 Toturial】Global Optimization for Geometric Understanding with Provable Guarantees（具有可证明保证的几何理解的全局优化）

专知会员服务

18+阅读 · 2019年11月1日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

讲座报名丨 ICML专场

讲座报名丨 ICML专场

THU数据派

0+阅读 · 2021年9月15日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

罗巴代数的表示和罗巴代数在operad中的应用

国家自然科学基金

0+阅读 · 2015年12月31日

几类扩散过程的逼近及应用

国家自然科学基金

1+阅读 · 2014年12月31日

基于粘性解的随机时滞方程最优控制问题研究

国家自然科学基金

0+阅读 · 2013年12月31日

BER通路基因miRNA结合位点基因多态性与结直肠癌易感性的关联及功能研究

国家自然科学基金

0+阅读 · 2013年12月31日

渐近锥流形上色散方程的研究

国家自然科学基金

0+阅读 · 2013年12月31日

调控山羊卵巢雌激素和孕酮的相关miRNA的鉴定及作用机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

退化k-Hessian方程解的正则性研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于梯度Kriging方法的轮胎花纹形状优化

国家自然科学基金

0+阅读 · 2011年12月31日

Navier-Stokes方程稳定化有限元方法后验误差估计

国家自然科学基金

0+阅读 · 2011年12月31日

p进表示的伽罗瓦上同调

国家自然科学基金

0+阅读 · 2008年12月31日

Randomized low-rank approximation of parameter-dependent matrices

Arxiv

0+阅读 · 2023年2月24日

Balanced Off-Policy Evaluation for Personalized Pricing

Arxiv

0+阅读 · 2023年2月24日

Bayesian Structure Scores for Probabilistic Circuits

Arxiv

0+阅读 · 2023年2月23日

Communication-Efficient Distributed Estimation and Inference for Cox's Model

Arxiv

0+阅读 · 2023年2月23日

When is Momentum Extragradient Optimal? A Polynomial-Based Analysis

Arxiv

0+阅读 · 2023年2月23日

Exploration by self-supervised exploitation

Arxiv

0+阅读 · 2023年2月22日

Estimating Total Correlation with Mutual Information Estimators

Arxiv

0+阅读 · 2023年2月22日

Contrastive Representation Learning for Acoustic Parameter Estimation

Arxiv

0+阅读 · 2023年2月22日

Error Estimation for Random Fourier Features

Arxiv

0+阅读 · 2023年2月22日

Derivative-Informed Neural Operator: An Efficient Framework for High-Dimensional Parametric Derivative Learning

Arxiv

0+阅读 · 2023年2月21日

VIP会员

文章信息

相关主题

估计/估计量

相关VIP内容

不可错过！杜克大学《因果推断》课程，全面讲述因果推理

不可错过！杜克大学《因果推断》课程，全面讲述因果推理

专知会员服务

52+阅读 · 2022年10月22日

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

纽约大学/Facebook人工智能研究院--2022春季最新课程【认知计算建模】Computational cognitive modeling（含ppt）

纽约大学/Facebook人工智能研究院--2022春季最新课程【认知计算建模】Computational cognitive modeling（含ppt）

专知会员服务

58+阅读 · 2022年2月28日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【ICCV 2019 Toturial】Global Optimization for Geometric Understanding with Provable Guarantees（具有可证明保证的几何理解的全局优化）

【ICCV 2019 Toturial】Global Optimization for Geometric Understanding with Provable Guarantees（具有可证明保证的几何理解的全局优化）

专知会员服务

18+阅读 · 2019年11月1日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【CMU博士论文】用于提升含优化层学习的算法与体系结构

【NeurIPS2025】有何不同于过去？基于自监督偏差学习的时空时间序列预测

超越决策优势：情报在创新与适应中的作用

量子计算发展态势研究报告（2025年）

相关资讯

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

讲座报名丨 ICML专场

讲座报名丨 ICML专场

THU数据派

0+阅读 · 2021年9月15日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

Randomized low-rank approximation of parameter-dependent matrices

Arxiv

0+阅读 · 2023年2月24日

Balanced Off-Policy Evaluation for Personalized Pricing

Arxiv

0+阅读 · 2023年2月24日

Bayesian Structure Scores for Probabilistic Circuits

Arxiv

0+阅读 · 2023年2月23日

Communication-Efficient Distributed Estimation and Inference for Cox's Model

Arxiv

0+阅读 · 2023年2月23日

When is Momentum Extragradient Optimal? A Polynomial-Based Analysis

Arxiv

0+阅读 · 2023年2月23日

Exploration by self-supervised exploitation

Arxiv

0+阅读 · 2023年2月22日

Estimating Total Correlation with Mutual Information Estimators

Arxiv

0+阅读 · 2023年2月22日

Contrastive Representation Learning for Acoustic Parameter Estimation

Arxiv

0+阅读 · 2023年2月22日

Error Estimation for Random Fourier Features

Arxiv

0+阅读 · 2023年2月22日

Derivative-Informed Neural Operator: An Efficient Framework for High-Dimensional Parametric Derivative Learning

Arxiv

0+阅读 · 2023年2月21日

相关基金

罗巴代数的表示和罗巴代数在operad中的应用

国家自然科学基金

0+阅读 · 2015年12月31日

几类扩散过程的逼近及应用

国家自然科学基金

1+阅读 · 2014年12月31日

基于粘性解的随机时滞方程最优控制问题研究

国家自然科学基金

0+阅读 · 2013年12月31日

BER通路基因miRNA结合位点基因多态性与结直肠癌易感性的关联及功能研究

国家自然科学基金

0+阅读 · 2013年12月31日

渐近锥流形上色散方程的研究

国家自然科学基金

0+阅读 · 2013年12月31日

调控山羊卵巢雌激素和孕酮的相关miRNA的鉴定及作用机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

退化k-Hessian方程解的正则性研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于梯度Kriging方法的轮胎花纹形状优化

国家自然科学基金

0+阅读 · 2011年12月31日

Navier-Stokes方程稳定化有限元方法后验误差估计

国家自然科学基金

0+阅读 · 2011年12月31日

p进表示的伽罗瓦上同调

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员