保守目标模型是一种特殊类型的对比散度能量模型 (Conservative objective models are a special kind of contrastive divergence-based energy model) - 专知论文

会员服务 ·

0

对比散度 · 目标模型 · 能量模型 · 概率 · 散度 ·

2023 年 4 月 7 日

Conservative objective models are a special kind of contrastive divergence-based energy model

翻译：保守目标模型是一种特殊类型的对比散度能量模型

Christopher Beckham,Christopher Pal

In this work we theoretically show that conservative objective models (COMs) for offline model-based optimisation (MBO) are a special kind of contrastive divergence-based energy model, one where the energy function represents both the unconditional probability of the input and the conditional probability of the reward variable. While the initial formulation only samples modes from its learned distribution, we propose a simple fix that replaces its gradient ascent sampler with a Langevin MCMC sampler. This gives rise to a special probabilistic model where the probability of sampling an input is proportional to its predicted reward. Lastly, we show that better samples can be obtained if the model is decoupled so that the unconditional and conditional probabilities are modelled separately.

翻译：---- 在本研究中，我们理论上证明了保守目标模型（COMs）用于离线模型优化（MBO）是一种特殊类型的对比散度能量模型，即能量函数表示了输入的无条件概率和奖励变量的条件概率。虽然最初的建模方法仅从其学习的分布中采样模式，但我们提出了一个简单的修正措施，将其梯度上升采样器替换为Langevin MCMC采样器。这导致产生一种特殊的概率模型，其中采样输入的概率与其预测的奖励成正比。最后，我们表明如果将模型分离开来，分别对无条件概率和条件概率进行建模，就能够获得更好的样本。

0

相关内容

对比散度

【CVPR2022】通过特征Mixing进行主动学习

【CVPR2022】通过特征Mixing进行主动学习

专知会员服务

26+阅读 · 2022年3月15日

【DeepMind】基于模型的强化学习，174页ppt，Model-Based Reinforcement Learning

【DeepMind】基于模型的强化学习，174页ppt，Model-Based Reinforcement Learning

专知会员服务

89+阅读 · 2021年1月12日

【CVPR2020】物体实例持续学习，Continual Learning of Object Instances

【CVPR2020】物体实例持续学习，Continual Learning of Object Instances

专知会员服务

32+阅读 · 2020年4月26日

【基于模型的强化学习的博弈论框架】A Game Theoretic Framework for Model Based Reinforcement Learning

【基于模型的强化学习的博弈论框架】A Game Theoretic Framework for Model Based Reinforcement Learning

专知会员服务

131+阅读 · 2020年4月19日

【DeepMind】PolyGen: 一种三维网格的自回归生成模型，PolyGen: An Autoregressive Generative Model of 3D Meshes

【DeepMind】PolyGen: 一种三维网格的自回归生成模型，PolyGen: An Autoregressive Generative Model of 3D Meshes

专知会员服务

37+阅读 · 2020年2月27日

【清华大学】诊断和增强VAE模型，Diagnosing and Enhancing VAE Models

【清华大学】诊断和增强VAE模型，Diagnosing and Enhancing VAE Models

专知会员服务

37+阅读 · 2020年2月27日

【CVPR2020】CONSAC: 基于条件样本一致性的稳健多模型拟合，Robust Multi-Model Fitting by Conditional Sample Consensus

【CVPR2020】CONSAC: 基于条件样本一致性的稳健多模型拟合，Robust Multi-Model Fitting by Conditional Sample Consensus

专知会员服务

32+阅读 · 2020年2月24日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

【斯坦福大学Chelsea Finn-NeurIPS 2019】贝叶斯元学习

【斯坦福大学Chelsea Finn-NeurIPS 2019】贝叶斯元学习

专知会员服务

38+阅读 · 2019年12月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

GNN 新基准！Long Range Graph Benchmark

GNN 新基准！Long Range Graph Benchmark

图与推荐

0+阅读 · 2022年10月18日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

笔记 | Deep active learning for named entity recognition

笔记 | Deep active learning for named entity recognition

黑龙江大学自然语言处理实验室

24+阅读 · 2018年5月27日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

中国地区气溶胶吸湿增长因子参数化的模式研究

国家自然科学基金

0+阅读 · 2013年12月31日

关于具有奇异参数的偏微分方程边值问题与带双边反射的随机偏微分方程的研究

国家自然科学基金

0+阅读 · 2013年12月31日

柽柳Dof转录因子的耐盐调控机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

一类时滞积分方程解的存在性

国家自然科学基金

0+阅读 · 2012年12月31日

保守振动方程周期解的存在性研究

国家自然科学基金

0+阅读 · 2012年12月31日

Markov跳跃随机非线性系统的有限时间稳定与镇定

国家自然科学基金

1+阅读 · 2012年12月31日

测量值相关的稀疏信号可重构条件研究

国家自然科学基金

0+阅读 · 2012年12月31日

取向多晶Fe-Ga合金逆效应的动态特性与本征非线性动态模型研究

国家自然科学基金

0+阅读 · 2012年12月31日

压缩采样框架下的自适应稀疏信号感知与重建

国家自然科学基金

0+阅读 · 2009年12月31日

“#32511;洲—#33618;漠”#23707;屿生态种群的扩散模型研究

国家自然科学基金

0+阅读 · 2009年12月31日

Matrix Quantile Factor Model

Arxiv

0+阅读 · 2023年5月26日

Sampling random graphs with specified degree sequences

Arxiv

0+阅读 · 2023年5月26日

Memory-Based Meta-Learning on Non-Stationary Distributions

Arxiv

0+阅读 · 2023年5月25日

Learning to Compute the Articulatory Representations of Speech with the MIRRORNET

Arxiv

0+阅读 · 2023年5月25日

Flover: A Temporal Fusion Framework for Efficient Autoregressive Model Parallel Inference

Arxiv

0+阅读 · 2023年5月24日

Solving Diffusion ODEs with Optimal Boundary Conditions for Better Image Super-Resolution

Solving Diffusion ODEs with Optimal Boundary Conditions for Better Image Super-Resolution

Arxiv

0+阅读 · 2023年5月24日

Towards Optimizing Storage Costs on the Cloud

Arxiv

0+阅读 · 2023年5月24日

Transformers are Meta-Reinforcement Learners

Arxiv

15+阅读 · 2022年6月14日

Active Learning for Domain Adaptation: An Energy-based Approach

Arxiv

13+阅读 · 2021年12月2日

Class-Balanced Loss Based on Effective Number of Samples

Arxiv

12+阅读 · 2019年1月16日

VIP会员

文章信息

相关主题

相关VIP内容

【CVPR2022】通过特征Mixing进行主动学习

【CVPR2022】通过特征Mixing进行主动学习

专知会员服务

26+阅读 · 2022年3月15日

【DeepMind】基于模型的强化学习，174页ppt，Model-Based Reinforcement Learning

【DeepMind】基于模型的强化学习，174页ppt，Model-Based Reinforcement Learning

专知会员服务

89+阅读 · 2021年1月12日

【CVPR2020】物体实例持续学习，Continual Learning of Object Instances

【CVPR2020】物体实例持续学习，Continual Learning of Object Instances

专知会员服务

32+阅读 · 2020年4月26日

【基于模型的强化学习的博弈论框架】A Game Theoretic Framework for Model Based Reinforcement Learning

【基于模型的强化学习的博弈论框架】A Game Theoretic Framework for Model Based Reinforcement Learning

专知会员服务

131+阅读 · 2020年4月19日

【DeepMind】PolyGen: 一种三维网格的自回归生成模型，PolyGen: An Autoregressive Generative Model of 3D Meshes

【DeepMind】PolyGen: 一种三维网格的自回归生成模型，PolyGen: An Autoregressive Generative Model of 3D Meshes

专知会员服务

37+阅读 · 2020年2月27日

【清华大学】诊断和增强VAE模型，Diagnosing and Enhancing VAE Models

【清华大学】诊断和增强VAE模型，Diagnosing and Enhancing VAE Models

专知会员服务

37+阅读 · 2020年2月27日

【CVPR2020】CONSAC: 基于条件样本一致性的稳健多模型拟合，Robust Multi-Model Fitting by Conditional Sample Consensus

【CVPR2020】CONSAC: 基于条件样本一致性的稳健多模型拟合，Robust Multi-Model Fitting by Conditional Sample Consensus

专知会员服务

32+阅读 · 2020年2月24日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

【斯坦福大学Chelsea Finn-NeurIPS 2019】贝叶斯元学习

【斯坦福大学Chelsea Finn-NeurIPS 2019】贝叶斯元学习

专知会员服务

38+阅读 · 2019年12月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

《乌克兰无人机产业：志愿者与政策在构建新兴无人机产业中的协同作用》最新报告

《人工智能辅助决策中的数据可视化：系统性综述》

人工智能驱动弹药制造现代化：美国陆军转型之路

《敏捷作战部署中枢纽-辐条基地选址优化研究》80页

相关资讯

GNN 新基准！Long Range Graph Benchmark

GNN 新基准！Long Range Graph Benchmark

图与推荐

0+阅读 · 2022年10月18日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

笔记 | Deep active learning for named entity recognition

笔记 | Deep active learning for named entity recognition

黑龙江大学自然语言处理实验室

24+阅读 · 2018年5月27日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

Matrix Quantile Factor Model

Arxiv

0+阅读 · 2023年5月26日

Sampling random graphs with specified degree sequences

Arxiv

0+阅读 · 2023年5月26日

Memory-Based Meta-Learning on Non-Stationary Distributions

Arxiv

0+阅读 · 2023年5月25日

Learning to Compute the Articulatory Representations of Speech with the MIRRORNET

Arxiv

0+阅读 · 2023年5月25日

Flover: A Temporal Fusion Framework for Efficient Autoregressive Model Parallel Inference

Arxiv

0+阅读 · 2023年5月24日

Solving Diffusion ODEs with Optimal Boundary Conditions for Better Image Super-Resolution

Solving Diffusion ODEs with Optimal Boundary Conditions for Better Image Super-Resolution

Arxiv

0+阅读 · 2023年5月24日

Towards Optimizing Storage Costs on the Cloud

Arxiv

0+阅读 · 2023年5月24日

Transformers are Meta-Reinforcement Learners

Arxiv

15+阅读 · 2022年6月14日

Active Learning for Domain Adaptation: An Energy-based Approach

Arxiv

13+阅读 · 2021年12月2日

Class-Balanced Loss Based on Effective Number of Samples

Arxiv

12+阅读 · 2019年1月16日

相关基金

中国地区气溶胶吸湿增长因子参数化的模式研究

国家自然科学基金

0+阅读 · 2013年12月31日

关于具有奇异参数的偏微分方程边值问题与带双边反射的随机偏微分方程的研究

国家自然科学基金

0+阅读 · 2013年12月31日

柽柳Dof转录因子的耐盐调控机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

一类时滞积分方程解的存在性

国家自然科学基金

0+阅读 · 2012年12月31日

保守振动方程周期解的存在性研究

国家自然科学基金

0+阅读 · 2012年12月31日

Markov跳跃随机非线性系统的有限时间稳定与镇定

国家自然科学基金

1+阅读 · 2012年12月31日

测量值相关的稀疏信号可重构条件研究

国家自然科学基金

0+阅读 · 2012年12月31日

取向多晶Fe-Ga合金逆效应的动态特性与本征非线性动态模型研究

国家自然科学基金

0+阅读 · 2012年12月31日

压缩采样框架下的自适应稀疏信号感知与重建

国家自然科学基金

0+阅读 · 2009年12月31日

“#32511;洲—#33618;漠”#23707;屿生态种群的扩散模型研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员