Online learning via Bayes' theorem allows new data to be continuously integrated into an agent's current beliefs. However, a naive application of Bayesian methods in non-stationary environments leads to slow adaptation and results in state estimates that may converge confidently to the wrong parameter value. A common solution when learning in changing environments is to discard or downweight past data; however, this simple mechanism of "forgetting" fails to account for the fact that many real-world environments involve revisiting similar states. We propose a new framework, Bayes with Adaptive Memory (BAM), that takes advantage of past experience by allowing the agent to choose which past observations to remember and which to forget. We demonstrate that BAM generalizes many popular Bayesian update rules for non-stationary environments. Through a variety of experiments, we show BAM's ability to continuously adapt in an ever-changing world.
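To make the "downweight past data" baseline concrete, here is a minimal sketch (not BAM itself, whose memory-selection rule is not specified in this abstract) of a conjugate Beta-Bernoulli online update with exponential forgetting; the forgetting factor `gamma` and the simulated change-point stream are illustrative assumptions:

```python
import numpy as np

def forgetful_beta_update(alpha, beta, obs, gamma=0.95):
    """One online Beta-Bernoulli update with exponential forgetting.

    Past pseudo-counts are downweighted by gamma before the new
    observation is absorbed, so old evidence decays geometrically
    with an effective memory of roughly 1 / (1 - gamma) samples.
    """
    # Downweight accumulated pseudo-counts: the "forgetting" step.
    alpha, beta = gamma * alpha, gamma * beta
    # Standard conjugate Bayes update for a Bernoulli observation.
    return alpha + obs, beta + (1.0 - obs)

# A non-stationary stream: the coin's bias switches from 0.9 to 0.1.
rng = np.random.default_rng(0)
stream = np.concatenate([rng.random(200) < 0.9, rng.random(200) < 0.1])

a, b = 1.0, 1.0  # uniform Beta(1, 1) prior
for x in stream:
    a, b = forgetful_beta_update(a, b, float(x), gamma=0.95)

# Posterior mean tracks the recent bias rather than the stale regime.
print(round(a / (a + b), 2))
```

Note the trade-off the abstract highlights: after the switch, this updater adapts because old pseudo-counts decay, but it also discards everything learned in the first regime, so revisiting the 0.9-bias state would require relearning from scratch.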