通过政策嵌入,使代代用辅助的进化强化学习成为可能 (Enabling surrogate-assisted evolutionary reinforcement learning via policy embedding) - 专知论文

会员服务 ·

0

Learning · 强化学习 · Weight · DNN · 可约的 ·

2023 年 1 月 31 日

Enabling surrogate-assisted evolutionary reinforcement learning via policy embedding

翻译：通过政策嵌入,使代代用辅助的进化强化学习成为可能

Lan Tang,Xiaxi Li,Jinyuan Zhang,Guiying Li,Peng Yang,Ke Tang

from arxiv, This paper is submitted to bicta-2022

Evolutionary Reinforcement Learning (ERL) that applying Evolutionary Algorithms (EAs) to optimize the weight parameters of Deep Neural Network (DNN) based policies has been widely regarded as an alternative to traditional reinforcement learning methods. However, the evaluation of the iteratively generated population usually requires a large amount of computational time and can be prohibitively expensive, which may potentially restrict the applicability of ERL. Surrogate is often used to reduce the computational burden of evaluation in EAs. Unfortunately, in ERL, each individual of policy usually represents millions of weights parameters of DNN. This high-dimensional representation of policy has introduced a great challenge to the application of surrogates into ERL to speed up training. This paper proposes a PE-SAERL Framework to at the first time enable surrogate-assisted evolutionary reinforcement learning via policy embedding (PE). Empirical results on 5 Atari games show that the proposed method can perform more efficiently than the four state-of-the-art algorithms. The training process is accelerated up to 7x on tested games, comparing to its counterpart without the surrogate and PE.

翻译：应用进化分数优化深神经网络(DNN)政策重量参数的进化强化学习(ERL)应用进化分数优化深神经网络(EAs)政策被广泛视为传统强化学习方法的一种替代方法,然而,对迭代生成的人口的评价通常需要大量计算时间,而且可能过于昂贵,这可能会限制ERL的适用性。代孕常常被用来减少EAs中评估的计算负担。不幸的是,在ERL中,每个政策个体通常代表DNN数以百万计的重量参数。这种高度的政策表现对将代孕器应用到ERL以加速培训带来了巨大的挑战。本文提议了一个PE-SAERL框架, 首次通过政策嵌入(PE)使代孕辅助进化强化学习成为可能(PE) 。 5 Atari游戏的经验显示,拟议的方法可以比四种最先进的算法效率更高。在测试的游戏上加速到7x,比没有代孕和PE的对口。

0

相关内容

Learning

【MIla】一种意识启发规划的基于模型强化学习，A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

【MIla】一种意识启发规划的基于模型强化学习，A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

专知会员服务

23+阅读 · 2022年3月19日

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

专知会员服务

104+阅读 · 2022年2月10日

【如何做研究】How to research ，22页ppt

【如何做研究】How to research ，22页ppt

专知会员服务

112+阅读 · 2021年4月17日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

Multi-Task Learning的几篇综述文章

Multi-Task Learning的几篇综述文章

深度学习自然语言处理

15+阅读 · 2020年6月15日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

LibRec 精选：推荐的可解释性[综述]

LibRec 精选：推荐的可解释性[综述]

LibRec智能推荐

10+阅读 · 2018年5月4日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

中国田鼠亚科 Microtini族(Rodentia: Cricetidae: Arvicolinae)的分类与系统发育研究

国家自然科学基金

0+阅读 · 2014年12月31日

黎曼流形上椭圆算子的谱估计

国家自然科学基金

0+阅读 · 2013年12月31日

用于光电化学电池的一维Si纳米结构复合光电极研究

国家自然科学基金

0+阅读 · 2013年12月31日

中国淡水异极藻科（ Gomphonemaceae）植物的分类学研究

国家自然科学基金

0+阅读 · 2012年12月31日

苯并二噻吩-吡咯并吡咯二酮D-A型聚合物太阳能电池研究

国家自然科学基金

0+阅读 · 2012年12月31日

人参新的mlncRNA基因HTAR在高温胁迫响应中的调控作用研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于FBAR的紫外和红外光传感器的研究

国家自然科学基金

0+阅读 · 2011年12月31日

Cf/SiC复合材料与钛合金复合扩散钎焊动力学与界面反应研究

国家自然科学基金

0+阅读 · 2011年12月31日

UGT基因簇进化及调控研究

国家自然科学基金

0+阅读 · 2009年12月31日

噻二唑类金属配合物的合成、表征及电致发光性能研究

国家自然科学基金

0+阅读 · 2008年12月31日

Deep reinforcement learning for the olfactory search POMDP: a quantitative benchmark

Arxiv

0+阅读 · 2023年3月20日

Minimizing Human Assistance: Augmenting a Single Demonstration for Deep Reinforcement Learning

Arxiv

0+阅读 · 2023年3月19日

Distributional Reinforcement Learning with Unconstrained Monotonic Neural Networks

Arxiv

0+阅读 · 2023年3月17日

A Survey on Causal Reinforcement Learning

Arxiv

29+阅读 · 2023年2月10日

A Survey of Meta-Reinforcement Learning

Arxiv

12+阅读 · 2023年1月19日

Pretraining in Deep Reinforcement Learning: A Survey

Arxiv

21+阅读 · 2022年11月8日

Reinforcement Learning on Graph: A Survey

Arxiv

67+阅读 · 2022年4月13日

Deep learning in agriculture: A survey

Arxiv

11+阅读 · 2018年7月31日

A Multi-Objective Deep Reinforcement Learning Framework

A Multi-Objective Deep Reinforcement Learning Framework

Arxiv

16+阅读 · 2018年6月27日

Deep Reinforcement Learning for List-wise Recommendations

Arxiv

13+阅读 · 2018年1月5日

VIP会员

文章信息

相关主题

相关VIP内容

【MIla】一种意识启发规划的基于模型强化学习，A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

【MIla】一种意识启发规划的基于模型强化学习，A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

专知会员服务

23+阅读 · 2022年3月19日

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

专知会员服务

104+阅读 · 2022年2月10日

【如何做研究】How to research ，22页ppt

【如何做研究】How to research ，22页ppt

专知会员服务

112+阅读 · 2021年4月17日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【CMU博士论文】以人为中心的强化学习

任务规划与地形分析：现代复杂环境作战导航体系

认知优势：人工智能在国家安全决策中的核心作用

大模型赋能的具身智能：决策与具身学习综述

相关资讯

Multi-Task Learning的几篇综述文章

Multi-Task Learning的几篇综述文章

深度学习自然语言处理

15+阅读 · 2020年6月15日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

LibRec 精选：推荐的可解释性[综述]

LibRec 精选：推荐的可解释性[综述]

LibRec智能推荐

10+阅读 · 2018年5月4日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

Deep reinforcement learning for the olfactory search POMDP: a quantitative benchmark

Arxiv

0+阅读 · 2023年3月20日

Minimizing Human Assistance: Augmenting a Single Demonstration for Deep Reinforcement Learning

Arxiv

0+阅读 · 2023年3月19日

Distributional Reinforcement Learning with Unconstrained Monotonic Neural Networks

Arxiv

0+阅读 · 2023年3月17日

A Survey on Causal Reinforcement Learning

Arxiv

29+阅读 · 2023年2月10日

A Survey of Meta-Reinforcement Learning

Arxiv

12+阅读 · 2023年1月19日

Pretraining in Deep Reinforcement Learning: A Survey

Arxiv

21+阅读 · 2022年11月8日

Reinforcement Learning on Graph: A Survey

Arxiv

67+阅读 · 2022年4月13日

Deep learning in agriculture: A survey

Arxiv

11+阅读 · 2018年7月31日

A Multi-Objective Deep Reinforcement Learning Framework

A Multi-Objective Deep Reinforcement Learning Framework

Arxiv

16+阅读 · 2018年6月27日

Deep Reinforcement Learning for List-wise Recommendations

Arxiv

13+阅读 · 2018年1月5日

相关基金

中国田鼠亚科 Microtini族(Rodentia: Cricetidae: Arvicolinae)的分类与系统发育研究

国家自然科学基金

0+阅读 · 2014年12月31日

黎曼流形上椭圆算子的谱估计

国家自然科学基金

0+阅读 · 2013年12月31日

用于光电化学电池的一维Si纳米结构复合光电极研究

国家自然科学基金

0+阅读 · 2013年12月31日

中国淡水异极藻科（ Gomphonemaceae）植物的分类学研究

国家自然科学基金

0+阅读 · 2012年12月31日

苯并二噻吩-吡咯并吡咯二酮D-A型聚合物太阳能电池研究

国家自然科学基金

0+阅读 · 2012年12月31日

人参新的mlncRNA基因HTAR在高温胁迫响应中的调控作用研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于FBAR的紫外和红外光传感器的研究

国家自然科学基金

0+阅读 · 2011年12月31日

Cf/SiC复合材料与钛合金复合扩散钎焊动力学与界面反应研究

国家自然科学基金

0+阅读 · 2011年12月31日

UGT基因簇进化及调控研究

国家自然科学基金

0+阅读 · 2009年12月31日

噻二唑类金属配合物的合成、表征及电致发光性能研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员