改进通过前箱剂加强学习的普及化 (Improving generalization in reinforcement learning through forked agents) - 专知论文

会员服务 ·

0

Agent · 泛化理论 · Learning · 强化学习 · 回合 ·

2022 年 12 月 29 日

Improving generalization in reinforcement learning through forked agents

翻译：改进通过前箱剂加强学习的普及化

Olivier Moulin,Vincent Francois-Lavet,Mark Hoogendoorn

from arxiv, 12 pages

An eco-system of agents each having their own policy with some, but limited, generalizability has proven to be a reliable approach to increase generalization across procedurally generated environments. In such an approach, new agents are regularly added to the eco-system when encountering a new environment that is outside of the scope of the eco-system. The speed of adaptation and general effectiveness of the eco-system approach highly depends on the initialization of new agents. In this paper we propose different initialization techniques, inspired from Deep Neural Network initialization and transfer learning, and study their impact.

翻译：实践证明,一个由每个机构各自制定政策的生态系统,其政策具有某些但有限的通用性,是在整个程序产生的环境中增加普遍化的可靠方法。在这种方法中,在遇到生态系统范围以外的新环境时,经常在生态系统中增加新的代理。生态系统方法的适应速度和一般效力在很大程度上取决于新代理的初始化。在本文件中,我们提出了从深神经网络初始化和转移学习中得到启发的不同初始化技术,并研究其影响。

0

相关内容

Agent

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

【强化学习资源集合】Awesome Reinforcement Learning

【强化学习资源集合】Awesome Reinforcement Learning

专知会员服务

97+阅读 · 2019年12月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Reinforcement Learning: An Introduction 2018第二版 500页

Reinforcement Learning: An Introduction 2018第二版 500页

CreateAMind

14+阅读 · 2018年4月27日

【论文推荐】最新六篇生成式对抗网络（GAN）相关论文—半监督学习、对偶、交互生成对抗网络、激活、纳什均衡、tempoGAN

【论文推荐】最新六篇生成式对抗网络（GAN）相关论文—半监督学习、对偶、交互生成对抗网络、激活、纳什均衡、tempoGAN

专知

23+阅读 · 2018年2月23日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

基于自主学习的Ad hoc Agent序贯决策研究

国家自然科学基金

45+阅读 · 2015年12月31日

Blimp-1对小鼠allo-HSCT后GVHD发病的调控作用和机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于分子影像可视化动物模型对TRAF4调控鼻咽癌放疗抵抗分子机制的研究

国家自然科学基金

0+阅读 · 2013年12月31日

双功能磁性介孔生物活性玻璃微球的研究

国家自然科学基金

0+阅读 · 2013年12月31日

有序钙钛矿型铁电材料的构建及光电性能研究

国家自然科学基金

0+阅读 · 2013年12月31日

低氧对线粒体铁蛋白表达的调控及其机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

IMPDH为靶点的小分子抑制剂的设计、合成及活性研究

国家自然科学基金

0+阅读 · 2012年12月31日

环糊精酯化交联天然生物质材料强化吸附水中典型HOCs的机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

STC-1在奶牛钙磷调控中的作用与分子机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

动态多智能体协同进化约束优化模型与算法研究

国家自然科学基金

4+阅读 · 2008年12月31日

Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning

Arxiv

0+阅读 · 2023年2月28日

Graph Reinforcement Learning for Operator Selection in the ALNS Metaheuristic

Arxiv

0+阅读 · 2023年2月28日

The In-Sample Softmax for Offline Reinforcement Learning

Arxiv

0+阅读 · 2023年2月28日

On the Role of Emergent Communication for Social Learning in Multi-Agent Reinforcement Learning

Arxiv

0+阅读 · 2023年2月28日

A Reinforcement Learning Approach for Scheduling Problems With Improved Generalization Through Order Swapping

Arxiv

0+阅读 · 2023年2月27日

Improving Quantal Cognitive Hierarchy Model Through Iterative Population Learning

Arxiv

0+阅读 · 2023年2月25日

Improving the Data Efficiency of Multi-Objective Quality-Diversity through Gradient Assistance and Crowding Exploration

Arxiv

0+阅读 · 2023年2月24日

Emergent Bartering Behaviour in Multi-Agent Reinforcement Learning

Emergent Bartering Behaviour in Multi-Agent Reinforcement Learning

Arxiv

19+阅读 · 2022年5月13日

Reinforcement Learning on Graph: A Survey

Arxiv

67+阅读 · 2022年4月13日

Transfer Learning in Deep Reinforcement Learning: A Survey

Transfer Learning in Deep Reinforcement Learning: A Survey

Arxiv

23+阅读 · 2020年9月16日

VIP会员

文章信息

相关主题

相关VIP内容

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

【强化学习资源集合】Awesome Reinforcement Learning

【强化学习资源集合】Awesome Reinforcement Learning

专知会员服务

97+阅读 · 2019年12月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【博士论文】在低维和高维空间中分析、建模和转换潜在表征

从无人机到数据：揭示边缘计算作为新作战域

可解释人工智能的基础

大规模视觉模型中的基于提示的适应：综述

相关资讯

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Reinforcement Learning: An Introduction 2018第二版 500页

Reinforcement Learning: An Introduction 2018第二版 500页

CreateAMind

14+阅读 · 2018年4月27日

【论文推荐】最新六篇生成式对抗网络（GAN）相关论文—半监督学习、对偶、交互生成对抗网络、激活、纳什均衡、tempoGAN

【论文推荐】最新六篇生成式对抗网络（GAN）相关论文—半监督学习、对偶、交互生成对抗网络、激活、纳什均衡、tempoGAN

专知

23+阅读 · 2018年2月23日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning

Arxiv

0+阅读 · 2023年2月28日

Graph Reinforcement Learning for Operator Selection in the ALNS Metaheuristic

Arxiv

0+阅读 · 2023年2月28日

The In-Sample Softmax for Offline Reinforcement Learning

Arxiv

0+阅读 · 2023年2月28日

On the Role of Emergent Communication for Social Learning in Multi-Agent Reinforcement Learning

Arxiv

0+阅读 · 2023年2月28日

A Reinforcement Learning Approach for Scheduling Problems With Improved Generalization Through Order Swapping

Arxiv

0+阅读 · 2023年2月27日

Improving Quantal Cognitive Hierarchy Model Through Iterative Population Learning

Arxiv

0+阅读 · 2023年2月25日

Improving the Data Efficiency of Multi-Objective Quality-Diversity through Gradient Assistance and Crowding Exploration

Arxiv

0+阅读 · 2023年2月24日

Emergent Bartering Behaviour in Multi-Agent Reinforcement Learning

Emergent Bartering Behaviour in Multi-Agent Reinforcement Learning

Arxiv

19+阅读 · 2022年5月13日

Reinforcement Learning on Graph: A Survey

Arxiv

67+阅读 · 2022年4月13日

Transfer Learning in Deep Reinforcement Learning: A Survey

Transfer Learning in Deep Reinforcement Learning: A Survey

Arxiv

23+阅读 · 2020年9月16日

相关基金

基于自主学习的Ad hoc Agent序贯决策研究

国家自然科学基金

45+阅读 · 2015年12月31日

Blimp-1对小鼠allo-HSCT后GVHD发病的调控作用和机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于分子影像可视化动物模型对TRAF4调控鼻咽癌放疗抵抗分子机制的研究

国家自然科学基金

0+阅读 · 2013年12月31日

双功能磁性介孔生物活性玻璃微球的研究

国家自然科学基金

0+阅读 · 2013年12月31日

有序钙钛矿型铁电材料的构建及光电性能研究

国家自然科学基金

0+阅读 · 2013年12月31日

低氧对线粒体铁蛋白表达的调控及其机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

IMPDH为靶点的小分子抑制剂的设计、合成及活性研究

国家自然科学基金

0+阅读 · 2012年12月31日

环糊精酯化交联天然生物质材料强化吸附水中典型HOCs的机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

STC-1在奶牛钙磷调控中的作用与分子机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

动态多智能体协同进化约束优化模型与算法研究

国家自然科学基金

4+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员