Investigating the Properties of Neural Network Representations in Reinforcement Learning - 专知论文

会员服务 ·

0

Learning · 表示 · Agent · 强化学习 · Better ·

2023 年 5 月 5 日

Investigating the Properties of Neural Network Representations in Reinforcement Learning

翻译：暂无翻译

Han Wang,Erfan Miahi,Martha White,Marlos C. Machado,Zaheer Abbas,Raksha Kumaraswamy,Vincent Liu,Adam White

In this paper we investigate the properties of representations learned by deep reinforcement learning systems. Much of the early work on representations for reinforcement learning focused on designing fixed-basis architectures to achieve properties thought to be desirable, such as orthogonality and sparsity. In contrast, the idea behind deep reinforcement learning methods is that the agent designer should not encode representational properties, but rather that the data stream should determine the properties of the representation -- good representations emerge under appropriate training schemes. In this paper we bring these two perspectives together, empirically investigating the properties of representations that support transfer in reinforcement learning. We introduce and measure six representational properties over more than 25 thousand agent-task settings. We consider Deep Q-learning agents with different auxiliary losses in a pixel-based navigation environment, with source and transfer tasks corresponding to different goal locations. We develop a method to better understand why some representations work better for transfer, through a systematic approach varying task similarity and measuring and correlating representation properties with transfer performance. We demonstrate the generality of the methodology by investigating representations learned by a Rainbow agent that successfully transfer across games modes in Atari 2600.

翻译：暂无翻译

0

相关内容

Learning

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Reinforcement Learning: An Introduction 2018第二版 500页

Reinforcement Learning: An Introduction 2018第二版 500页

CreateAMind

14+阅读 · 2018年4月27日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

石墨烯增强铝基复合材料的仿生制备与界面行为研究

国家自然科学基金

0+阅读 · 2015年12月31日

“共核”法制备高Ms锌铁氧体纳米颗粒的形成机理

国家自然科学基金

0+阅读 · 2015年12月31日

通过grafting from技术修饰纳米粒子的计算机模拟研究

国家自然科学基金

0+阅读 · 2013年12月31日

硫化物/石墨烯纳米复合材料的室温固相合成及光催化性能

国家自然科学基金

0+阅读 · 2012年12月31日

MEMS中的纳米三体磨料磨损

国家自然科学基金

0+阅读 · 2012年12月31日

DNA介导钌配合物包覆纳米颗粒的新型光电复合材料研究

国家自然科学基金

0+阅读 · 2012年12月31日

纳米BaTiO3/BaFe12O19复合材料的铁电、磁介电及磁电耦合性能研究

国家自然科学基金

0+阅读 · 2012年12月31日

去酰基化ghrelin改善脂肪组织炎症所致胰岛素抵抗的机制- - 调节性T细胞的作用

国家自然科学基金

0+阅读 · 2011年12月31日

石墨烯/炭基纳米复合材料制备及其摩擦学研究

国家自然科学基金

0+阅读 · 2009年12月31日

携Herceptin和紫杉醇纳米造影剂乳腺癌分子显影和新辅助化疗研究

国家自然科学基金

0+阅读 · 2009年12月31日

Sim-to-real transfer of active suspension control using deep reinforcement learning

Arxiv

0+阅读 · 2023年6月21日

Variational Sequential Optimal Experimental Design using Reinforcement Learning

Arxiv

0+阅读 · 2023年6月17日

Bootstrapped Representations in Reinforcement Learning

Arxiv

0+阅读 · 2023年6月16日

A Survey on Transformers in Reinforcement Learning

Arxiv

31+阅读 · 2023年1月8日

The Principles of Deep Learning Theory

Arxiv

66+阅读 · 2021年6月18日

Scaling Properties of Deep Residual Networks

Arxiv

13+阅读 · 2021年5月25日

Transfer Learning in Deep Reinforcement Learning: A Survey

Transfer Learning in Deep Reinforcement Learning: A Survey

Arxiv

23+阅读 · 2020年9月16日

A Comparative Study for Unsupervised Network Representation Learning

Arxiv

24+阅读 · 2020年3月11日

Deep Reinforcement Learning: An Overview

Arxiv

15+阅读 · 2018年6月23日

Rethinking Knowledge Graph Propagation for Zero-Shot Learning

Arxiv

17+阅读 · 2018年5月31日

VIP会员

文章信息

相关主题

相关VIP内容

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

大语言模型智能体强化学习：全景综述

《城市滨海地区：理解复杂多变环境下的指挥控制框架》50页报告

【伯克利博士论文】从推理服务到训练：面向大规模 LLM 智能体的高效系统

美空军“顶点2025”实验：推进AI在C2、动态目标锁定与联盟集成中的应用

相关资讯

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Reinforcement Learning: An Introduction 2018第二版 500页

Reinforcement Learning: An Introduction 2018第二版 500页

CreateAMind

14+阅读 · 2018年4月27日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

相关论文

Sim-to-real transfer of active suspension control using deep reinforcement learning

Arxiv

0+阅读 · 2023年6月21日

Variational Sequential Optimal Experimental Design using Reinforcement Learning

Arxiv

0+阅读 · 2023年6月17日

Bootstrapped Representations in Reinforcement Learning

Arxiv

0+阅读 · 2023年6月16日

A Survey on Transformers in Reinforcement Learning

Arxiv

31+阅读 · 2023年1月8日

The Principles of Deep Learning Theory

Arxiv

66+阅读 · 2021年6月18日

Scaling Properties of Deep Residual Networks

Arxiv

13+阅读 · 2021年5月25日

Transfer Learning in Deep Reinforcement Learning: A Survey

Transfer Learning in Deep Reinforcement Learning: A Survey

Arxiv

23+阅读 · 2020年9月16日

A Comparative Study for Unsupervised Network Representation Learning

Arxiv

24+阅读 · 2020年3月11日

Deep Reinforcement Learning: An Overview

Arxiv

15+阅读 · 2018年6月23日

Rethinking Knowledge Graph Propagation for Zero-Shot Learning

Arxiv

17+阅读 · 2018年5月31日

相关基金

石墨烯增强铝基复合材料的仿生制备与界面行为研究

国家自然科学基金

0+阅读 · 2015年12月31日

“共核”法制备高Ms锌铁氧体纳米颗粒的形成机理

国家自然科学基金

0+阅读 · 2015年12月31日

通过grafting from技术修饰纳米粒子的计算机模拟研究

国家自然科学基金

0+阅读 · 2013年12月31日

硫化物/石墨烯纳米复合材料的室温固相合成及光催化性能

国家自然科学基金

0+阅读 · 2012年12月31日

MEMS中的纳米三体磨料磨损

国家自然科学基金

0+阅读 · 2012年12月31日

DNA介导钌配合物包覆纳米颗粒的新型光电复合材料研究

国家自然科学基金

0+阅读 · 2012年12月31日

纳米BaTiO3/BaFe12O19复合材料的铁电、磁介电及磁电耦合性能研究

国家自然科学基金

0+阅读 · 2012年12月31日

去酰基化ghrelin改善脂肪组织炎症所致胰岛素抵抗的机制- - 调节性T细胞的作用

国家自然科学基金

0+阅读 · 2011年12月31日

石墨烯/炭基纳米复合材料制备及其摩擦学研究

国家自然科学基金

0+阅读 · 2009年12月31日

携Herceptin和紫杉醇纳米造影剂乳腺癌分子显影和新辅助化疗研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员