OpenAI Gym / ALE 高速公路环境的甲骨文和观察 (An Oracle and Observations for the OpenAI Gym / ALE Freeway Environment) - 专知论文

会员服务 ·

0

Oracle · OpenAI · 回合 · Better · 控制器 ·

2021 年 9 月 2 日

An Oracle and Observations for the OpenAI Gym / ALE Freeway Environment

翻译：OpenAI Gym / ALE 高速公路环境的甲骨文和观察

James S. Plank,Catherine D. Schuman,Robert M. Patton

The OpenAI Gym project contains hundreds of control problems whose goal is to provide a testbed for reinforcement learning algorithms. One such problem is Freeway-ram-v0, where the observations presented to the agent are 128 bytes of RAM. While the goals of the project are for non-expert AI agents to solve the control problems with general training, in this work, we seek to learn more about the problem, so that we can better evaluate solutions. In particular, we develop on oracle to play the game, so that we may have baselines for success. We present details of the oracle, plus optimal game-playing situations that can be used for training and testing AI agents.

翻译：OpenAI Gym项目包含数以百计的控制问题,目标是为强化学习算法提供一个测试台。其中一个问题是Freiway-ram-v0, 向代理提供的观测结果为 RAM 128 字节。虽然该项目的目标是让非专家AI 代理人员通过一般性培训解决控制问题,但我们在这项工作中寻求更多地了解问题,以便我们更好地评估解决方案。特别是,我们开发游戏的奥克莱,以便我们可以有成功的基准。我们介绍了神器的细节,以及可用于培训和测试AI 代理的游戏场景。

0

相关内容

Oracle

甲骨文公司，全称甲骨文股份有限公司(甲骨文软件系统有限公司)，是全球最大的企业级软件公司，总部位于美国加利福尼亚州的红木滩。1989年正式进入中国市场。2013年，甲骨文已超越 IBM ，成为继 Microsoft 后全球第二大软件公司。

MIT经典《线性代数》，584页pdf，Introduction to Linear Algebra, Fifth Edition, Gilbert Strang, 2016.

MIT经典《线性代数》，584页pdf，Introduction to Linear Algebra, Fifth Edition, Gilbert Strang, 2016.

专知会员服务

432+阅读 · 2021年1月11日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

动物脑的好奇心和强化学习的好奇心

动物脑的好奇心和强化学习的好奇心

CreateAMind

10+阅读 · 2019年1月26日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

分布式TensorFlow入门指南

分布式TensorFlow入门指南

机器学习研究会

4+阅读 · 2017年11月28日

Ranking Policy Decisions

Arxiv

0+阅读 · 2021年10月26日

PettingZoo: Gym for Multi-Agent Reinforcement Learning

Arxiv

0+阅读 · 2021年10月26日

Unraveling the hidden environmental impacts of AI solutions for environment

Unraveling the hidden environmental impacts of AI solutions for environment

Arxiv

1+阅读 · 2021年10月22日

The Critique of Crowds: Using Collective Criticism to Crowdsource Subjective Preferences

Arxiv

0+阅读 · 2021年10月22日

The StarCraft Multi-Agent Challenge

The StarCraft Multi-Agent Challenge

Arxiv

3+阅读 · 2019年2月11日

VIP会员

文章信息

相关主题

相关VIP内容

MIT经典《线性代数》，584页pdf，Introduction to Linear Algebra, Fifth Edition, Gilbert Strang, 2016.

MIT经典《线性代数》，584页pdf，Introduction to Linear Algebra, Fifth Edition, Gilbert Strang, 2016.

专知会员服务

432+阅读 · 2021年1月11日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《利用人工智能对军事行动进行建模》

《利用人工智能学习、优化与推演美国海军作战部队的战略布局与分散（续文）》

机器人、无人机与实时影像：应对城市爆炸威胁的三大技术方案

《指挥官意图消息中关键概念自动提取》最新47页

相关资讯

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

动物脑的好奇心和强化学习的好奇心

动物脑的好奇心和强化学习的好奇心

CreateAMind

10+阅读 · 2019年1月26日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

分布式TensorFlow入门指南

分布式TensorFlow入门指南

机器学习研究会

4+阅读 · 2017年11月28日

相关论文

Ranking Policy Decisions

Arxiv

0+阅读 · 2021年10月26日

PettingZoo: Gym for Multi-Agent Reinforcement Learning

Arxiv

0+阅读 · 2021年10月26日

Unraveling the hidden environmental impacts of AI solutions for environment

Unraveling the hidden environmental impacts of AI solutions for environment

Arxiv

1+阅读 · 2021年10月22日

The Critique of Crowds: Using Collective Criticism to Crowdsource Subjective Preferences

Arxiv

0+阅读 · 2021年10月22日

The StarCraft Multi-Agent Challenge

The StarCraft Multi-Agent Challenge

Arxiv

3+阅读 · 2019年2月11日

微信扫码咨询专知VIP会员