利用深强化学习和模拟转让解决火箭联盟 (On the Verge of Solving Rocket League using Deep Reinforcement Learning and Sim-to-sim Transfer) - 专知论文

会员服务 ·

0

学成 · 深度强化学习 · The Verge · 强化学习 · 目标领域 ·

2022 年 5 月 24 日

On the Verge of Solving Rocket League using Deep Reinforcement Learning and Sim-to-sim Transfer

翻译：利用深强化学习和模拟转让解决火箭联盟

Marco Pleines,Konstantin Ramthun,Yannik Wegener,Hendrik Meyer,Matthias Pallasch,Sebastian Prior,Jannik Drögemüller,Leon Büttinghaus,Thilo Röthemeyer,Alexander Kaschwig,Oliver Chmurzynski,Frederik Rohkrähmer,Roman Kalkreuth,Frank Zimmer,Mike Preuss

from arxiv, Accepted at IEEE Conference on Games 2022, 8 pages, 6 figures, 3 tables

Autonomously trained agents that are supposed to play video games reasonably well rely either on fast simulation speeds or heavy parallelization across thousands of machines running concurrently. This work explores a third way that is established in robotics, namely sim-to-real transfer, or if the game is considered a simulation itself, sim-to-sim transfer. In the case of Rocket League, we demonstrate that single behaviors of goalies and strikers can be successfully learned using Deep Reinforcement Learning in the simulation environment and transferred back to the original game. Although the implemented training simulation is to some extent inaccurate, the goalkeeping agent saves nearly 100% of its faced shots once transferred, while the striking agent scores in about 75% of cases. Therefore, the trained agent is robust enough and able to generalize to the target domain of Rocket League.

翻译：自动培训的代理人本应玩电子游戏,他们应该合理地完全依赖快速模拟速度或对同时运行的数千台机器的高度平行。这项工作探索了机器人所建立的第三个方法,即模拟到现实的转移,或者如果游戏本身被视为模拟,即模拟到现实的转移。在火箭联盟的情况下,我们证明在模拟环境中利用深强化学习可以成功学习守门员和罢工者的单项行为,然后将其转回原游戏。尽管实施的培训模拟在某种程度上不准确,但目标维护代理人在转移后可以省下近100%的瞄准镜头,而打击代理人则在大约75%的案例中得分。因此,训练有素的代理人足够强大,能够推广到火箭联盟的目标领域。

0

相关内容

【DeepMind】强化学习教程，83页ppt

【DeepMind】强化学习教程，83页ppt

专知会员服务

158+阅读 · 2020年8月7日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

基于SEM全模型和类ROC曲线法对亚健康状态“单证”量化诊断的方法学研究

国家自然科学基金

0+阅读 · 2013年12月31日

STK11基因第XI功能区突变激活p38在Peutz-Jeghers综合征胃肠道息肉恶变中的作用机制

国家自然科学基金

0+阅读 · 2013年12月31日

三氧化二砷降解HER2蛋白的机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

Periostin在前列腺癌侵袭转移中的作用及机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

CDC73基因异常在颌骨骨化纤维瘤发病中的作用

国家自然科学基金

0+阅读 · 2012年12月31日

鸡毒支原体感染相关miRNAs鉴定及其分子调控机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

CDC73基因异常对颌骨骨化纤维瘤发病的作用

国家自然科学基金

0+阅读 · 2011年12月31日

Navier-Stokes方程稳定化有限元方法后验误差估计

国家自然科学基金

0+阅读 · 2011年12月31日

重元素掺杂对层状钙钛矿型结构氧化物Srm+1TimO3m+1(m=1,2)热电性质的影响及机理研究

国家自然科学基金

0+阅读 · 2009年12月31日

TGF-β28608;活Myocardin家族诱导骨髓间充质干细胞分化的研究

国家自然科学基金

0+阅读 · 2008年12月31日

Offline Reinforcement Learning for Road Traffic Control

Arxiv

0+阅读 · 2022年7月11日

Mechanisms that Incentivize Data Sharing in Federated Learning

Arxiv

0+阅读 · 2022年7月10日

Optimizing Bipedal Maneuvers of Single Rigid-Body Models for Reinforcement Learning

Arxiv

0+阅读 · 2022年7月9日

How to Leverage Unlabeled Data in Offline Reinforcement Learning

How to Leverage Unlabeled Data in Offline Reinforcement Learning

Arxiv

0+阅读 · 2022年7月8日

High Performance Simulation for Scalable Multi-Agent Reinforcement Learning

Arxiv

0+阅读 · 2022年7月8日

Learning from Guided Play: A Scheduled Hierarchical Approach for Improving Exploration in Adversarial Imitation Learning

Arxiv

0+阅读 · 2022年7月8日

Reinforcement Learning based Air Combat Maneuver Generation

Reinforcement Learning based Air Combat Maneuver Generation

Arxiv

91+阅读 · 2022年1月14日

The Principles of Deep Learning Theory

Arxiv

66+阅读 · 2021年6月18日

Transfer Learning in Deep Reinforcement Learning: A Survey

Transfer Learning in Deep Reinforcement Learning: A Survey

Arxiv

23+阅读 · 2020年9月16日

A Comprehensive Survey on Transfer Learning

A Comprehensive Survey on Transfer Learning

Arxiv

121+阅读 · 2019年11月7日

VIP会员

文章信息

相关主题

深度强化学习

相关VIP内容

【DeepMind】强化学习教程，83页ppt

【DeepMind】强化学习教程，83页ppt

专知会员服务

158+阅读 · 2020年8月7日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【牛津大学博士论文】将序列结构与几何结构融入深度神经网络

工程视角：影响战争进程的小型无人机

企业级AI应用开发：从技术选型到生产落地

AI生成代码缺陷综述

相关资讯

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

Offline Reinforcement Learning for Road Traffic Control

Arxiv

0+阅读 · 2022年7月11日

Mechanisms that Incentivize Data Sharing in Federated Learning

Arxiv

0+阅读 · 2022年7月10日

Optimizing Bipedal Maneuvers of Single Rigid-Body Models for Reinforcement Learning

Arxiv

0+阅读 · 2022年7月9日

How to Leverage Unlabeled Data in Offline Reinforcement Learning

How to Leverage Unlabeled Data in Offline Reinforcement Learning

Arxiv

0+阅读 · 2022年7月8日

High Performance Simulation for Scalable Multi-Agent Reinforcement Learning

Arxiv

0+阅读 · 2022年7月8日

Learning from Guided Play: A Scheduled Hierarchical Approach for Improving Exploration in Adversarial Imitation Learning

Arxiv

0+阅读 · 2022年7月8日

Reinforcement Learning based Air Combat Maneuver Generation

Reinforcement Learning based Air Combat Maneuver Generation

Arxiv

91+阅读 · 2022年1月14日

The Principles of Deep Learning Theory

Arxiv

66+阅读 · 2021年6月18日

Transfer Learning in Deep Reinforcement Learning: A Survey

Transfer Learning in Deep Reinforcement Learning: A Survey

Arxiv

23+阅读 · 2020年9月16日

A Comprehensive Survey on Transfer Learning

A Comprehensive Survey on Transfer Learning

Arxiv

121+阅读 · 2019年11月7日

相关基金

基于SEM全模型和类ROC曲线法对亚健康状态“单证”量化诊断的方法学研究

国家自然科学基金

0+阅读 · 2013年12月31日

STK11基因第XI功能区突变激活p38在Peutz-Jeghers综合征胃肠道息肉恶变中的作用机制

国家自然科学基金

0+阅读 · 2013年12月31日

三氧化二砷降解HER2蛋白的机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

Periostin在前列腺癌侵袭转移中的作用及机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

CDC73基因异常在颌骨骨化纤维瘤发病中的作用

国家自然科学基金

0+阅读 · 2012年12月31日

鸡毒支原体感染相关miRNAs鉴定及其分子调控机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

CDC73基因异常对颌骨骨化纤维瘤发病的作用

国家自然科学基金

0+阅读 · 2011年12月31日

Navier-Stokes方程稳定化有限元方法后验误差估计

国家自然科学基金

0+阅读 · 2011年12月31日

重元素掺杂对层状钙钛矿型结构氧化物Srm+1TimO3m+1(m=1,2)热电性质的影响及机理研究

国家自然科学基金

0+阅读 · 2009年12月31日

TGF-β28608;活Myocardin家族诱导骨髓间充质干细胞分化的研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员