This paper explores the use of reinforcement learning (RL) models for autonomous racing. In contrast to passenger cars, where safety is the top priority, a racing car aims to minimize the lap time. We frame the problem as a reinforcement learning task with a multidimensional input consisting of vehicle telemetry and a continuous action space. To determine which RL methods solve the problem best and whether the obtained models generalize to driving on unknown tracks, we put 10 variants of deep deterministic policy gradient (DDPG) to race in two experiments: i)~studying how RL methods learn to drive a racing car and ii)~studying how the learning scenario influences the capability of the models to generalize. Our studies show that models trained with RL are not only able to drive faster than the baseline open-source handcrafted bots but also generalize to unknown tracks.
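The framing above maps naturally onto the standard actor-critic structure of DDPG: a deterministic actor maps the telemetry vector to continuous controls, and a critic scores state-action pairs. The following is a minimal sketch, assuming PyTorch; the dimensions, network sizes, and names (OBS_DIM, ACT_DIM, ddpg_update) are illustrative placeholders rather than the paper's actual architecture, and the replay buffer, target networks, and exploration noise of full DDPG are omitted for brevity.

```python
# Illustrative DDPG sketch (not the paper's implementation): a deterministic
# actor maps telemetry to continuous controls; a critic estimates Q(s, a).
# OBS_DIM and ACT_DIM are hypothetical placeholders.
import torch
import torch.nn as nn

OBS_DIM = 29   # hypothetical telemetry vector size (speed, track sensors, ...)
ACT_DIM = 3    # hypothetical controls: steering, throttle, brake

class Actor(nn.Module):
    """Deterministic policy mu(s): telemetry -> continuous action in [-1, 1]."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(OBS_DIM, 256), nn.ReLU(),
            nn.Linear(256, 256), nn.ReLU(),
            nn.Linear(256, ACT_DIM), nn.Tanh(),
        )

    def forward(self, obs):
        return self.net(obs)

class Critic(nn.Module):
    """Action-value function Q(s, a) used for the deterministic policy gradient."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(OBS_DIM + ACT_DIM, 256), nn.ReLU(),
            nn.Linear(256, 256), nn.ReLU(),
            nn.Linear(256, 1),
        )

    def forward(self, obs, act):
        return self.net(torch.cat([obs, act], dim=-1))

def ddpg_update(actor, critic, actor_opt, critic_opt, batch, gamma=0.99):
    """One DDPG update on a sampled batch of (B, dim)-shaped tensors.

    A full implementation would use target networks for the bootstrap term
    and soft-update them after each step; both are omitted here.
    """
    obs, act, rew, next_obs, done = batch
    with torch.no_grad():
        target_q = rew + gamma * (1 - done) * critic(next_obs, actor(next_obs))
    critic_loss = nn.functional.mse_loss(critic(obs, act), target_q)
    critic_opt.zero_grad(); critic_loss.backward(); critic_opt.step()

    # Deterministic policy gradient: ascend Q(s, mu(s)) w.r.t. actor parameters.
    actor_loss = -critic(obs, actor(obs)).mean()
    actor_opt.zero_grad(); actor_loss.backward(); actor_opt.step()
```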