Traffic simulation has gained considerable interest for the quantitative evaluation of self-driving vehicle performance. For a simulator to be a valuable test bench, the driving policy animating each traffic agent in the scene must act as a human would while maintaining minimal safety guarantees. Learning the driving policies of traffic agents from recorded human driving data or through reinforcement learning is an attractive approach for generating realistic, highly interactive traffic situations at uncontrolled intersections and roundabouts. In this work, we show that a trade-off exists between imitating human driving and maintaining safety when learning driving policies. We demonstrate this by comparing how various imitation learning and reinforcement learning algorithms perform on the driving task. We also propose a multi-objective learning algorithm (MOPPO) that improves both objectives together. We evaluate how human-like our driving policies behave on highly interactive driving scenarios extracted from the INTERACTION dataset.