With the adoption of autonomous vehicles on our roads, we will witness a mixed-autonomy environment where autonomous and human-driven vehicles must learn to coexist by sharing the same road infrastructure. To attain socially desirable behaviors, autonomous vehicles must be instructed to consider the utility of other vehicles around them in their decision-making process. In particular, we study the maneuver planning problem for autonomous vehicles and investigate how a decentralized reward structure can induce altruism in their behavior and incentivize them to account for the interests of other autonomous and human-driven vehicles. This is a challenging problem due to the ambiguity of a human driver's willingness to cooperate with an autonomous vehicle. Thus, in contrast to existing works that rely on behavior models of human drivers, we take an end-to-end approach and let the autonomous agents implicitly learn the decision-making process of human drivers from experience alone. We introduce a multi-agent variant of the synchronous Advantage Actor-Critic (A2C) algorithm and train agents that coordinate with each other and can affect the behavior of human drivers to improve traffic flow and safety.
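The abstract does not spell out the decentralized reward itself; as a rough illustration only, one common way to induce altruism in this line of work is a social-value-orientation-style blend of an agent's own utility with the utilities of surrounding vehicles. The sketch below is a hypothetical example under that assumption: the function name `altruistic_reward`, the angle parameter `svo_angle`, and the uniform averaging over nearby vehicles are illustrative choices, not the authors' exact formulation.

```python
import numpy as np

def altruistic_reward(ego_reward, other_rewards, svo_angle=np.pi / 6):
    """Blend an agent's own utility with the mean utility of the
    vehicles around it (autonomous and human-driven alike).

    svo_angle = 0 recovers a purely egoistic agent; angles closer to
    pi/2 weight the social term more heavily. The default angle and
    the uniform averaging are assumptions made for illustration.
    """
    social_term = np.mean(other_rewards) if len(other_rewards) > 0 else 0.0
    return np.cos(svo_angle) * ego_reward + np.sin(svo_angle) * social_term
```

In a multi-agent A2C training loop, each autonomous agent would receive this blended scalar in place of its raw ego reward; the utilities of human-driven vehicles enter only through `other_rewards`, consistent with learning their decision-making implicitly from experience rather than from an explicit behavior model.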