Autonomous agents that drive on roads shared with human drivers must reason about the nuanced interactions among traffic participants. This poses a highly challenging decision-making problem, since human behavior is influenced by a multitude of factors (e.g., human intentions and emotions) that are hard to model. This paper presents a decision-making approach for autonomous driving, focusing on the complex task of merging into moving traffic, where uncertainty emanates from the behavior of other drivers and imperfect sensor measurements. We frame the problem as a partially observable Markov decision process (POMDP) and solve it online with Monte Carlo tree search. The solution to the POMDP is a policy that performs high-level driving maneuvers, such as giving way to an approaching car, keeping a safe distance from the vehicle in front, or merging into traffic. Our method leverages a model learned from data to predict the future states of traffic while explicitly accounting for interactions among the surrounding agents. From these predictions, the autonomous vehicle can anticipate the future consequences of its actions on the environment and optimize its trajectory accordingly. We thoroughly test our approach in simulation, showing that the autonomous vehicle can adapt its behavior to different situations. We also compare against other methods, demonstrating an improvement with respect to the considered performance metrics.
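To make the online-planning idea concrete, the following is a minimal, self-contained sketch of sampling-based maneuver selection for a merge under uncertainty. It is a toy illustration only: the state (a scalar gap to one approaching car), the two maneuvers, the noisy transition model, and all reward values are assumptions made for the example, not the paper's learned interaction model or its full Monte Carlo tree search. The belief over the other driver is represented by particles, and each candidate maneuver is evaluated by simulated rollouts, which is the core mechanism behind online POMDP planning.

```python
import random

# Hypothetical high-level maneuvers (a small subset of those in the paper).
ACTIONS = ["give_way", "merge"]


def step(gap, action, rng):
    """Toy transition/reward model (illustrative assumption).

    `gap` is the distance (m) to an approaching car in the target lane.
    A negative gap means the car has already passed the merge point.
    """
    if action == "merge":
        # Merging is safe with a large gap, or once the car has passed.
        safe = gap > 20.0 or gap <= 0.0
        return None, (10.0 if safe else -100.0)  # terminal after merging
    # Giving way: the approaching car closes the gap with some noise,
    # and waiting incurs a small time penalty.
    return gap - rng.uniform(4.0, 8.0), -1.0


def plan(belief_particles, n_sims=200, depth=8, seed=0):
    """Pick the root maneuver with the best Monte Carlo value estimate.

    `belief_particles` is a particle set over the unobserved gap,
    standing in for the belief state of the POMDP.
    """
    rng = random.Random(seed)
    values = {}
    for action in ACTIONS:
        total = 0.0
        for _ in range(n_sims):
            gap = rng.choice(belief_particles)  # sample a state from the belief
            g, r = step(gap, action, rng)
            total += r
            # Continue with a simple rollout policy: give way until it
            # becomes safe to merge, then merge.
            d = 1
            while g is not None and d < depth:
                a = "merge" if (g > 25.0 or g <= 0.0) else "give_way"
                g, r2 = step(g, a, rng)
                total += (0.95 ** d) * r2
                d += 1
        values[action] = total / n_sims
    return max(values, key=values.get), values
```

With a belief concentrated on a large gap, the planner merges immediately; with a small gap, it prefers to give way and let the approaching car pass first, mirroring the adaptive behavior described above.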