安全和佩德斯 -- 禽兽自主驾驶的元学习 (Cognitive Level-$k$ Meta-Learning for Safe and Pedestrian-Aware Autonomous Driving) - 专知论文

会员服务 ·

0

Cognition · Learning · INTERACT · CARS · MAML ·

2023 年 2 月 1 日

Cognitive Level-$k$ Meta-Learning for Safe and Pedestrian-Aware Autonomous Driving

翻译：安全和佩德斯 -- 禽兽自主驾驶的元学习

Haozhe Lei,Quanyan Zhu

The potential market for modern self-driving cars is enormous, as they are developing remarkably rapidly. At the same time, however, accidents of pedestrian fatalities caused by autonomous driving have been recorded in the case of street crossing. To ensure traffic safety in self-driving environments and respond to vehicle-human interaction challenges such as jaywalking, we propose Level-$k$ Meta Reinforcement Learning (LK-MRL) algorithm. It takes into account the cognitive hierarchy of pedestrian responses and enables self-driving vehicles to adapt to various human behaviors. %which takes into account pedestrian responses while learning the optimal strategies. As a self-driving vehicle algorithm, the LK-MRL combines level-$k$ thinking into MAML to prepare for heterogeneous pedestrians and improve intersection safety based on the combination of meta-reinforcement learning and human cognitive hierarchy framework. We evaluate the algorithm in two cognitive confrontation hierarchy scenarios in an urban traffic simulator and illustrate its role in ensuring road safety by demonstrating its capability of conjectural and higher-level reasoning.

翻译：现代自行驾驶汽车的潜在市场是巨大的,因为它们正在迅速发展。但与此同时,在街头过境时记录了自驾驾驶造成的行人死亡事故。为了确保自行驾驶环境中的交通安全,并应对汽车与人之间的交互挑战,例如行车横行等,我们建议采用“水平-千美元”的“元强化学习”算法(LK-MRL),其中考虑到行人反应的认知等级,使自驾车辆能够适应人类的各种行为。%在学习最佳战略时会考虑到行人的反应。作为自行驾驶车辆算法,LK-MRL将“水平-k$”的思维结合到MAML中,为多行人做好准备,并根据超强力学习和人类认知等级框架的结合,改善交叉安全。我们评估城市交通模拟器两种认知对立等级假设的算法,并通过展示其预测和更高层次推理能力来说明其在确保道路安全方面的作用。

0

相关内容

Cognition

Cognition：Cognition：International Journal of Cognitive Science Explanation：认知：国际认知科学杂志。 Publisher：Elsevier。 SIT： http://www.journals.elsevier.com/cognition/

机器学习组合优化

机器学习组合优化

专知会员服务

110+阅读 · 2021年2月16日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

重磅开讲：图灵奖得主—— Joseph Sifakis

重磅开讲：图灵奖得主—— Joseph Sifakis

THU数据派

0+阅读 · 2022年6月13日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【OpenAI】深度强化学习关键论文列表

【OpenAI】深度强化学习关键论文列表

专知

11+阅读 · 2018年11月10日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

铁铬污泥制备的刚玉型含Cr片状氧化铁及其稳定性机理研究

国家自然科学基金

0+阅读 · 2015年12月31日

分层强剪切内波环境中柱体失稳机理研究

国家自然科学基金

0+阅读 · 2014年12月31日

Al(1-x)CoCrFeNi(1+x)和Cu(1-x)CoCrFeNi(1+x)合金微观组织结构与性能的研究

国家自然科学基金

0+阅读 · 2013年12月31日

微波场强化熔渗烧结CuW80合金Cu组元迁移机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于Riemann问题的交通网络流体力学建模与数值求解

国家自然科学基金

0+阅读 · 2012年12月31日

云计算环境下数据中心的power capping关键问题研究

国家自然科学基金

0+阅读 · 2012年12月31日

高温抗氧化Mo(Si,Al)2/(Cr+Si)复合涂层引起Nb-Si基合金力学性能退化的机理研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于4R的Cu-Al异种金属焊接接头再制造方法研究

国家自然科学基金

0+阅读 · 2009年12月31日

CFRP构件与Al合金构件胶接界面特性与失效机理研究

国家自然科学基金

0+阅读 · 2008年12月31日

环境友好型高取向织构化铁电压电陶瓷的制备及机理研究

国家自然科学基金

0+阅读 · 2008年12月31日

Evaluating the Robustness of Deep Reinforcement Learning for Autonomous Policies in a Multi-agent Urban Driving Environment

Evaluating the Robustness of Deep Reinforcement Learning for Autonomous Policies in a Multi-agent Urban Driving Environment

Arxiv

0+阅读 · 2023年3月23日

Planning-oriented Autonomous Driving

Arxiv

1+阅读 · 2023年3月23日

Adaptive Road Configurations for Improved Autonomous Vehicle-Pedestrian Interactions using Reinforcement Learning

Arxiv

0+阅读 · 2023年3月22日

Penalty-Based Imitation Learning With Cross Semantics Generation Sensor Fusion for Autonomous Driving

Penalty-Based Imitation Learning With Cross Semantics Generation Sensor Fusion for Autonomous Driving

Arxiv

0+阅读 · 2023年3月21日

Motion Planning for Autonomous Driving: The State of the Art and Perspectives

Arxiv

0+阅读 · 2023年3月21日

Deep Q-Network Based Decision Making for Autonomous Driving

Arxiv

0+阅读 · 2023年3月21日

Digital twin in virtual reality for human-vehicle interactions in the context of autonomous driving

Arxiv

0+阅读 · 2023年3月20日

Emergent Bartering Behaviour in Multi-Agent Reinforcement Learning

Emergent Bartering Behaviour in Multi-Agent Reinforcement Learning

Arxiv

19+阅读 · 2022年5月13日

Explainable Artificial Intelligence for Autonomous Driving: A Comprehensive Overview and Field Guide for Future Research Directions

Arxiv

18+阅读 · 2021年12月21日

Adaptive Synthetic Characters for Military Training

Adaptive Synthetic Characters for Military Training

Arxiv

49+阅读 · 2021年1月6日

VIP会员

文章信息

相关主题

相关VIP内容

机器学习组合优化

机器学习组合优化

专知会员服务

110+阅读 · 2021年2月16日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【CMU博士论文】数据驱动决策中的激励、信息与不确定性

DGP双粒度提示框架：图增强大模型助力欺诈检测

【ICCV2025】ESSENTIAL：用于视频类增量学习的情景记忆与语义记忆整合

唯快不破：大型语言模型高效架构综述

相关资讯

重磅开讲：图灵奖得主—— Joseph Sifakis

重磅开讲：图灵奖得主—— Joseph Sifakis

THU数据派

0+阅读 · 2022年6月13日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【OpenAI】深度强化学习关键论文列表

【OpenAI】深度强化学习关键论文列表

专知

11+阅读 · 2018年11月10日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

相关论文

Evaluating the Robustness of Deep Reinforcement Learning for Autonomous Policies in a Multi-agent Urban Driving Environment

Evaluating the Robustness of Deep Reinforcement Learning for Autonomous Policies in a Multi-agent Urban Driving Environment

Arxiv

0+阅读 · 2023年3月23日

Planning-oriented Autonomous Driving

Arxiv

1+阅读 · 2023年3月23日

Adaptive Road Configurations for Improved Autonomous Vehicle-Pedestrian Interactions using Reinforcement Learning

Arxiv

0+阅读 · 2023年3月22日

Penalty-Based Imitation Learning With Cross Semantics Generation Sensor Fusion for Autonomous Driving

Penalty-Based Imitation Learning With Cross Semantics Generation Sensor Fusion for Autonomous Driving

Arxiv

0+阅读 · 2023年3月21日

Motion Planning for Autonomous Driving: The State of the Art and Perspectives

Arxiv

0+阅读 · 2023年3月21日

Deep Q-Network Based Decision Making for Autonomous Driving

Arxiv

0+阅读 · 2023年3月21日

Digital twin in virtual reality for human-vehicle interactions in the context of autonomous driving

Arxiv

0+阅读 · 2023年3月20日

Emergent Bartering Behaviour in Multi-Agent Reinforcement Learning

Emergent Bartering Behaviour in Multi-Agent Reinforcement Learning

Arxiv

19+阅读 · 2022年5月13日

Explainable Artificial Intelligence for Autonomous Driving: A Comprehensive Overview and Field Guide for Future Research Directions

Arxiv

18+阅读 · 2021年12月21日

Adaptive Synthetic Characters for Military Training

Adaptive Synthetic Characters for Military Training

Arxiv

49+阅读 · 2021年1月6日

相关基金

铁铬污泥制备的刚玉型含Cr片状氧化铁及其稳定性机理研究

国家自然科学基金

0+阅读 · 2015年12月31日

分层强剪切内波环境中柱体失稳机理研究

国家自然科学基金

0+阅读 · 2014年12月31日

Al(1-x)CoCrFeNi(1+x)和Cu(1-x)CoCrFeNi(1+x)合金微观组织结构与性能的研究

国家自然科学基金

0+阅读 · 2013年12月31日

微波场强化熔渗烧结CuW80合金Cu组元迁移机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于Riemann问题的交通网络流体力学建模与数值求解

国家自然科学基金

0+阅读 · 2012年12月31日

云计算环境下数据中心的power capping关键问题研究

国家自然科学基金

0+阅读 · 2012年12月31日

高温抗氧化Mo(Si,Al)2/(Cr+Si)复合涂层引起Nb-Si基合金力学性能退化的机理研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于4R的Cu-Al异种金属焊接接头再制造方法研究

国家自然科学基金

0+阅读 · 2009年12月31日

CFRP构件与Al合金构件胶接界面特性与失效机理研究

国家自然科学基金

0+阅读 · 2008年12月31日

环境友好型高取向织构化铁电压电陶瓷的制备及机理研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员