Hierarchical Control for Cooperative Teams in Competitive Autonomous Racing - 专知 (Zhuanzhi) Papers

April 27, 2022

Hierarchical Control for Cooperative Teams in Competitive Autonomous Racing

Rishabh Saumil Thakkar, Aryaman Singh Samyal, David Fridovich-Keil, Zhe Xu, Ufuk Topcu

From arXiv. Submitted to IEEE Transactions on Control Systems Technology. arXiv admin note: substantial text overlap with arXiv:2202.12861

We study the problem of autonomous racing among teams of cooperative agents subject to realistic safety and fairness rules. To solve this problem, we develop a two-level hierarchical controller, extending prior work in which bi-level hierarchical control was applied to head-to-head autonomous racing. A high-level planner constructs a discrete game that encodes the complex rules with simplified dynamics and produces a sequence of target waypoints. The low-level controller uses these waypoints as a reference trajectory and computes high-resolution control inputs by solving a simplified racing game with a reduced set of rules. We consider two approaches for the low-level controller: training a multi-agent reinforcement learning (MARL) policy and solving a linear-quadratic Nash game (LQNG) approximation. We test our controllers against three baselines on a simple oval track and a complex track: an end-to-end MARL controller, a MARL controller tracking a fixed racing line, and an LQNG controller tracking a fixed racing line. Quantitative results show that our hierarchical methods outperform their respective baselines in race wins, overall team performance, and adherence to the rules. Qualitatively, we observe the hierarchical controllers mimicking actions performed by expert human drivers, such as coordinated overtaking moves, defending against multiple opponents, and long-term planning for delayed advantages. We show that hierarchical planning for game-theoretic reasoning produces both cooperative and competitive behavior even when challenged with complex rules and constraints.
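The two-level structure described in the abstract can be illustrated with a minimal, single-agent Python sketch. It is not the authors' method: the discrete game is replaced here by plain dynamic programming over a checkpoint-by-lane grid, and the MARL and LQNG low-level controllers are replaced by a clipped proportional tracker. All names (`Waypoint`, `plan_waypoints`, `track_waypoints`), the cost table, and every numeric parameter are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass
class Waypoint:
    s: float   # distance along the track (m)
    lane: int  # discrete lateral lane index


def plan_waypoints(lane_costs, start_lane, switch_cost=1.0, spacing=10.0):
    """High level (assumed stand-in for the discrete game): dynamic
    programming over a checkpoint x lane grid with simplified dynamics.
    lane_costs[k][l] is a hypothetical cost of being in lane l at
    checkpoint k; only moves to adjacent lanes are allowed."""
    n_steps, n_lanes = len(lane_costs), len(lane_costs[0])
    inf = float("inf")
    best = [[inf] * n_lanes for _ in range(n_steps)]
    parent = [[-1] * n_lanes for _ in range(n_steps)]
    for l in range(n_lanes):
        if abs(l - start_lane) <= 1:
            best[0][l] = lane_costs[0][l] + switch_cost * abs(l - start_lane)
            parent[0][l] = start_lane
    for k in range(1, n_steps):
        for l in range(n_lanes):
            for p in (l - 1, l, l + 1):
                if 0 <= p < n_lanes and best[k - 1][p] < inf:
                    c = best[k - 1][p] + lane_costs[k][l] + switch_cost * abs(l - p)
                    if c < best[k][l]:
                        best[k][l], parent[k][l] = c, p
    # Backtrack from the cheapest final lane to recover the lane sequence.
    lane = min(range(n_lanes), key=lambda l: best[-1][l])
    lanes = []
    for k in range(n_steps - 1, -1, -1):
        lanes.append(lane)
        lane = parent[k][lane]
    lanes.reverse()
    return [Waypoint(s=(k + 1) * spacing, lane=l) for k, l in enumerate(lanes)]


def track_waypoints(waypoints, start_lane, lane_width=3.0, speed=20.0,
                    dt=0.05, kp=4.0, max_rate=6.0):
    """Low level (assumed stand-in for MARL / LQNG): follow the waypoints
    at a finer time resolution with a clipped proportional lateral law."""
    s, d = 0.0, start_lane * lane_width
    trace = []
    for wp in waypoints:
        target_d = wp.lane * lane_width
        while s < wp.s:
            u = max(-max_rate, min(max_rate, kp * (target_d - d)))
            d += u * dt       # lateral position update
            s += speed * dt   # constant forward speed
            trace.append((s, d))
    return trace


if __name__ == "__main__":
    # Lane 1 is the preferred line; an opponent blocks it at checkpoint 1.
    costs = [[1, 0, 1], [5, 100, 1], [1, 0, 1], [1, 0, 1]]
    plan = plan_waypoints(costs, start_lane=1)
    print([w.lane for w in plan])  # [1, 2, 1, 1]
    print(track_waypoints(plan, start_lane=1)[-1])
```

In this toy setup the planner leaves the preferred lane only at the blocked checkpoint and then returns to it, while the tracker turns the coarse lane sequence into a smooth lateral trajectory. The split mirrors the division of labor in the abstract: rule-aware, long-horizon decisions on a coarse grid, and high-resolution control against the resulting reference.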
