回到未来:在 " 达空运动会 " 中有效、时间一致的解决办法 (Back to the Future: Efficient, Time-Consistent Solutions in Reach-Avoid Games) - 专知论文

会员服务 ·

0

优化器 · INTERACT · 散度 · 回合 · 值域 ·

2021 年 9 月 16 日

Back to the Future: Efficient, Time-Consistent Solutions in Reach-Avoid Games

翻译：回到未来:在 " 达空运动会 " 中有效、时间一致的解决办法

Dennis R. Anthony,David Fridovich-Keil,Jaime F. Fisac

from arxiv, submitted to Robotics and Automation Letters

We study the class of reach-avoid dynamic games in which multiple agents interact noncooperatively, and each wishes to satisfy a distinct target condition while avoiding a failure condition. Reach-avoid games are commonly used to express safety-critical optimal control problems found in mobile robot motion planning. While a wide variety of approaches exist for these motion planning problems, we focus on finding time-consistent solutions, in which planned future motion is still optimal despite prior suboptimal actions. Though abstract, time consistency encapsulates an extremely desirable property: namely, time-consistent motion plans remain optimal even when a robot's motion diverges from the plan early on due to, e.g., intrinsic dynamic uncertainty or extrinsic environment disturbances. Our main contribution is a computationally-efficient algorithm for multi-agent reach-avoid games which renders time-consistent solutions. We demonstrate our approach in a simulated driving scenario, where we construct a two-player adversarial game to model a range of defensive driving behaviors.

翻译：我们研究的是“达到-避免”的动态游戏,其中多个代理人不合作地互动,每个代理人都希望满足一个不同的目标条件,同时避免失败条件。“达到-避免”游戏通常用来表达移动机器人运动规划中发现的安全关键最佳控制问题。虽然在这些运动规划问题方面存在着各种各样的办法,但我们侧重于寻找时间一致的解决办法,在这种办法中,计划的未来运动尽管在前几个最优的行动中仍然最理想。尽管时间一致包涵了一种非常可取的属性:即时间一致的动作计划仍然是最佳的,即使机器人的动作与计划有差异,例如,由于内在的动态不确定性或极端环境的干扰。我们的主要贡献是多试剂接触-避免游戏的计算效率算法,这种算法使得时间一致的解决办法。我们在模拟的驱动情景中展示了我们的方法,在模拟的驱动情景中我们构建了一种双人对抗游戏,以模拟一系列防御性驱动行为。

0

相关内容

优化器

【南京大学】量子计算 (Spring 2021)课程

专知会员服务

59+阅读 · 2021年4月12日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

【AAAI Tutorials 2019】定价和拍卖自动化机制设计的新领域(New Frontiers of Automated Mechanism Design for Pricing and Auctions)

【AAAI Tutorials 2019】定价和拍卖自动化机制设计的新领域(New Frontiers of Automated Mechanism Design for Pricing and Auctions)

专知会员服务

8+阅读 · 2019年11月18日

【O'Reilly AI Conference 2019】部署大规模分布式数据（How to deploy large-scale distributed data analytics and machine learning on containers (sponsored by HPE))，HPE BlueData，Thomas Phelan

【O'Reilly AI Conference 2019】部署大规模分布式数据（How to deploy large-scale distributed data analytics and machine learning on containers (sponsored by HPE))，HPE BlueData，Thomas Phelan

专知会员服务

19+阅读 · 2019年11月5日

【O'Reilly AI Conference 2019】高管简报：从落后者到领导者-赢得AI竞赛（Executive Briefing: From laggard to leader—Winning the AI race），Anastasia Kouvela , Bharath Thota

【O'Reilly AI Conference 2019】高管简报：从落后者到领导者-赢得AI竞赛（Executive Briefing: From laggard to leader—Winning the AI race），Anastasia Kouvela , Bharath Thota

专知会员服务

8+阅读 · 2019年11月5日

【O'Reilly AI Conference 2019】高管简报:展望在线定价和算法主导的共谋的未来（Executive Briefing: A look at the future of online pricing and algorithm-led collusion），Rebecca Gu (Electron), Cris Lowery (Baringa Partners)

【O'Reilly AI Conference 2019】高管简报:展望在线定价和算法主导的共谋的未来（Executive Briefing: A look at the future of online pricing and algorithm-led collusion），Rebecca Gu (Electron), Cris Lowery (Baringa Partners)

专知会员服务

7+阅读 · 2019年11月5日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

【新书】Python编程基础，669页pdf

【新书】Python编程基础，669页pdf

专知会员服务

196+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

MIT新书《强化学习与最优控制》

MIT新书《强化学习与最优控制》

专知会员服务

280+阅读 · 2019年10月9日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

计算机 | ISMAR 2019等国际会议信息8条

计算机 | ISMAR 2019等国际会议信息8条

Call4Papers

3+阅读 · 2019年3月5日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

计算机类 | LICS 2019等国际会议信息7条

计算机类 | LICS 2019等国际会议信息7条

Call4Papers

3+阅读 · 2018年12月17日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

人工智能 | 国际会议截稿信息9条

人工智能 | 国际会议截稿信息9条

Call4Papers

4+阅读 · 2018年3月13日

分布式TensorFlow入门指南

分布式TensorFlow入门指南

机器学习研究会

4+阅读 · 2017年11月28日

计算机类 | 国际会议信息7条

计算机类 | 国际会议信息7条

Call4Papers

3+阅读 · 2017年11月17日

【计算机类】期刊专刊/国际会议截稿信息6条

【计算机类】期刊专刊/国际会议截稿信息6条

Call4Papers

3+阅读 · 2017年10月13日

Data-Efficient Deep Reinforcement Learning for Attitude Control of Fixed-Wing UAVs: Field Experiments

Arxiv

0+阅读 · 2021年11月7日

Algorithmic Information Design in Multi-Player Games: Possibility and Limits in Singleton Congestion

Arxiv

0+阅读 · 2021年11月6日

Monostatic sampling methods in limited-aperture configuration

Arxiv

0+阅读 · 2021年11月5日

Multi-Objective Constrained Optimization for Energy Applications via Tree Ensembles

Arxiv

0+阅读 · 2021年11月4日

Placement Optimization and Power Control in Intelligent Reflecting Surface Aided Multiuser System

Placement Optimization and Power Control in Intelligent Reflecting Surface Aided Multiuser System

Arxiv

0+阅读 · 2021年11月4日

Constrained Form-Finding of Tension-Compression Structures using Automatic Differentiation

Arxiv

0+阅读 · 2021年11月4日

Density Constrained Reinforcement Learning

Arxiv

6+阅读 · 2021年6月24日

Multi-Agent Cooperative Bidding Games for Multi-Objective Optimization in e-Commercial Sponsored Search

Arxiv

12+阅读 · 2021年6月8日

Inverse Constrained Reinforcement Learning

Arxiv

8+阅读 · 2021年5月21日

Cellular-Connected UAVs over 5G: Deep Reinforcement Learning for Interference Management

Arxiv

4+阅读 · 2018年1月16日

VIP会员

文章信息

相关主题

相关VIP内容

【南京大学】量子计算 (Spring 2021)课程

专知会员服务

59+阅读 · 2021年4月12日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

【AAAI Tutorials 2019】定价和拍卖自动化机制设计的新领域(New Frontiers of Automated Mechanism Design for Pricing and Auctions)

【AAAI Tutorials 2019】定价和拍卖自动化机制设计的新领域(New Frontiers of Automated Mechanism Design for Pricing and Auctions)

专知会员服务

8+阅读 · 2019年11月18日

【O'Reilly AI Conference 2019】部署大规模分布式数据（How to deploy large-scale distributed data analytics and machine learning on containers (sponsored by HPE))，HPE BlueData，Thomas Phelan

【O'Reilly AI Conference 2019】部署大规模分布式数据（How to deploy large-scale distributed data analytics and machine learning on containers (sponsored by HPE))，HPE BlueData，Thomas Phelan

专知会员服务

19+阅读 · 2019年11月5日

【O'Reilly AI Conference 2019】高管简报：从落后者到领导者-赢得AI竞赛（Executive Briefing: From laggard to leader—Winning the AI race），Anastasia Kouvela , Bharath Thota

【O'Reilly AI Conference 2019】高管简报：从落后者到领导者-赢得AI竞赛（Executive Briefing: From laggard to leader—Winning the AI race），Anastasia Kouvela , Bharath Thota

专知会员服务

8+阅读 · 2019年11月5日

【O'Reilly AI Conference 2019】高管简报:展望在线定价和算法主导的共谋的未来（Executive Briefing: A look at the future of online pricing and algorithm-led collusion），Rebecca Gu (Electron), Cris Lowery (Baringa Partners)

【O'Reilly AI Conference 2019】高管简报:展望在线定价和算法主导的共谋的未来（Executive Briefing: A look at the future of online pricing and algorithm-led collusion），Rebecca Gu (Electron), Cris Lowery (Baringa Partners)

专知会员服务

7+阅读 · 2019年11月5日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

【新书】Python编程基础，669页pdf

【新书】Python编程基础，669页pdf

专知会员服务

196+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

MIT新书《强化学习与最优控制》

MIT新书《强化学习与最优控制》

专知会员服务

280+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《物联网（IoT）中的无人机通信高效控制》135页

《在GNSS信号降级环境中利用共识实现无人机集群稳健协调》

中程单向攻击无人机的战略意义：俄乌战争启示

《面向无人机集群的避障动态传感器覆盖算法》最新38页

相关资讯

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

计算机 | ISMAR 2019等国际会议信息8条

计算机 | ISMAR 2019等国际会议信息8条

Call4Papers

3+阅读 · 2019年3月5日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

计算机类 | LICS 2019等国际会议信息7条

计算机类 | LICS 2019等国际会议信息7条

Call4Papers

3+阅读 · 2018年12月17日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

人工智能 | 国际会议截稿信息9条

人工智能 | 国际会议截稿信息9条

Call4Papers

4+阅读 · 2018年3月13日

分布式TensorFlow入门指南

分布式TensorFlow入门指南

机器学习研究会

4+阅读 · 2017年11月28日

计算机类 | 国际会议信息7条

计算机类 | 国际会议信息7条

Call4Papers

3+阅读 · 2017年11月17日

【计算机类】期刊专刊/国际会议截稿信息6条

【计算机类】期刊专刊/国际会议截稿信息6条

Call4Papers

3+阅读 · 2017年10月13日

相关论文

Data-Efficient Deep Reinforcement Learning for Attitude Control of Fixed-Wing UAVs: Field Experiments

Arxiv

0+阅读 · 2021年11月7日

Algorithmic Information Design in Multi-Player Games: Possibility and Limits in Singleton Congestion

Arxiv

0+阅读 · 2021年11月6日

Monostatic sampling methods in limited-aperture configuration

Arxiv

0+阅读 · 2021年11月5日

Multi-Objective Constrained Optimization for Energy Applications via Tree Ensembles

Arxiv

0+阅读 · 2021年11月4日

Placement Optimization and Power Control in Intelligent Reflecting Surface Aided Multiuser System

Placement Optimization and Power Control in Intelligent Reflecting Surface Aided Multiuser System

Arxiv

0+阅读 · 2021年11月4日

Constrained Form-Finding of Tension-Compression Structures using Automatic Differentiation

Arxiv

0+阅读 · 2021年11月4日

Density Constrained Reinforcement Learning

Arxiv

6+阅读 · 2021年6月24日

Multi-Agent Cooperative Bidding Games for Multi-Objective Optimization in e-Commercial Sponsored Search

Arxiv

12+阅读 · 2021年6月8日

Inverse Constrained Reinforcement Learning

Arxiv

8+阅读 · 2021年5月21日

Cellular-Connected UAVs over 5G: Deep Reinforcement Learning for Interference Management

Arxiv

4+阅读 · 2018年1月16日

微信扫码咨询专知VIP会员