制定规模可扩缩的规划和学习框架,解决群到群到群的接触问题 (Scalable Planning and Learning Framework Development for Swarm-to-Swarm Engagement Problems) - 专知论文

会员服务 ·

0

Learning · 控制器 · 相互独立的 · Guidance · 统计量 ·

2022 年 12 月 6 日

Scalable Planning and Learning Framework Development for Swarm-to-Swarm Engagement Problems

翻译：制定规模可扩缩的规划和学习框架,解决群到群到群的接触问题

Umut Demir,A. Sadik Satir,Gulay Goktas Sever,Cansu Yikilmaz,Nazim Kemal Ure

from arxiv, Accepted to SciTech2023

Development of guidance, navigation and control frameworks/algorithms for swarms attracted significant attention in recent years. That being said, algorithms for planning swarm allocations/trajectories for engaging with enemy swarms is largely an understudied problem. Although small-scale scenarios can be addressed with tools from differential game theory, existing approaches fail to scale for large-scale multi-agent pursuit evasion (PE) scenarios. In this work, we propose a reinforcement learning (RL) based framework to decompose to large-scale swarm engagement problems into a number of independent multi-agent pursuit-evasion games. We simulate a variety of multi-agent PE scenarios, where finite time capture is guaranteed under certain conditions. The calculated PE statistics are provided as a reward signal to the high level allocation layer, which uses an RL algorithm to allocate controlled swarm units to eliminate enemy swarm units with maximum efficiency. We verify our approach in large-scale swarm-to-swarm engagement simulations.

翻译：近年来,发展对群落的指导、导航和控制框架/参数/参数的开发吸引了相当多的注意力。说到这一点,规划与敌群群接触的群/轨迹分配/轨迹的算法在很大程度上是一个研究不足的问题。虽然可以通过不同游戏理论的工具来解决小规模的情景,但现有方法无法用于大型多剂追逐(PE)场景的大规模多剂追逐(PE)场景。在这项工作中,我们提议了一个基于强化学习(RL)的框架,将大规模群集参与问题分解为若干独立的多剂追逐-蒸发游戏。我们模拟了多种多剂PE场情景,保证在某些条件下有限时间捕捉到。计算出来的PE统计数据是作为奖励信号提供给高层分配层的,该层使用RL算法来分配受控的群温单位,以最大效率消灭敌群温单位。我们验证了大规模群到群集参与模拟中的方法。

0

相关内容

Learning

《机器学习模型中不确定性的量化和推理》CMU2022最新29页slides

《机器学习模型中不确定性的量化和推理》CMU2022最新29页slides

专知会员服务

56+阅读 · 2022年11月28日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

机器学习组合优化

机器学习组合优化

专知会员服务

110+阅读 · 2021年2月16日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

【ICIG2021】Latest News & Announcements of the Industry Talk2

【ICIG2021】Latest News & Announcements of the Industry Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年7月29日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

基于调度采样的网络化系统分布式控制策略研究

国家自然科学基金

0+阅读 · 2015年12月31日

不确定条件下基于分群策略的柔性Flow Shop调度问题研究

国家自然科学基金

0+阅读 · 2013年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

可压缩Navier-Stokes方程和Boltzmann方程解的渐近行为

国家自然科学基金

0+阅读 · 2013年12月31日

Vlasov-Poisson-Boltzmann方程研究

国家自然科学基金

0+阅读 · 2013年12月31日

非系统性创业风险的识别和控制机制：基于认知视角的实证研究

国家自然科学基金

0+阅读 · 2012年12月31日

全球变化背景下伊犁山地草原苦豆子无性系种群生活史的响应及调节机理

国家自然科学基金

0+阅读 · 2012年12月31日

CuInGaSe2太阳能电池界面结构、界面态及其钝化

国家自然科学基金

0+阅读 · 2012年12月31日

基于多Agent系统的流域防洪智能调度研究

国家自然科学基金

0+阅读 · 2011年12月31日

养殖海域浮游动物群落结构演变对环境变化的生物地球化学响应

国家自然科学基金

0+阅读 · 2011年12月31日

Decentralized Riemannian Algorithm for Nonconvex Minimax Problems

Arxiv

0+阅读 · 2023年2月8日

Learning structured approximations of combinatorial optimization problems

Arxiv

1+阅读 · 2023年2月6日

Models and algorithms for simple disjunctive temporal problems

Arxiv

0+阅读 · 2023年2月6日

Learning Trees of $\ell_0$-Minimization Problems

Arxiv

0+阅读 · 2023年2月6日

First-Order Algorithms for Nonlinear Generalized Nash Equilibrium Problems

Arxiv

0+阅读 · 2023年2月5日

Learning Solution Manifolds for Control Problems via Energy Minimization

Arxiv

0+阅读 · 2023年2月4日

DeepPSL: End-to-end perception and reasoning

Arxiv

0+阅读 · 2023年2月4日

A Simple Approach for Local and Global Variable Importance in Nonlinear Regression Models

Arxiv

0+阅读 · 2023年2月3日

Benchmarking Algorithms for Submodular Optimization Problems Using IOHProfiler

Arxiv

0+阅读 · 2023年2月2日

Nocturne: a scalable driving benchmark for bringing multi-agent learning one step closer to the real world

Arxiv

0+阅读 · 2023年2月2日

VIP会员

文章信息

相关主题

相互独立的

相关VIP内容

《机器学习模型中不确定性的量化和推理》CMU2022最新29页slides

《机器学习模型中不确定性的量化和推理》CMU2022最新29页slides

专知会员服务

56+阅读 · 2022年11月28日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

机器学习组合优化

机器学习组合优化

专知会员服务

110+阅读 · 2021年2月16日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【新书】面向企业的图学习扩展：生产级图学习与推理，485页pdf

AI智能体编程：技术、挑战与机遇综述

【国家标准】数据安全技术数据安全风险评估方法

【CMU博士论文】交互式学习的进展：替代性反馈机制与自适应因果推理

相关资讯

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

【ICIG2021】Latest News & Announcements of the Industry Talk2

【ICIG2021】Latest News & Announcements of the Industry Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年7月29日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

相关论文

Decentralized Riemannian Algorithm for Nonconvex Minimax Problems

Arxiv

0+阅读 · 2023年2月8日

Learning structured approximations of combinatorial optimization problems

Arxiv

1+阅读 · 2023年2月6日

Models and algorithms for simple disjunctive temporal problems

Arxiv

0+阅读 · 2023年2月6日

Learning Trees of $\ell_0$-Minimization Problems

Arxiv

0+阅读 · 2023年2月6日

First-Order Algorithms for Nonlinear Generalized Nash Equilibrium Problems

Arxiv

0+阅读 · 2023年2月5日

Learning Solution Manifolds for Control Problems via Energy Minimization

Arxiv

0+阅读 · 2023年2月4日

DeepPSL: End-to-end perception and reasoning

Arxiv

0+阅读 · 2023年2月4日

A Simple Approach for Local and Global Variable Importance in Nonlinear Regression Models

Arxiv

0+阅读 · 2023年2月3日

Benchmarking Algorithms for Submodular Optimization Problems Using IOHProfiler

Arxiv

0+阅读 · 2023年2月2日

Nocturne: a scalable driving benchmark for bringing multi-agent learning one step closer to the real world

Arxiv

0+阅读 · 2023年2月2日

相关基金

基于调度采样的网络化系统分布式控制策略研究

国家自然科学基金

0+阅读 · 2015年12月31日

不确定条件下基于分群策略的柔性Flow Shop调度问题研究

国家自然科学基金

0+阅读 · 2013年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

可压缩Navier-Stokes方程和Boltzmann方程解的渐近行为

国家自然科学基金

0+阅读 · 2013年12月31日

Vlasov-Poisson-Boltzmann方程研究

国家自然科学基金

0+阅读 · 2013年12月31日

非系统性创业风险的识别和控制机制：基于认知视角的实证研究

国家自然科学基金

0+阅读 · 2012年12月31日

全球变化背景下伊犁山地草原苦豆子无性系种群生活史的响应及调节机理

国家自然科学基金

0+阅读 · 2012年12月31日

CuInGaSe2太阳能电池界面结构、界面态及其钝化

国家自然科学基金

0+阅读 · 2012年12月31日

基于多Agent系统的流域防洪智能调度研究

国家自然科学基金

0+阅读 · 2011年12月31日

养殖海域浮游动物群落结构演变对环境变化的生物地球化学响应

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员