神经 -- -- 共和主义同时施虐运动会的平衡性 (Finite-horizon Equilibria for Neuro-symbolic Concurrent Stochastic Games) - 专知论文

会员服务 ·

0

Agent · 讲稿 · 反向归纳 · Automator · MoDELS ·

2022 年 6 月 18 日

Finite-horizon Equilibria for Neuro-symbolic Concurrent Stochastic Games

翻译：神经 -- -- 共和主义同时施虐运动会的平衡性

Rui Yan,Gabriel Santos,Xiaoming Duan,David Parker,Marta Kwiatkowska

from arxiv, 14 pages, 7 figures

We present novel techniques for neuro-symbolic concurrent stochastic games, a recently proposed modelling formalism to represent a set of probabilistic agents operating in a continuous-space environment using a combination of neural network based perception mechanisms and traditional symbolic methods. To date, only zero-sum variants of the model were studied, which is too restrictive when agents have distinct objectives. We formalise notions of equilibria for these models and present algorithms to synthesise them. Focusing on the finite-horizon setting, and (global) social welfare subgame-perfect optimality, we consider two distinct types: Nash equilibria and correlated equilibria. We first show that an exact solution based on backward induction may yield arbitrarily bad equilibria. We then propose an approximation algorithm called frozen subgame improvement, which proceeds through iterative solution of nonlinear programs. We develop a prototype implementation and demonstrate the benefits of our approach on two case studies: an automated car-parking system and an aircraft collision avoidance system.

翻译：我们提出了新颖的神经共振共振游戏技术,这是最近提议的一种模拟形式主义,代表了在连续空间环境中使用基于神经网络的感知机制和传统象征性方法的组合进行操作的一组概率性剂。迄今为止,只研究了模型的零和变体,这些变体在代理体有不同目标时过于严格。我们将这些模型的平衡概念正规化,并提出了合成这些模型的算法。我们侧重于有限和(全球)社会福利次游戏的最佳性,我们考虑了两种截然不同的类别:Nash equilibria和相对的平衡性。我们首先表明,基于后向感应的精确解决办法可能会产生任意的偏差。我们随后提出了一种叫作冻结子游戏改进的近似算法,它通过非线性方案的迭代解决方案进行。我们开发了一个原型实施,并展示了我们方法在两个案例研究上的好处:自动汽车定位系统和避免飞机碰撞系统。

0

相关内容

Agent

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

《DeepGCNs: Making GCNs Go as Deep as CNNs》

《DeepGCNs: Making GCNs Go as Deep as CNNs》

专知会员服务

31+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【新书发布】原作者MarcG.Bellemare发布315页分布强化学习书籍(DistributionalRL)

【新书发布】原作者MarcG.Bellemare发布315页分布强化学习书籍(DistributionalRL)

深度强化学习实验室

1+阅读 · 2022年1月11日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

【ICIG2021】Latest News & Announcements of the Industry Talk2

【ICIG2021】Latest News & Announcements of the Industry Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年7月29日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

CML细胞鞘氨醇激酶去SUMO化修饰及在发病中的作用研究

国家自然科学基金

0+阅读 · 2014年12月31日

Faecalibacterium prausnitzii协同LFA-1在炎症性肠病发生中调控淋巴细胞分化及功能的作用机制

国家自然科学基金

0+阅读 · 2014年12月31日

混联双级NTP系统协同DPF和SCR同步降低柴油机PM和NOx排放的化学反应机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

Lee偏差在试验设计中的应用研究

国家自然科学基金

0+阅读 · 2013年12月31日

水莱茵海默氏菌 (Rheinheimera aquimaris) 淬灭细菌群体感应的机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

用于可见光光谱重建的窄谱LED发光机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

短小芽孢杆菌TUBP1抗棉花黄萎病菌活性成分及作用机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

气候变化对疟疾传播影响的数学建模研究

国家自然科学基金

0+阅读 · 2011年12月31日

PAEs致雄性生殖内分泌毒性传代效应的表观遗传机制

国家自然科学基金

0+阅读 · 2011年12月31日

宽叶荨麻抗类风湿关节炎物质基础及作用机理的研究

国家自然科学基金

0+阅读 · 2009年12月31日

Reinforcement Learning for Freight Booking Control Problems

Arxiv

0+阅读 · 2022年8月9日

The Right Kind of Non-Determinism: Using Concurrency to Verify C Programs with Underspecified Semantics

Arxiv

0+阅读 · 2022年8月9日

Formalization of a Stochastic Approximation Theorem

Arxiv

0+阅读 · 2022年8月8日

Regret Minimization and Convergence to Equilibria in General-sum Markov Games

Arxiv

0+阅读 · 2022年8月8日

Recurrent networks, hidden states and beliefs in partially observable environments

Arxiv

0+阅读 · 2022年8月6日

The Extended UCB Policies for Frequentist Multi-armed Bandit Problems

Arxiv

0+阅读 · 2022年8月6日

Data-driven Control of Agent-based Models: an Equation/Variable-free Machine Learning Approach

Arxiv

0+阅读 · 2022年8月5日

Hybrid cuckoo search algorithm for the minimum dominating set problem

Arxiv

0+阅读 · 2022年8月5日

Petri Nets for Concurrent Programming

Arxiv

0+阅读 · 2022年8月4日

Decentralized and Communication-Free Multi-Robot Navigation through Distributed Games

Arxiv

40+阅读 · 2021年9月15日

VIP会员

文章信息

相关主题

相关VIP内容

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

《DeepGCNs: Making GCNs Go as Deep as CNNs》

《DeepGCNs: Making GCNs Go as Deep as CNNs》

专知会员服务

31+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《人工智能绝不能完全自主》

《人工智能的法律与伦理：军事自主机器独特挑战的深度剖析》316页

从数据到主导：AI与兵棋推演构筑决策优势

《特洛伊木马货柜：武器化集装箱的战略威胁》最新报告

相关资讯

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【新书发布】原作者MarcG.Bellemare发布315页分布强化学习书籍(DistributionalRL)

【新书发布】原作者MarcG.Bellemare发布315页分布强化学习书籍(DistributionalRL)

深度强化学习实验室

1+阅读 · 2022年1月11日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

【ICIG2021】Latest News & Announcements of the Industry Talk2

【ICIG2021】Latest News & Announcements of the Industry Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年7月29日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Reinforcement Learning for Freight Booking Control Problems

Arxiv

0+阅读 · 2022年8月9日

The Right Kind of Non-Determinism: Using Concurrency to Verify C Programs with Underspecified Semantics

Arxiv

0+阅读 · 2022年8月9日

Formalization of a Stochastic Approximation Theorem

Arxiv

0+阅读 · 2022年8月8日

Regret Minimization and Convergence to Equilibria in General-sum Markov Games

Arxiv

0+阅读 · 2022年8月8日

Recurrent networks, hidden states and beliefs in partially observable environments

Arxiv

0+阅读 · 2022年8月6日

The Extended UCB Policies for Frequentist Multi-armed Bandit Problems

Arxiv

0+阅读 · 2022年8月6日

Data-driven Control of Agent-based Models: an Equation/Variable-free Machine Learning Approach

Arxiv

0+阅读 · 2022年8月5日

Hybrid cuckoo search algorithm for the minimum dominating set problem

Arxiv

0+阅读 · 2022年8月5日

Petri Nets for Concurrent Programming

Arxiv

0+阅读 · 2022年8月4日

Decentralized and Communication-Free Multi-Robot Navigation through Distributed Games

Arxiv

40+阅读 · 2021年9月15日

相关基金

CML细胞鞘氨醇激酶去SUMO化修饰及在发病中的作用研究

国家自然科学基金

0+阅读 · 2014年12月31日

Faecalibacterium prausnitzii协同LFA-1在炎症性肠病发生中调控淋巴细胞分化及功能的作用机制

国家自然科学基金

0+阅读 · 2014年12月31日

混联双级NTP系统协同DPF和SCR同步降低柴油机PM和NOx排放的化学反应机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

Lee偏差在试验设计中的应用研究

国家自然科学基金

0+阅读 · 2013年12月31日

水莱茵海默氏菌 (Rheinheimera aquimaris) 淬灭细菌群体感应的机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

用于可见光光谱重建的窄谱LED发光机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

短小芽孢杆菌TUBP1抗棉花黄萎病菌活性成分及作用机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

气候变化对疟疾传播影响的数学建模研究

国家自然科学基金

0+阅读 · 2011年12月31日

PAEs致雄性生殖内分泌毒性传代效应的表观遗传机制

国家自然科学基金

0+阅读 · 2011年12月31日

宽叶荨麻抗类风湿关节炎物质基础及作用机理的研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员