零和神经 -- -- 心理同步运动会战略综述 (Strategy Synthesis for Zero-sum Neuro-symbolic Concurrent Stochastic Games) - 专知论文

会员服务 ·

0

值迭代 · 状态空间 · 回合 · Neural Networks · Agent ·

2022 年 7 月 13 日

Strategy Synthesis for Zero-sum Neuro-symbolic Concurrent Stochastic Games

翻译：零和神经 -- -- 心理同步运动会战略综述

Rui Yan,Gabriel Santos,Gethin Norman,David Parker,Marta Kwiatkowska

from arxiv, 31 pages, 4 figures

Neuro-symbolic approaches to artificial intelligence, which combine neural networks with classical symbolic techniques, are growing in prominence, necessitating formal approaches to reason about their correctness. We propose a novel modelling formalism called neuro-symbolic concurrent stochastic games (NS-CSGs), which comprise probabilistic finite-state agents interacting in a shared continuous-state environment, observed through perception mechanisms implemented as neural networks. Since the environment state space is continuous, we focus on the class of NS-CSGs with Borel state spaces. We consider the problem of zero-sum discounted cumulative rewards and prove the existence of the value of NS-CSGs under Borel measurability and piecewise-constant restrictions on the components of the model. From an algorithmic perspective, existing methods to compute values and optimal strategies for CSGs focus on finite state spaces. We present, for the first time, implementable value iteration and policy iteration algorithms to solve a class of uncountable state space CSGs, namely NS-CSGs, and prove their convergence. Our approach works by exploiting the underlying game structures and then formulating piecewise linear or constant representations of the value functions and strategies of NS-CSGs. We illustrate our approach by applying a prototype implementation of value iteration to a dynamic vehicle parking case study.

翻译：将神经网络与古典象征性技术相结合的人工智能神经 -- -- 共振性方法日益突出,需要正式解释其正确性。我们建议采用新型的模型形式主义,称为神经-共振同时随机游戏(NS-CSGs),其中包括在共同的连续状态环境中互动的概率性限定国家代理人,通过神经网络的感知机制加以观察。由于环境状态空间是持续的,我们侧重于NS-CSG和波雷尔州空间的等级。我们考虑零和折扣累积奖励的问题,并证明在Borel的可计量性和对模型组成部分的零一致限制下NS-CSG的价值存在。从算法的角度,现有用于计算CSG的价值观和最佳战略以有限的国家空间为重点的方法。我们首次展示了可执行价值的循环和政策循环算法,以解决无法计算的国家空间(NS-CSGs)的类别,并证明在Borel中存在NS-CSGs的累积报酬问题,并证明在Borel的可计量性和对模型组成部分的细微一致性限制。我们的方法通过利用游戏结构的固定的模型模型模型研究,展示了我们对NCs-Csal-Csal-Csalview 的定位的定位模式的模型进行研究,从而展示了我们如何研究。

0

相关内容

值迭代

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

专知会员服务

104+阅读 · 2022年2月10日

神经常微分方程教程，50页ppt，A brief tutorial on Neural ODEs

神经常微分方程教程，50页ppt，A brief tutorial on Neural ODEs

专知会员服务

74+阅读 · 2020年8月2日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

【课程】Andrew Ng与Google Brain团队联合出品《TensorFlow in Practice 》

【课程】Andrew Ng与Google Brain团队联合出品《TensorFlow in Practice 》

专知会员服务

13+阅读 · 2019年10月29日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

高斯序列与过程的极值理论

国家自然科学基金

2+阅读 · 2015年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

可压缩Navier-Stokes方程和Boltzmann方程解的渐近行为

国家自然科学基金

0+阅读 · 2013年12月31日

石斑鱼虹彩病毒编码的miR-homoHSV在病毒诱导程序性细胞死亡中的作用

国家自然科学基金

0+阅读 · 2012年12月31日

白藜芦醇调节STIM1抑制血管平滑肌细胞增殖机制的探讨

国家自然科学基金

0+阅读 · 2012年12月31日

函数域中的Vinogradov中值定理

国家自然科学基金

0+阅读 · 2012年12月31日

认知无线网络中基于压缩感知的自适应非重构宽带频谱感知方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

Cystatin B缺失与Prion疾病自噬作用机制的研究

国家自然科学基金

0+阅读 · 2011年12月31日

神经元凋亡时Egr1对BH3-only蛋白Bim的转录调控

国家自然科学基金

0+阅读 · 2009年12月31日

SDF-1/CXCR4信号通路的干预及调节关节软骨退变的研究

国家自然科学基金

0+阅读 · 2008年12月31日

What does a platypus look like? Generating customized prompts for zero-shot image classification

Arxiv

0+阅读 · 2022年9月7日

Continual Depth-limited Responses for Computing Counter-strategies in Sequential Games

Arxiv

0+阅读 · 2022年9月7日

Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow

Arxiv

0+阅读 · 2022年9月7日

Weak Collocation Regression method: fast reveal hidden stochastic dynamics from high-dimensional aggregate data

Weak Collocation Regression method: fast reveal hidden stochastic dynamics from high-dimensional aggregate data

Arxiv

0+阅读 · 2022年9月6日

Adaptive Visual Servo Control for Autonomous Robots

Arxiv

0+阅读 · 2022年9月5日

Deep importance sampling using tensor-trains with application to a priori and a posteriori rare event estimation

Arxiv

0+阅读 · 2022年9月5日

Quantum Analysis of Continuous Time Stochastic Process

Arxiv

0+阅读 · 2022年9月5日

Learning Stochastic Dynamics with Statistics-Informed Neural Network

Arxiv

0+阅读 · 2022年9月2日

Minimal Variance Sampling with Provable Guarantees for Fast Training of Graph Neural Networks

Minimal Variance Sampling with Provable Guarantees for Fast Training of Graph Neural Networks

Arxiv

13+阅读 · 2020年6月24日

Differentiable Dynamic Programming for Structured Prediction and Attention

Arxiv

56+阅读 · 2018年2月20日

VIP会员

文章信息

相关主题

Neural Networks

相关VIP内容

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

专知会员服务

104+阅读 · 2022年2月10日

神经常微分方程教程，50页ppt，A brief tutorial on Neural ODEs

神经常微分方程教程，50页ppt，A brief tutorial on Neural ODEs

专知会员服务

74+阅读 · 2020年8月2日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

【课程】Andrew Ng与Google Brain团队联合出品《TensorFlow in Practice 》

【课程】Andrew Ng与Google Brain团队联合出品《TensorFlow in Practice 》

专知会员服务

13+阅读 · 2019年10月29日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

最新《扩散模型原理》新书，470页pdf

无人机作战：演进、创新与未来战场

AI 智能体简史

多模态空间推理在大模型时代：综述与基准测试

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

What does a platypus look like? Generating customized prompts for zero-shot image classification

Arxiv

0+阅读 · 2022年9月7日

Continual Depth-limited Responses for Computing Counter-strategies in Sequential Games

Arxiv

0+阅读 · 2022年9月7日

Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow

Arxiv

0+阅读 · 2022年9月7日

Weak Collocation Regression method: fast reveal hidden stochastic dynamics from high-dimensional aggregate data

Weak Collocation Regression method: fast reveal hidden stochastic dynamics from high-dimensional aggregate data

Arxiv

0+阅读 · 2022年9月6日

Adaptive Visual Servo Control for Autonomous Robots

Arxiv

0+阅读 · 2022年9月5日

Deep importance sampling using tensor-trains with application to a priori and a posteriori rare event estimation

Arxiv

0+阅读 · 2022年9月5日

Quantum Analysis of Continuous Time Stochastic Process

Arxiv

0+阅读 · 2022年9月5日

Learning Stochastic Dynamics with Statistics-Informed Neural Network

Arxiv

0+阅读 · 2022年9月2日

Minimal Variance Sampling with Provable Guarantees for Fast Training of Graph Neural Networks

Minimal Variance Sampling with Provable Guarantees for Fast Training of Graph Neural Networks

Arxiv

13+阅读 · 2020年6月24日

Differentiable Dynamic Programming for Structured Prediction and Attention

Arxiv

56+阅读 · 2018年2月20日

相关基金

高斯序列与过程的极值理论

国家自然科学基金

2+阅读 · 2015年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

可压缩Navier-Stokes方程和Boltzmann方程解的渐近行为

国家自然科学基金

0+阅读 · 2013年12月31日

石斑鱼虹彩病毒编码的miR-homoHSV在病毒诱导程序性细胞死亡中的作用

国家自然科学基金

0+阅读 · 2012年12月31日

白藜芦醇调节STIM1抑制血管平滑肌细胞增殖机制的探讨

国家自然科学基金

0+阅读 · 2012年12月31日

函数域中的Vinogradov中值定理

国家自然科学基金

0+阅读 · 2012年12月31日

认知无线网络中基于压缩感知的自适应非重构宽带频谱感知方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

Cystatin B缺失与Prion疾病自噬作用机制的研究

国家自然科学基金

0+阅读 · 2011年12月31日

神经元凋亡时Egr1对BH3-only蛋白Bim的转录调控

国家自然科学基金

0+阅读 · 2009年12月31日

SDF-1/CXCR4信号通路的干预及调节关节软骨退变的研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员