May 3, 2021

Arena-Independent Finite-Memory Determinacy in Stochastic Games

Patricia Bouyer,Youssouf Oualhadj,Mickael Randour,Pierre Vandenhove

from arxiv, 38 pages, 4 figures

We study stochastic zero-sum games on graphs, a prevalent tool for modeling decision-making in the presence of an antagonistic opponent in a random environment. In this setting, an important question is that of strategy complexity: what kinds of strategies are sufficient or required to play optimally (e.g., randomization or memory requirements)? Our contributions further the understanding of arena-independent finite-memory (AIFM) determinacy, i.e., the study of objectives for which memory is needed, but in a way that only depends on limited parameters of the game graphs. First, we show that objectives for which pure AIFM strategies suffice to play optimally also admit pure AIFM subgame perfect strategies. Second, we show that we can reduce the study of objectives for which pure AIFM strategies suffice in two-player stochastic games to the easier study of one-player stochastic games (i.e., Markov decision processes). Third, we characterize the sufficiency of AIFM strategies through two intuitive properties of objectives. This work extends a line of research started on deterministic games in [BLO+20] to stochastic ones.

[BLO+20] Patricia Bouyer, Stéphane Le Roux, Youssouf Oualhadj, Mickael Randour, and Pierre Vandenhove. Games Where You Can Play Optimally with Arena-Independent Finite Memory. CONCUR 2020.
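To make the notion of arena-independent finite memory more concrete, here is a minimal Python sketch. It is not taken from the paper: the class `MemorySkeleton`, the helper `choose`, the colors `"a"`, `"b"`, `"x"`, and the arena state `"s"` are all hypothetical names chosen for illustration. The sketch assumes the usual reading of the idea: the memory is a deterministic automaton that reads only the colors seen along the play, so the same automaton can be reused on any arena, and a pure strategy picks an action from the current arena state and the current memory state.

```python
from dataclasses import dataclass
from typing import Dict, Hashable, Iterable, Tuple

Color = str
MemState = Hashable


@dataclass
class MemorySkeleton:
    """Deterministic automaton over colors (hypothetical illustration).

    It knows nothing about any particular arena: it only reads colors.
    """
    initial: MemState
    update: Dict[Tuple[MemState, Color], MemState]

    def run(self, colors: Iterable[Color]) -> MemState:
        # Fold the update function over the sequence of colors seen so far.
        m = self.initial
        for c in colors:
            m = self.update[(m, c)]
        return m


def seen_ab_skeleton() -> MemorySkeleton:
    """Memory that remembers which of the colors 'a' and 'b' were seen."""
    states = [frozenset(), frozenset("a"), frozenset("b"), frozenset("ab")]
    update = {}
    for m in states:
        for c in ("a", "b", "x"):
            # Only 'a' and 'b' change the memory; 'x' is ignored.
            update[(m, c)] = m | {c} if c in ("a", "b") else m
    return MemorySkeleton(initial=frozenset(), update=update)


def choose(strategy, skeleton, arena_state, history_colors):
    """Pure finite-memory strategy: action depends on (state, memory) only."""
    return strategy[(arena_state, skeleton.run(history_colors))]


skeleton = seen_ab_skeleton()

# A hypothetical strategy table on a one-state arena "s".
strategy = {
    ("s", frozenset()): "go_a",
    ("s", frozenset({"a"})): "go_b",
    ("s", frozenset({"b"})): "go_a",
    ("s", frozenset({"a", "b"})): "stay",
}

action = choose(strategy, skeleton, "s", ["x", "a"])
```

In this sketch the memory has four states regardless of how large the arena is; only the strategy table, which maps pairs of arena state and memory state to actions, would change from one arena to another.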
