PAC Learnability of Approximate Nash Equilibrium in Bimatrix Games - 专知论文


Nash equilibrium · Predictor/decision function · Approximation · PAC learning theory · Probably approximately correct

October 16, 2021

PAC Learnability of Approximate Nash Equilibrium in Bimatrix Games


Zhijian Duan, Dinghuai Zhang, Wenhan Huang, Yali Du, Yaodong Yang, Jun Wang, Xiaotie Deng

Computing a Nash equilibrium in bimatrix games is PPAD-hard, and many works have focused on approximate solutions. When games are generated from a fixed unknown distribution, learning a Nash predictor via data-driven approaches can be preferable. In this paper, we study the learnability of approximate Nash equilibrium in bimatrix games. We prove that the Lipschitz function class is agnostic Probably Approximately Correct (PAC) learnable with respect to the Nash approximation loss. Additionally, to demonstrate the advantages of learning a Nash predictor, we develop a model that can efficiently approximate solutions for games under the same distribution. We show by experiments that the solutions from our Nash predictor can serve as effective initial points for other Nash solvers.
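To make the notion of a Nash approximation concrete, the sketch below measures how far a strategy pair is from equilibrium in a bimatrix game. It assumes the standard definition of an ε-approximate Nash equilibrium (the largest payoff gain either player can obtain by deviating unilaterally); the exact loss used in the paper may differ, for example in normalization. The function names and the matching-pennies example are illustrative and do not come from the paper.

```python
from typing import Sequence

Matrix = Sequence[Sequence[float]]
Strategy = Sequence[float]


def expected_payoff(M: Matrix, x: Strategy, y: Strategy) -> float:
    """Expected payoff x^T M y when the row player mixes with x and the column player with y."""
    return sum(x[i] * M[i][j] * y[j] for i in range(len(x)) for j in range(len(y)))


def nash_approximation(A: Matrix, B: Matrix, x: Strategy, y: Strategy) -> float:
    """Largest gain either player can get by deviating unilaterally.

    A holds the row player's payoffs and B the column player's.
    A best deviation is always attained at a pure strategy, so it is
    enough to compare against the best row and the best column.
    """
    # Payoff of each pure row against y, and of each pure column against x.
    row_values = [sum(A[i][j] * y[j] for j in range(len(y))) for i in range(len(x))]
    col_values = [sum(x[i] * B[i][j] for i in range(len(x))) for j in range(len(y))]
    row_gain = max(row_values) - expected_payoff(A, x, y)
    col_gain = max(col_values) - expected_payoff(B, x, y)
    return max(row_gain, col_gain)


# Matching pennies with payoffs normalized to [0, 1] (illustrative game).
A = [[1.0, 0.0], [0.0, 1.0]]
B = [[0.0, 1.0], [1.0, 0.0]]
uniform = [0.5, 0.5]
pure = [1.0, 0.0]

eq_gap = nash_approximation(A, B, uniform, uniform)  # uniform mixing is the exact equilibrium
pure_gap = nash_approximation(A, B, pure, pure)      # the column player wants to deviate
```

A value of zero means the pair is an exact equilibrium. Under this assumed definition, a learned predictor would be judged by the expected value of this quantity over games drawn from the distribution.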

