Inverse Game Theory for Stackelberg Games: the Blessing of Bounded Rationality

Optimizing strategic decisions (a.k.a. computing equilibria) is key to the success of many non-cooperative multi-agent applications. However, in many real-world situations we face the exact opposite of this game-theoretic problem: instead of prescribing the equilibrium of a given game, we directly observe the agents' equilibrium behaviors and want to infer the underlying parameters of an unknown game. This research question, known as inverse game theory, has been studied in multiple recent works in the context of Stackelberg games. Unfortunately, existing works report quite negative results, showing both statistical and computational hardness under the assumption that the follower behaves perfectly rationally. Our work relaxes the perfect-rationality assumption to the classic quantal response model, a more realistic behavior model of bounded rationality. Interestingly, we show that the smoothness brought by this bounded rationality model actually leads to provably more efficient learning of the follower's utility parameters in general Stackelberg games. Systematic empirical experiments on synthesized games confirm our theoretical results and further suggest their robustness beyond the strict quantal response model.
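The abstract does not spell out how the quantal response model is parameterized. As a rough illustration only, the sketch below assumes the commonly used logit form, in which the follower picks action j with probability proportional to exp(λ · u_j), where u_j is the follower's expected utility against the leader's mixed strategy. The function names, the matrix layout of `follower_utility`, and the `rationality` parameter (λ) are hypothetical choices made for this example, not taken from the paper.

```python
import math


def quantal_response(leader_strategy, follower_utility, rationality=1.0):
    """Follower action probabilities under an assumed logit quantal response.

    leader_strategy: probabilities over the leader's actions (sums to 1).
    follower_utility: follower_utility[i][j] is the follower's payoff when
        the leader plays i and the follower plays j (assumed layout).
    rationality: the lambda parameter; larger values approach best response.
    """
    n_actions = len(follower_utility[0])
    # Expected follower utility of each action against the leader's mixture.
    expected = [
        sum(p * row[j] for p, row in zip(leader_strategy, follower_utility))
        for j in range(n_actions)
    ]
    # Softmax, subtracting the maximum for numerical stability.
    peak = max(expected)
    weights = [math.exp(rationality * (u - peak)) for u in expected]
    total = sum(weights)
    return [w / total for w in weights]


def utility_gaps_from_response(probs, rationality=1.0):
    """Recover u_j - u_0 from response probabilities via log-ratios.

    Under the logit form, log(p_j / p_0) = lambda * (u_j - u_0), so every
    action with nonzero probability reveals its utility gap to action 0.
    """
    return [math.log(p / probs[0]) / rationality for p in probs]


if __name__ == "__main__":
    probs = quantal_response([0.5, 0.5], [[1.0, 0.0], [0.0, 3.0]], rationality=2.0)
    print(probs)
    print(utility_gaps_from_response(probs, rationality=2.0))
```

Under this assumed form, every follower action is played with positive probability, so the observed frequencies carry information about utility differences. A perfectly rational follower would reveal only which action is the best response. This is one intuition for why smoothness can help learning, though the paper's actual argument may differ.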

