为放松最佳反应和相互一致性而放松最佳反应和相互一致性的周界合理性:部分自我参考的信息理论模型 (Bounded rationality for relaxing best response and mutual consistency: An information-theoretic model of partial self-reference) - 专知论文

会员服务 ·

0

变分自由能 · Processing（编程语言） · MoDELS · INFORMS · 自由能 ·

2021 年 6 月 30 日

Bounded rationality for relaxing best response and mutual consistency: An information-theoretic model of partial self-reference

翻译：为放松最佳反应和相互一致性而放松最佳反应和相互一致性的周界合理性:部分自我参考的信息理论模型

Benjamin Patrick Evans,Mikhail Prokopenko

from arxiv, 35 pages, 15 figures

While game theory has been transformative for decision-making, the assumptions made can be overly restrictive in certain instances. In this work, we focus on some of the assumptions underlying rationality such as mutual consistency and best-response, and consider ways to relax these assumptions using concepts from level-$k$ reasoning and quantal response equilibrium (QRE) respectively. Specifically, we provide an information-theoretic two-parameter model that can relax both mutual consistency and best-response, but can recover approximations of level-$k$, QRE, or typical Nash equilibrium behaviour in the limiting cases. The proposed approach is based on a recursive form of the variational free energy principle, representing self-referential games as (pseudo) sequential decisions. Bounds in player processing abilities are captured as information costs, where future chains of reasoning are discounted, implying a hierarchy of players where lower-level players have fewer processing resources.

翻译：虽然游戏理论对决策具有变革性,但在某些情况下,所作的假设可能过于限制性。在这项工作中,我们侧重于一些理性基础的假设,如相互一致和最佳反应,并考虑如何分别利用从1千元推理和四舍五入反应平衡(QRE)的概念来放松这些假设。具体地说,我们提供了一个信息理论双参数模型,既可以放松相互一致性,也可以最佳反应,但在有限情况下可以恢复1千元、QRE或典型的纳什平衡行为的近似。提议的方法基于变异自由能源原则的累进形式,代表(假想)顺序决定的自我偏好游戏。玩家处理能力被记录为信息成本,而未来的推理链被打折扣,意味着低级参与者的处理资源较少。

0

相关内容

变分自由能

变分自由能

因果推断，Causal Inference：The Mixtape

因果推断，Causal Inference：The Mixtape

专知会员服务

109+阅读 · 2021年8月27日

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

不可错过！华盛顿大学最新《生成式模型》课程，附PPT

不可错过！华盛顿大学最新《生成式模型》课程，附PPT

专知会员服务

65+阅读 · 2020年12月11日

【论文推荐】文本摘要简述

【论文推荐】文本摘要简述

专知会员服务

69+阅读 · 2020年7月20日

【微软-ACL2020】TinyMBERT: Multi-Stage Distillation Framework for Massive Multi-lingual NER

【微软-ACL2020】TinyMBERT: Multi-Stage Distillation Framework for Massive Multi-lingual NER

专知会员服务

36+阅读 · 2020年4月14日

【CCL 2019】ATT-第19期：文本生成 |Text Generation: From the Perspective of Interactive Inference （张家俊）

【CCL 2019】ATT-第19期：文本生成 |Text Generation: From the Perspective of Interactive Inference （张家俊）

专知会员服务

43+阅读 · 2019年11月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

计算机 | CCF推荐期刊专刊信息5条

计算机 | CCF推荐期刊专刊信息5条

Call4Papers

3+阅读 · 2019年4月10日

Call for Participation: Shared Tasks in NLPCC 2019

Call for Participation: Shared Tasks in NLPCC 2019

中国计算机学会

5+阅读 · 2019年3月22日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

CCF C类 | IJCNN 2019 Special Section : 信息论与深度学习

CCF C类 | IJCNN 2019 Special Section : 信息论与深度学习

Call4Papers

5+阅读 · 2018年12月7日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

On the Existence of the Augustin Mean

On the Existence of the Augustin Mean

Arxiv

0+阅读 · 2021年9月1日

Bayesian Inference of Globular Cluster Properties Using Distribution Functions

Arxiv

0+阅读 · 2021年8月30日

Policy Implications of Statistical Estimates: A General Bayesian Decision-Theoretic Model for Binary Outcomes

Arxiv

0+阅读 · 2021年8月29日

Partial Domain Adaptation without Domain Alignment

Arxiv

1+阅读 · 2021年8月29日

Information Design in Non-atomic Routing Games with Partial Participation: Computation and Properties

Arxiv

0+阅读 · 2021年8月29日

Learning Energy-Based Approximate Inference Networks for Structured Applications in NLP

Arxiv

0+阅读 · 2021年8月27日

Limit theorems for dependent combinatorial data, with applications in statistical inference

Limit theorems for dependent combinatorial data, with applications in statistical inference

Arxiv

0+阅读 · 2021年8月27日

Provable Tensor-Train Format Tensor Completion by Riemannian Optimization

Provable Tensor-Train Format Tensor Completion by Riemannian Optimization

Arxiv

0+阅读 · 2021年8月27日

Lifelong Infinite Mixture Model Based on Knowledge-Driven Dirichlet Process

Arxiv

0+阅读 · 2021年8月25日

Enriching Pre-trained Language Model with Entity Information for Relation Classification

Arxiv

5+阅读 · 2019年5月20日

VIP会员

文章信息

相关主题

变分自由能

Processing（编程语言）

相关VIP内容

因果推断，Causal Inference：The Mixtape

因果推断，Causal Inference：The Mixtape

专知会员服务

109+阅读 · 2021年8月27日

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

不可错过！华盛顿大学最新《生成式模型》课程，附PPT

不可错过！华盛顿大学最新《生成式模型》课程，附PPT

专知会员服务

65+阅读 · 2020年12月11日

【论文推荐】文本摘要简述

【论文推荐】文本摘要简述

专知会员服务

69+阅读 · 2020年7月20日

【微软-ACL2020】TinyMBERT: Multi-Stage Distillation Framework for Massive Multi-lingual NER

【微软-ACL2020】TinyMBERT: Multi-Stage Distillation Framework for Massive Multi-lingual NER

专知会员服务

36+阅读 · 2020年4月14日

【CCL 2019】ATT-第19期：文本生成 |Text Generation: From the Perspective of Interactive Inference （张家俊）

【CCL 2019】ATT-第19期：文本生成 |Text Generation: From the Perspective of Interactive Inference （张家俊）

专知会员服务

43+阅读 · 2019年11月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

Deep Research（深度研究）：系统性综述

《革新战术战场空间能力：反无人机系统》报告

【普林斯顿博士论文】用于语音的生成式通用模型

螺旋式开发作为战略资产：美军启示

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

计算机 | CCF推荐期刊专刊信息5条

计算机 | CCF推荐期刊专刊信息5条

Call4Papers

3+阅读 · 2019年4月10日

Call for Participation: Shared Tasks in NLPCC 2019

Call for Participation: Shared Tasks in NLPCC 2019

中国计算机学会

5+阅读 · 2019年3月22日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

CCF C类 | IJCNN 2019 Special Section : 信息论与深度学习

CCF C类 | IJCNN 2019 Special Section : 信息论与深度学习

Call4Papers

5+阅读 · 2018年12月7日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

相关论文

On the Existence of the Augustin Mean

On the Existence of the Augustin Mean

Arxiv

0+阅读 · 2021年9月1日

Bayesian Inference of Globular Cluster Properties Using Distribution Functions

Arxiv

0+阅读 · 2021年8月30日

Policy Implications of Statistical Estimates: A General Bayesian Decision-Theoretic Model for Binary Outcomes

Arxiv

0+阅读 · 2021年8月29日

Partial Domain Adaptation without Domain Alignment

Arxiv

1+阅读 · 2021年8月29日

Information Design in Non-atomic Routing Games with Partial Participation: Computation and Properties

Arxiv

0+阅读 · 2021年8月29日

Learning Energy-Based Approximate Inference Networks for Structured Applications in NLP

Arxiv

0+阅读 · 2021年8月27日

Limit theorems for dependent combinatorial data, with applications in statistical inference

Limit theorems for dependent combinatorial data, with applications in statistical inference

Arxiv

0+阅读 · 2021年8月27日

Provable Tensor-Train Format Tensor Completion by Riemannian Optimization

Provable Tensor-Train Format Tensor Completion by Riemannian Optimization

Arxiv

0+阅读 · 2021年8月27日

Lifelong Infinite Mixture Model Based on Knowledge-Driven Dirichlet Process

Arxiv

0+阅读 · 2021年8月25日

Enriching Pre-trained Language Model with Entity Information for Relation Classification

Arxiv

5+阅读 · 2019年5月20日

微信扫码咨询专知VIP会员