Game theory has been increasingly applied in settings where the game is not known outright but must instead be estimated by sampling. For example, meta-games that arise in multi-agent evaluation can only be accessed by running a succession of expensive experiments that may involve the simultaneous deployment of several agents. In this paper, we focus on $\alpha$-rank, a popular game-theoretic solution concept designed to perform well in such scenarios. We aim to estimate the $\alpha$-rank of the game using as few samples as possible. Our algorithm maximizes the information gain between an epistemic belief over the $\alpha$-ranks and the observed payoffs. This approach has two main benefits. First, it allows us to focus our sampling on the entries that matter the most for identifying the $\alpha$-rank. Second, the Bayesian formulation makes it possible to incorporate modeling assumptions through a prior over game payoffs. We show the benefits of using information gain as compared to the confidence interval criterion of ResponseGraphUCB (Rowland et al., 2019), and provide theoretical results justifying our method.
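As a schematic illustration of the selection rule described above, the sampled entry can be viewed as the one maximizing the expected reduction in uncertainty about the ranking; the notation below (a belief $p(\pi)$ over $\alpha$-ranks, a candidate strategy-profile entry $(i,j)$, and a payoff observation $y_{ij}$) is illustrative rather than taken verbatim from the paper.

% Schematic information-gain criterion (notation illustrative):
% choose the entry whose payoff observation is expected to most
% reduce the entropy of the belief over alpha-ranks.
\begin{equation*}
  (i, j)^{\star} \in \operatorname*{arg\,max}_{(i, j)} \; I\big(\pi \,;\, y_{ij}\big)
  \;=\; \operatorname*{arg\,max}_{(i, j)}
    \Big( H\big[p(\pi)\big]
      - \mathbb{E}_{y_{ij} \sim p(y_{ij})} \, H\big[p(\pi \mid y_{ij})\big] \Big).
\end{equation*}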