2021 年 7 月 17 日

BONUS! Maximizing Surprise

Zhihuan Huang,Yuqing Kong,Tracy Xiao Liu,Grant Schoenebeck,Shengwei Xu

Multi-round competitions often double or triple the points awarded in the final round, calling it a bonus, to maximize spectators' excitement. In a two-player competition with $n$ rounds, we aim to derive the optimal bonus size that maximizes the audience's overall expected surprise (as defined in [7]). We model the audience's prior belief over the two players' ability levels as a beta distribution. Using a novel analysis that clarifies and simplifies the computation, we find that the optimal bonus depends greatly on the prior belief, and we obtain solutions of various forms for both the case of a finite number of rounds and the asymptotic case. In an interesting special case, we show that the optimal bonus approximately and asymptotically equals the "expected lead", the number of points the weaker player needs, in expectation, to come back. Moreover, we observe that priors with higher skewness lead to a larger optimal bonus, and in the symmetric case, priors with higher uncertainty also lead to a larger optimal bonus. This matches our intuition, since a highly asymmetric prior leads to a high "expected lead", and a highly uncertain symmetric prior often leads to a lopsided game, which again benefits from a larger bonus.
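To make the setup in the abstract concrete, here is a minimal Python sketch that computes the expected surprise for a given bonus by dynamic programming. It rests on several assumptions that the abstract does not spell out: surprise is taken to be the sum over rounds of the absolute change in the audience's belief that player A wins the match; each round is won by A with an unknown probability drawn from a Beta(a, b) prior; the first n - 1 rounds are worth one point and the final round is worth `bonus` points; and a tied match counts as a belief of 0.5. The names `expected_surprise` and `best_bonus` are hypothetical and are not taken from the paper.

```python
def expected_surprise(n, bonus, a, b):
    """Expected total |belief change| over an n-round match (assumed model).

    State (t, k): A has won k of the first t rounds. Under a Beta(a, b) prior,
    the chance that A wins the next round is (a + k) / (a + b + t).
    """
    def outcome(diff):
        # Final belief from the point difference; a tie counts as 0.5 (assumption).
        return 1.0 if diff > 0 else (0.5 if diff == 0 else 0.0)

    last = n - 1  # number of ordinary one-point rounds before the bonus round
    belief, value = [], []
    for k in range(last + 1):
        p = (a + k) / (a + b + last)
        win = outcome(2 * k - last + bonus)   # A takes the bonus round
        lose = outcome(2 * k - last - bonus)  # B takes the bonus round
        bel = p * win + (1 - p) * lose
        belief.append(bel)
        value.append(p * abs(win - bel) + (1 - p) * abs(lose - bel))

    # Walk backwards through the ordinary rounds.
    for t in range(last - 1, -1, -1):
        new_belief, new_value = [], []
        for k in range(t + 1):
            p = (a + k) / (a + b + t)
            bel = p * belief[k + 1] + (1 - p) * belief[k]
            val = (p * (abs(belief[k + 1] - bel) + value[k + 1])
                   + (1 - p) * (abs(belief[k] - bel) + value[k]))
            new_belief.append(bel)
            new_value.append(val)
        belief, value = new_belief, new_value
    return value[0]


def best_bonus(n, a, b, candidates=None):
    """Return the candidate bonus with the largest expected surprise."""
    if candidates is None:
        candidates = range(1, n + 1)  # integer bonuses only, for illustration
    return max(candidates, key=lambda x: expected_surprise(n, x, a, b))
```

Calling `best_bonus(n, a, b)` for different prior parameters is one way to explore how the best bonus in this toy model shifts with the skewness and uncertainty of the prior. The sketch searches only integer bonus sizes and is not the paper's analysis.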
