学习鼓励信息获取:正确排序规则符合主要代理商模式</s> (Learning to Incentivize Information Acquisition: Proper Scoring Rules Meet Principal-Agent Model) - 专知论文

会员服务 ·

0

INFORMS · Agent · 得分 · MoDELS · HER ·

2023 年 3 月 15 日

Learning to Incentivize Information Acquisition: Proper Scoring Rules Meet Principal-Agent Model

翻译：学习鼓励信息获取:正确排序规则符合主要代理商模式

Siyu Chen,Jibang Wu,Yifan Wu,Zhuoran Yang

from arxiv, 34 pages, Optimal information acquisition via proper scoring rule

We study the incentivized information acquisition problem, where a principal hires an agent to gather information on her behalf. Such a problem is modeled as a Stackelberg game between the principal and the agent, where the principal announces a scoring rule that specifies the payment, and then the agent then chooses an effort level that maximizes her own profit and reports the information. We study the online setting of such a problem from the principal's perspective, i.e., designing the optimal scoring rule by repeatedly interacting with the strategic agent. We design a provably sample efficient algorithm that tailors the UCB algorithm (Auer et al., 2002) to our model, which achieves a sublinear $T^{2/3}$-regret after $T$ iterations. Our algorithm features a delicate estimation procedure for the optimal profit of the principal, and a conservative correction scheme that ensures the desired agent's actions are incentivized. Furthermore, a key feature of our regret bound is that it is independent of the number of states of the environment.

翻译：我们研究有激励的信息获取问题,即委托人雇用一名代理人代表她收集信息。这样一个问题以委托人和代理人之间的Stackelberg游戏为模范,由委托人宣布一个具体规定付款的评分规则,然后代理人选择一个使自己利润最大化的努力水平,并报告信息。我们从委托人的角度研究这一问题的在线设置,即通过与战略代理人反复互动来设计最佳评分规则。我们设计了一个精巧的抽样有效算法,将UCB算法(Auer等人,2002年)与我们的模型进行裁剪裁剪,该算法将达到亚线值$T<unk> 2/3美元($+2/3)-gret)在美元重复后达到一个亚线值。我们的算法为本金的最佳利润设定了一个微妙的估计程序,并且有一个保守的纠正计划,确保委托人的行动受到激励。此外,我们遗憾的关键特征是,它独立于环境状况的数量。</s>

0

相关内容

INFORMS

《计算机信息》杂志发表高质量的论文，扩大了运筹学和计算的范围，寻求有关理论、方法、实验、系统和应用方面的原创研究论文、新颖的调查和教程论文，以及描述新的和有用的软件工具的论文。官网链接：https://pubsonline.informs.org/journal/ijoc

宾夕法尼亚大学最新《不确定性估计》课程笔记，134页pdf，附Slides

宾夕法尼亚大学最新《不确定性估计》课程笔记，134页pdf，附Slides

专知会员服务

49+阅读 · 2022年11月13日

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

162+阅读 · 2020年1月16日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Ginzburg-Landau涡旋现象中的非线性椭圆问题

国家自然科学基金

0+阅读 · 2015年12月31日

基于时滞/时滞导数二维分解的时滞系统分析与设计

国家自然科学基金

0+阅读 · 2013年12月31日

波导耦合波理论反演问题的迭代求解方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

知识与生态关联视角下的城市新区空间发展研究：以珠三角区域4个战略性新区为例

国家自然科学基金

0+阅读 · 2013年12月31日

Schrodinger-Poisson方程的若干问题研究

国家自然科学基金

1+阅读 · 2012年12月31日

纳米钨酸盐异质结光催化剂的合成、性能和机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

函数域中的Vinogradov中值定理

国家自然科学基金

0+阅读 · 2012年12月31日

一种适用于高维问题的Co-kriging代理模型新方法研究

国家自然科学基金

0+阅读 · 2009年12月31日

广义Fermat猜想与相关的丢番图方程

国家自然科学基金

1+阅读 · 2009年12月31日

非线性不连续系统的稳定与镇定

国家自然科学基金

0+阅读 · 2008年12月31日

Energy-Efficient URLLC Service Provision via a Near-Space Information Network

Arxiv

0+阅读 · 2023年5月5日

Spatial State-Action Features for General Games

Arxiv

0+阅读 · 2023年5月4日

Credibility of high $R^2$ in regression problems: a permutation approach

Arxiv

0+阅读 · 2023年5月4日

A Rigorous Information-Theoretic Definition of Redundancy and Relevancy in Feature Selection Based on (Partial) Information Decomposition

Arxiv

0+阅读 · 2023年5月4日

A framework for the emergence and analysis of language in social learning agents

Arxiv

0+阅读 · 2023年5月4日

Understanding the Spectral Bias of Coordinate Based MLPs Via Training Dynamics

Arxiv

0+阅读 · 2023年5月4日

Decentralised Active Perception in Continuous Action Spaces for the Coordinated Escort Problem

Arxiv

0+阅读 · 2023年5月3日

Learning to Propagate for Graph Meta-Learning

Arxiv

14+阅读 · 2019年9月11日

How to train your MAML

Arxiv

26+阅读 · 2019年3月5日

Multimodal Machine Learning: A Survey and Taxonomy

Arxiv

151+阅读 · 2017年8月1日

VIP会员

文章信息

相关主题

相关VIP内容

宾夕法尼亚大学最新《不确定性估计》课程笔记，134页pdf，附Slides

宾夕法尼亚大学最新《不确定性估计》课程笔记，134页pdf，附Slides

专知会员服务

49+阅读 · 2022年11月13日

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

162+阅读 · 2020年1月16日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

视觉-语言-动作模型解析：从模块构成到里程碑与挑战

《解析陆域作战方向：一个概念性框架》报告

【博士论文】基于多模态基础模型的上下文学习

追寻真正的AI自主性：从遗留思维到战场优势

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

相关论文

Energy-Efficient URLLC Service Provision via a Near-Space Information Network

Arxiv

0+阅读 · 2023年5月5日

Spatial State-Action Features for General Games

Arxiv

0+阅读 · 2023年5月4日

Credibility of high $R^2$ in regression problems: a permutation approach

Arxiv

0+阅读 · 2023年5月4日

A Rigorous Information-Theoretic Definition of Redundancy and Relevancy in Feature Selection Based on (Partial) Information Decomposition

Arxiv

0+阅读 · 2023年5月4日

A framework for the emergence and analysis of language in social learning agents

Arxiv

0+阅读 · 2023年5月4日

Understanding the Spectral Bias of Coordinate Based MLPs Via Training Dynamics

Arxiv

0+阅读 · 2023年5月4日

Decentralised Active Perception in Continuous Action Spaces for the Coordinated Escort Problem

Arxiv

0+阅读 · 2023年5月3日

Learning to Propagate for Graph Meta-Learning

Arxiv

14+阅读 · 2019年9月11日

How to train your MAML

Arxiv

26+阅读 · 2019年3月5日

Multimodal Machine Learning: A Survey and Taxonomy

Arxiv

151+阅读 · 2017年8月1日

相关基金

Ginzburg-Landau涡旋现象中的非线性椭圆问题

国家自然科学基金

0+阅读 · 2015年12月31日

基于时滞/时滞导数二维分解的时滞系统分析与设计

国家自然科学基金

0+阅读 · 2013年12月31日

波导耦合波理论反演问题的迭代求解方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

知识与生态关联视角下的城市新区空间发展研究：以珠三角区域4个战略性新区为例

国家自然科学基金

0+阅读 · 2013年12月31日

Schrodinger-Poisson方程的若干问题研究

国家自然科学基金

1+阅读 · 2012年12月31日

纳米钨酸盐异质结光催化剂的合成、性能和机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

函数域中的Vinogradov中值定理

国家自然科学基金

0+阅读 · 2012年12月31日

一种适用于高维问题的Co-kriging代理模型新方法研究

国家自然科学基金

0+阅读 · 2009年12月31日

广义Fermat猜想与相关的丢番图方程

国家自然科学基金

1+阅读 · 2009年12月31日

非线性不连续系统的稳定与镇定

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员