激励遵守等级工具 (Incentivizing Compliance with Algorithmic Instruments) - 专知论文

会员服务 ·

0

INTERACT · 可辨认的 · 估计/估计量 · 优化器 · 有偏 ·

2021 年 7 月 28 日

Incentivizing Compliance with Algorithmic Instruments

翻译：激励遵守等级工具

Daniel Ngo,Logan Stapleton,Vasilis Syrgkanis,Zhiwei Steven Wu

from arxiv, In Proceedings of the Thirty-eighth International Conference on Machine Learning (ICML 2021), 17 pages of main text, 53 pages total, 3 figures

Randomized experiments can be susceptible to selection bias due to potential non-compliance by the participants. While much of the existing work has studied compliance as a static behavior, we propose a game-theoretic model to study compliance as dynamic behavior that may change over time. In rounds, a social planner interacts with a sequence of heterogeneous agents who arrive with their unobserved private type that determines both their prior preferences across the actions (e.g., control and treatment) and their baseline rewards without taking any treatment. The planner provides each agent with a randomized recommendation that may alter their beliefs and their action selection. We develop a novel recommendation mechanism that views the planner's recommendation as a form of instrumental variable (IV) that only affects an agents' action selection, but not the observed rewards. We construct such IVs by carefully mapping the history -- the interactions between the planner and the previous agents -- to a random recommendation. Even though the initial agents may be completely non-compliant, our mechanism can incentivize compliance over time, thereby enabling the estimation of the treatment effect of each treatment, and minimizing the cumulative regret of the planner whose goal is to identify the optimal treatment.

翻译：由于参与者可能不遵守规定,随机实验可能具有选择偏见。虽然许多现有工作已经将守规视为静态行为,但我们提议了一个游戏理论模型,将守规视为可能随时间变化的动态行为。在回合中,社会规划者会与一组异类代理人进行互动,这些代理人到达时没有观察到的私人类型,这些类型决定了他们先前对各种行动(例如控制和治疗)和基线奖励的偏好,而没有采取任何治疗。规划者向每个代理人提供随机建议,可能改变他们的信仰和行动选择。我们开发了一个新的建议机制,将规划者的建议视为一种工具变量(四)的形式,仅影响代理人的行动选择,而不会影响观察到的回报。我们通过仔细绘制历史图解析,即规划者和前代理人之间的互动关系,来随机提出建议。即使最初的代理人可能完全不遵守规定,但我们的机制可以鼓励遵守,从而能够估计每次治疗的治疗效果,并最大限度地减少计划者为确定最佳待遇而累积的遗憾。

0

相关内容

INTERACT

IFIP TC13 Conference on Human-Computer Interaction是人机交互领域的研究者和实践者展示其工作的重要平台。多年来，这些会议吸引了来自几个国家和文化的研究人员。官网链接：http://interact2019.org/

【NUS-Xavier 教授】图神经网络应用概述，15页ppt

专知会员服务

54+阅读 · 2021年6月30日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

近期必读的 NeurIPS2020 80多篇【图机器学习】相关论文

专知会员服务

54+阅读 · 2020年11月3日

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

54+阅读 · 2020年9月7日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

Successor representations 强化学习表示的生物学启发

Successor representations 强化学习表示的生物学启发

CreateAMind

6+阅读 · 2019年9月5日

意识是一种数学模式

意识是一种数学模式

CreateAMind

3+阅读 · 2019年6月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

动物脑的好奇心和强化学习的好奇心

动物脑的好奇心和强化学习的好奇心

CreateAMind

10+阅读 · 2019年1月26日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

RL 真经

CreateAMind

5+阅读 · 2018年12月28日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

美国化学会 (ACS) 北京代表处招聘

美国化学会 (ACS) 北京代表处招聘

知社学术圈

11+阅读 · 2018年9月4日

分布式TensorFlow入门指南

分布式TensorFlow入门指南

机器学习研究会

4+阅读 · 2017年11月28日

计算机类 | 国际会议信息7条

计算机类 | 国际会议信息7条

Call4Papers

3+阅读 · 2017年11月17日

Beyond Pigouvian Taxes: A Worst Case Analysis

Arxiv

0+阅读 · 2021年9月28日

SightBi: Exploring Cross-View Data Relationships with Biclusters

Arxiv

0+阅读 · 2021年9月28日

Structural Stability of a Family of Group Formation Games

Arxiv

0+阅读 · 2021年9月27日

Randomization-only Inference in Experiments with Interference

Arxiv

0+阅读 · 2021年9月26日

Decentralized Governance of Stablecoins with Option Pricing

Arxiv

0+阅读 · 2021年9月26日

Algorithmic Information Design in Multi-Player Games: Possibility and Limits in Singleton Congestion

Arxiv

0+阅读 · 2021年9月25日

Groups Influence with Minimum Cost in Social Networks

Arxiv

0+阅读 · 2021年9月25日

Treatment Effects in Market Equilibrium

Arxiv

0+阅读 · 2021年9月23日

Bayesian inference of an uncertain generalized diffusion operator

Arxiv

0+阅读 · 2021年9月23日

A fast algorithm with minimax optimal guarantees for topic models with an unknown number of topics

Arxiv

7+阅读 · 2018年6月12日

VIP会员

文章信息

相关主题

估计/估计量

相关VIP内容

【NUS-Xavier 教授】图神经网络应用概述，15页ppt

专知会员服务

54+阅读 · 2021年6月30日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

近期必读的 NeurIPS2020 80多篇【图机器学习】相关论文

专知会员服务

54+阅读 · 2020年11月3日

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

54+阅读 · 2020年9月7日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

【博士论文】多目标奖励与偏好优化：理论与算法

《无形的防御者？将定向能武器集成到反无人机框架的机遇与挑战》报告

自主化海军：海上无人系统与未来海战

迈向智能体系统规模化的科学

相关资讯

Successor representations 强化学习表示的生物学启发

Successor representations 强化学习表示的生物学启发

CreateAMind

6+阅读 · 2019年9月5日

意识是一种数学模式

意识是一种数学模式

CreateAMind

3+阅读 · 2019年6月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

动物脑的好奇心和强化学习的好奇心

动物脑的好奇心和强化学习的好奇心

CreateAMind

10+阅读 · 2019年1月26日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

RL 真经

CreateAMind

5+阅读 · 2018年12月28日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

美国化学会 (ACS) 北京代表处招聘

美国化学会 (ACS) 北京代表处招聘

知社学术圈

11+阅读 · 2018年9月4日

分布式TensorFlow入门指南

分布式TensorFlow入门指南

机器学习研究会

4+阅读 · 2017年11月28日

计算机类 | 国际会议信息7条

计算机类 | 国际会议信息7条

Call4Papers

3+阅读 · 2017年11月17日

相关论文

Beyond Pigouvian Taxes: A Worst Case Analysis

Arxiv

0+阅读 · 2021年9月28日

SightBi: Exploring Cross-View Data Relationships with Biclusters

Arxiv

0+阅读 · 2021年9月28日

Structural Stability of a Family of Group Formation Games

Arxiv

0+阅读 · 2021年9月27日

Randomization-only Inference in Experiments with Interference

Arxiv

0+阅读 · 2021年9月26日

Decentralized Governance of Stablecoins with Option Pricing

Arxiv

0+阅读 · 2021年9月26日

Algorithmic Information Design in Multi-Player Games: Possibility and Limits in Singleton Congestion

Arxiv

0+阅读 · 2021年9月25日

Groups Influence with Minimum Cost in Social Networks

Arxiv

0+阅读 · 2021年9月25日

Treatment Effects in Market Equilibrium

Arxiv

0+阅读 · 2021年9月23日

Bayesian inference of an uncertain generalized diffusion operator

Arxiv

0+阅读 · 2021年9月23日

A fast algorithm with minimax optimal guarantees for topic models with an unknown number of topics

Arxiv

7+阅读 · 2018年6月12日

微信扫码咨询专知VIP会员