长期购买数据:动态数据驱动决策的优化战略 (Buying Data Over Time: Approximately Optimal Strategies for Dynamic Data-Driven Decisions) - 专知论文

会员服务 ·

0

优化器 · 样本 · 近似 · Better · MoDELS ·

2021 年 1 月 18 日

Buying Data Over Time: Approximately Optimal Strategies for Dynamic Data-Driven Decisions

翻译：长期购买数据:动态数据驱动决策的优化战略

Nicole Immorlica,Ian Kash,Brendan Lucier

We consider a model where an agent has a repeated decision to make and wishes to maximize their total payoff. Payoffs are influenced by an action taken by the agent, but also an unknown state of the world that evolves over time. Before choosing an action each round, the agent can purchase noisy samples about the state of the world. The agent has a budget to spend on these samples, and has flexibility in deciding how to spread that budget across rounds. We investigate the problem of choosing a sampling algorithm that optimizes total expected payoff. For example: is it better to buy samples steadily over time, or to buy samples in batches? We solve for the optimal policy, and show that it is a natural instantiation of the latter. Under a more general model that includes per-round fixed costs, we prove that a variation on this batching policy is a 2-approximation.

翻译：我们考虑的是代理商反复决定并希望最大限度地获得全部报酬的模式。报酬受代理商行动的影响, 但也受时间变化的未知世界状态的影响。在选择每轮行动之前,代理商可以购买关于世界状况的噪音样本。代理商有预算可以花在这些样本上, 在决定如何将预算分散到各轮方面有灵活性。我们调查了选择一个能够优化预期总收益的抽样算法的问题。比如: 最好在一段时间内稳步购买样品, 还是分批购买样品? 我们解决最佳政策,并表明这是后者的自然即时。在包括每轮固定成本的更一般模式下,我们证明这种分批政策的变化是2倍一致的。

0

相关内容

优化器

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【2020新书】概率机器学习，附212页pdf与slides

【2020新书】概率机器学习，附212页pdf与slides

专知会员服务

112+阅读 · 2020年11月12日

【经典书】C语言傻瓜式入门（第二版），411页pdf

【经典书】C语言傻瓜式入门（第二版），411页pdf

专知会员服务

54+阅读 · 2020年8月16日

数据科学导论，54页ppt，Introduction to Data Science

数据科学导论，54页ppt，Introduction to Data Science

专知会员服务

42+阅读 · 2020年7月27日

【Google】平滑对抗训练，Smooth Adversarial Training

【Google】平滑对抗训练，Smooth Adversarial Training

专知会员服务

49+阅读 · 2020年7月4日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

【KDD2019|讲座推荐】大时间序列预测的理论与实践：Forecasting Big Time Series: Theory and Practice

【KDD2019|讲座推荐】大时间序列预测的理论与实践：Forecasting Big Time Series: Theory and Practice

专知会员服务

117+阅读 · 2019年12月6日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

NLP 2018 Highlights：2018自然语言处理技术亮点汇总

NLP 2018 Highlights：2018自然语言处理技术亮点汇总

AINLP

10+阅读 · 2019年2月9日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【推荐】卷积神经网络类间不平衡问题系统研究

【推荐】卷积神经网络类间不平衡问题系统研究

机器学习研究会

6+阅读 · 2017年10月18日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

【推荐】SVM实例教程

【推荐】SVM实例教程

机器学习研究会

17+阅读 · 2017年8月26日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

Resolution Limits of 20 Questions Search Strategies for Moving Targets

Arxiv

0+阅读 · 2021年3月15日

Causal Markov Boundaries

Arxiv

0+阅读 · 2021年3月12日

A Resampling Approach For causal Inference On Novel Two-Point Time-Series With Application To Identify Risk Factors For Type-2 Diabetes And Cardiovascular Disease

Arxiv

0+阅读 · 2021年3月12日

Optimal sequential decision making with probabilistic digital twins

Arxiv

0+阅读 · 2021年3月12日

An Efficient Hypergraph Approach to Robust Point Cloud Resampling

Arxiv

0+阅读 · 2021年3月11日

Optimal Program Synthesis Over Noisy Data

Arxiv

0+阅读 · 2021年3月11日

Centralized Cooperation for Connected and Automated Vehicles at Intersections by Proximal Policy Optimization

Arxiv

0+阅读 · 2021年3月11日

Differential Dynamic Programming Neural Optimizer

Arxiv

7+阅读 · 2020年6月29日

PPO-CMA: Proximal Policy Optimization with Covariance Matrix Adaptation

PPO-CMA: Proximal Policy Optimization with Covariance Matrix Adaptation

Arxiv

8+阅读 · 2018年12月18日

The Search Problem in Mixture Models

Arxiv

3+阅读 · 2018年2月24日

VIP会员

文章信息

相关主题

相关VIP内容

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【2020新书】概率机器学习，附212页pdf与slides

【2020新书】概率机器学习，附212页pdf与slides

专知会员服务

112+阅读 · 2020年11月12日

【经典书】C语言傻瓜式入门（第二版），411页pdf

【经典书】C语言傻瓜式入门（第二版），411页pdf

专知会员服务

54+阅读 · 2020年8月16日

数据科学导论，54页ppt，Introduction to Data Science

数据科学导论，54页ppt，Introduction to Data Science

专知会员服务

42+阅读 · 2020年7月27日

【Google】平滑对抗训练，Smooth Adversarial Training

【Google】平滑对抗训练，Smooth Adversarial Training

专知会员服务

49+阅读 · 2020年7月4日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

【KDD2019|讲座推荐】大时间序列预测的理论与实践：Forecasting Big Time Series: Theory and Practice

【KDD2019|讲座推荐】大时间序列预测的理论与实践：Forecasting Big Time Series: Theory and Practice

专知会员服务

117+阅读 · 2019年12月6日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

机器人领域中最佳的三维场景表示是什么？——从几何表示到基础模型

《多域作战兵棋推演：运用形态学分析与人工智能加强国防人员训练》

【博士论文】快速高效的归一化流及其在图像生成模型中的应用

仿生机器人技术的军事应用

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

NLP 2018 Highlights：2018自然语言处理技术亮点汇总

NLP 2018 Highlights：2018自然语言处理技术亮点汇总

AINLP

10+阅读 · 2019年2月9日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【推荐】卷积神经网络类间不平衡问题系统研究

【推荐】卷积神经网络类间不平衡问题系统研究

机器学习研究会

6+阅读 · 2017年10月18日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

【推荐】SVM实例教程

【推荐】SVM实例教程

机器学习研究会

17+阅读 · 2017年8月26日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

相关论文

Resolution Limits of 20 Questions Search Strategies for Moving Targets

Arxiv

0+阅读 · 2021年3月15日

Causal Markov Boundaries

Arxiv

0+阅读 · 2021年3月12日

A Resampling Approach For causal Inference On Novel Two-Point Time-Series With Application To Identify Risk Factors For Type-2 Diabetes And Cardiovascular Disease

Arxiv

0+阅读 · 2021年3月12日

Optimal sequential decision making with probabilistic digital twins

Arxiv

0+阅读 · 2021年3月12日

An Efficient Hypergraph Approach to Robust Point Cloud Resampling

Arxiv

0+阅读 · 2021年3月11日

Optimal Program Synthesis Over Noisy Data

Arxiv

0+阅读 · 2021年3月11日

Centralized Cooperation for Connected and Automated Vehicles at Intersections by Proximal Policy Optimization

Arxiv

0+阅读 · 2021年3月11日

Differential Dynamic Programming Neural Optimizer

Arxiv

7+阅读 · 2020年6月29日

PPO-CMA: Proximal Policy Optimization with Covariance Matrix Adaptation

PPO-CMA: Proximal Policy Optimization with Covariance Matrix Adaptation

Arxiv

8+阅读 · 2018年12月18日

The Search Problem in Mixture Models

Arxiv

3+阅读 · 2018年2月24日

微信扫码咨询专知VIP会员