AP 联合勘查和日程安排:背景强盗办法 (Joint AP Probing and Scheduling: A Contextual Bandit Approach) - 专知论文

会员服务 ·

0

上下文赌博机/上下文老虎机 · 赌博机/老虎机 · Extensibility · MoDELS · contrastive ·

2021 年 8 月 13 日

Joint AP Probing and Scheduling: A Contextual Bandit Approach

翻译：AP 联合勘查和日程安排:背景强盗办法

Tianyi Xu,Ding Zhang,Parth H. Pathak,Zizhan Zheng

We consider a set of APs with unknown data rates that cooperatively serve a mobile client. The data rate of each link is i.i.d. sampled from a distribution that is unknown a priori. In contrast to traditional link scheduling problems under uncertainty, we assume that in each time step, the device can probe a subset of links before deciding which one to use. We model this problem as a contextual bandit problem with probing (CBwP) and present an efficient algorithm. We further establish the regret of our algorithm for links with Bernoulli data rates. Our CBwP model is a novel extension of the classic contextual bandit model and can potentially be applied to a large class of sequential decision-making problems that involve joint probing and play under uncertainty.

翻译：我们考虑的是一组数据率不明的AP, 这些数据率是合作为移动客户服务的。每个链接的数据率是先验的分布样本。与不确定的传统的链接时间安排问题相反, 我们假设在每一步, 设备可以在决定使用之前先探测一组链接。我们把这个问题作为调查( CBwP) 的背景强盗问题来模型, 并展示一个高效的算法。我们进一步确认了我们对与Bernoulli数据率连接的算法的遗憾。我们的 CBwP模型是经典背景强盗模型的新扩展, 并有可能适用于涉及在不确定情况下联合探测和玩耍的一大批顺序决策问题。

0

相关内容

上下文赌博机/上下文老虎机

上下文赌博机/上下文老虎机

ICML2021接受论文列表出炉！1184篇论文都在这了！

专知会员服务

92+阅读 · 2021年6月3日

【AAAI2021】记忆门控循环网络

【AAAI2021】记忆门控循环网络

专知会员服务

50+阅读 · 2020年12月28日

【Google】梯度下降，48页ppt

【Google】梯度下降，48页ppt

专知会员服务

81+阅读 · 2020年12月5日

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

54+阅读 · 2020年9月7日

【北京大学】Locally Differentially Private (Contextual) Bandits Learning

【北京大学】Locally Differentially Private (Contextual) Bandits Learning

专知会员服务

13+阅读 · 2020年6月8日

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

专知会员服务

112+阅读 · 2020年5月15日

【2020新书】C++20 特性第二版，A Problem-Solution Approach

【2020新书】C++20 特性第二版，A Problem-Solution Approach

专知会员服务

60+阅读 · 2020年4月26日

【ICCV 2019 Toturial】Global Optimization for Geometric Understanding with Provable Guarantees（具有可证明保证的几何理解的全局优化）

【ICCV 2019 Toturial】Global Optimization for Geometric Understanding with Provable Guarantees（具有可证明保证的几何理解的全局优化）

专知会员服务

18+阅读 · 2019年11月1日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

鲁棒机器学习相关文献集

鲁棒机器学习相关文献集

专知

8+阅读 · 2019年8月18日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

carla 学习笔记

carla 学习笔记

CreateAMind

9+阅读 · 2018年2月7日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

Optimal rate of convergence for approximations of SPDEs with non-regular drift

Arxiv

0+阅读 · 2021年10月12日

Corrupted Contextual Bandits with Action Order Constraints

Arxiv

0+阅读 · 2021年10月12日

Self-guided Approximate Linear Programs

Arxiv

0+阅读 · 2021年10月12日

A Burden Shared is a Burden Halved: A Fairness-Adjusted Approach to Binary Classification

Arxiv

0+阅读 · 2021年10月12日

Adaptive Temporal Difference Learning with Linear Function Approximation

Arxiv

0+阅读 · 2021年10月11日

Pareto Optimization for Subset Selection with Dynamic Cost Constraints

Arxiv

0+阅读 · 2021年10月10日

Many Proxy Controls

Many Proxy Controls

Arxiv

0+阅读 · 2021年10月8日

Approximate Post-Selective Inference for Regression with the Group LASSO

Arxiv

0+阅读 · 2021年10月7日

MSc Dissertation: Exclusive Row Biclustering for Gene Expression Using a Combinatorial Auction Approach

MSc Dissertation: Exclusive Row Biclustering for Gene Expression Using a Combinatorial Auction Approach

Arxiv

6+阅读 · 2018年9月13日

Entire Space Multi-Task Model: An Effective Approach for Estimating Post-Click Conversion Rate

Arxiv

7+阅读 · 2018年4月24日

VIP会员

文章信息

相关主题

上下文赌博机/上下文老虎机

赌博机/老虎机

相关VIP内容

ICML2021接受论文列表出炉！1184篇论文都在这了！

专知会员服务

92+阅读 · 2021年6月3日

【AAAI2021】记忆门控循环网络

【AAAI2021】记忆门控循环网络

专知会员服务

50+阅读 · 2020年12月28日

【Google】梯度下降，48页ppt

【Google】梯度下降，48页ppt

专知会员服务

81+阅读 · 2020年12月5日

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

54+阅读 · 2020年9月7日

【北京大学】Locally Differentially Private (Contextual) Bandits Learning

【北京大学】Locally Differentially Private (Contextual) Bandits Learning

专知会员服务

13+阅读 · 2020年6月8日

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

专知会员服务

112+阅读 · 2020年5月15日

【2020新书】C++20 特性第二版，A Problem-Solution Approach

【2020新书】C++20 特性第二版，A Problem-Solution Approach

专知会员服务

60+阅读 · 2020年4月26日

【ICCV 2019 Toturial】Global Optimization for Geometric Understanding with Provable Guarantees（具有可证明保证的几何理解的全局优化）

【ICCV 2019 Toturial】Global Optimization for Geometric Understanding with Provable Guarantees（具有可证明保证的几何理解的全局优化）

专知会员服务

18+阅读 · 2019年11月1日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【ICCV2025教程】基础模型遇见具身智能体

军事机器学习设计：关于开发自动化任务摘要系统的梯次化设计科学研究 | 2025最新93页

扩散模型中的缓存方法综述：迈向高效的多模态生成

【ICCV2025教程】《迈向视觉语言模型的全面推理》

相关资讯

鲁棒机器学习相关文献集

鲁棒机器学习相关文献集

专知

8+阅读 · 2019年8月18日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

carla 学习笔记

carla 学习笔记

CreateAMind

9+阅读 · 2018年2月7日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

Optimal rate of convergence for approximations of SPDEs with non-regular drift

Arxiv

0+阅读 · 2021年10月12日

Corrupted Contextual Bandits with Action Order Constraints

Arxiv

0+阅读 · 2021年10月12日

Self-guided Approximate Linear Programs

Arxiv

0+阅读 · 2021年10月12日

A Burden Shared is a Burden Halved: A Fairness-Adjusted Approach to Binary Classification

Arxiv

0+阅读 · 2021年10月12日

Adaptive Temporal Difference Learning with Linear Function Approximation

Arxiv

0+阅读 · 2021年10月11日

Pareto Optimization for Subset Selection with Dynamic Cost Constraints

Arxiv

0+阅读 · 2021年10月10日

Many Proxy Controls

Many Proxy Controls

Arxiv

0+阅读 · 2021年10月8日

Approximate Post-Selective Inference for Regression with the Group LASSO

Arxiv

0+阅读 · 2021年10月7日

MSc Dissertation: Exclusive Row Biclustering for Gene Expression Using a Combinatorial Auction Approach

MSc Dissertation: Exclusive Row Biclustering for Gene Expression Using a Combinatorial Auction Approach

Arxiv

6+阅读 · 2018年9月13日

Entire Space Multi-Task Model: An Effective Approach for Estimating Post-Click Conversion Rate

Arxiv

7+阅读 · 2018年4月24日

微信扫码咨询专知VIP会员