Probably Anytime-Safe Stochastic Combinatorial Semi-Bandits

June 2, 2023

Yunlong Hou, Vincent Y. F. Tan, Zixin Zhong

From arXiv. To be presented at ICML 2023. 57 pages, 6 figures.

Motivated by concerns about making online decisions that incur an undue amount of risk at each time step, in this paper we formulate the probably anytime-safe stochastic combinatorial semi-bandits problem. In this problem, the agent is given the option to select a subset of size at most $K$ from a set of $L$ ground items. Each item is associated with a certain mean reward as well as a variance that represents its risk. To mitigate the risk that the agent incurs, we require that, with probability at least $1-\delta$, over the entire time horizon $T$, each of the choices that the agent makes should contain items whose sum of variances does not exceed a certain variance budget. We call this the probably anytime-safe constraint. Under this constraint, we design and analyze an algorithm, PASCombUCB, that minimizes the regret over the time horizon $T$. By developing accompanying information-theoretic lower bounds, we show that under both the problem-dependent and problem-independent paradigms, PASCombUCB is almost asymptotically optimal. Experiments are conducted to corroborate our theoretical findings. Our problem setup, the proposed PASCombUCB algorithm, and novel analyses are applicable to domains such as recommendation systems and transportation, in which an agent is allowed to choose multiple items at a single time step and wishes to control the risk over the whole time horizon.
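
To make the constraint concrete, the following is a minimal sketch of the offline selection problem the abstract describes: choose at most $K$ of the $L$ items so that the summed variance stays within the budget while the summed mean reward is as large as possible. It is not PASCombUCB. It assumes the means and variances are known exactly (the bandit agent has to estimate them from feedback) and that mean rewards are nonnegative, and the names `best_safe_subset`, `means`, `variances`, and `budget` are hypothetical, chosen only for illustration.

```python
from itertools import combinations


def best_safe_subset(means, variances, K, budget):
    """Brute-force search over subsets of size at most K.

    Returns (subset, total_mean) for the subset with the largest summed
    mean among those whose summed variance does not exceed `budget`.
    Assumes known, nonnegative means; the empty subset is the fallback.
    """
    items = range(len(means))
    best, best_reward = (), 0.0
    for size in range(1, K + 1):
        for subset in combinations(items, size):
            risk = sum(variances[i] for i in subset)
            if risk > budget:
                continue  # violates the variance budget, so it is unsafe
            reward = sum(means[i] for i in subset)
            if reward > best_reward:
                best, best_reward = subset, reward
    return best, best_reward


# Illustrative numbers only (L = 4 items, K = 2), not from the paper.
means = [0.9, 0.8, 0.5, 0.4]
variances = [0.5, 0.4, 0.1, 0.2]
subset, reward = best_safe_subset(means, variances, K=2, budget=0.65)
print(subset, reward)
```

In this toy instance the two items with the highest means, 0 and 1, have a combined variance of 0.9 and are therefore excluded, so the search settles on a lower-reward but safe pair. The enumeration is exponential in $K$ and is meant only to illustrate the feasibility check, not to suggest how a practical algorithm would search.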
