在非静止环境中进行MNL-在非静止环境中进行Bandit</s> (MNL-Bandit in non-stationary environments) - 专知论文

会员服务 ·

0

回合 · 平稳的 · 估计/估计量 · 有偏 · 讲稿 ·

2023 年 3 月 4 日

MNL-Bandit in non-stationary environments

翻译：在非静止环境中进行MNL-在非静止环境中进行Bandit

Ayoub Foussoul,Vineet Goyal,Varun Gupta

In this paper, we study the MNL-Bandit problem in a non-stationary environment and present an algorithm with worst-case dynamic regret of $\tilde{O}\left( \min \left\{ \sqrt{NTL}\;,\; N^{\frac{1}{3}}(\Delta_{\infty}^{K})^{\frac{1}{3}} T^{\frac{2}{3}} + \sqrt{NT}\right\}\right)$. Here $N$ is the number of arms, $L$ is the number of switches and $\Delta_{\infty}^K$ is a variation measure of the unknown parameters. We also show that our algorithm is near-optimal (up to logarithmic factors). Our algorithm builds upon the epoch-based algorithm for stationary MNL-Bandit in Agrawal et al. 2016. However, non-stationarity poses several challenges and we introduce new techniques and ideas to address these. In particular, we give a tight characterization for the bias introduced in the estimators due to non stationarity and derive new concentration bounds.

翻译：在本文中,我们研究了非静止环境中的MNL-Bandit问题,并提出了一个最差情况动态后悔$tilde{O<unk> left(min\left\\\\ sqrt{NTL});\\;\;N<unk> frac{1<unk> 3}(\\Delta<unk> infty}K}){{1<unk> 3}{1<unk> 3<unk> T<unk> frac{2<unk> 3<unk> +\sqrt{NT<unk> rt{right}right$的算法。这里是武器的数量,$L$是开关的数量,$\Delta}infty}K$是未知参数的变异度。我们还表明我们的算法接近最佳(最高为对数因素 ) 。我们的算法建立在基于阿格拉瓦尔等人的基于恒定的 MNNNL-Banditi算法的以尿算法基础上的算法。然而,不透明性提出了几项挑战,我们提出了解决这些问题的新技术和新想法。我们特别对由于新的制式和新制式,在静态中引入了对定的集中器中引入的偏差的偏差进行了精确的定性。</s>

0

相关内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【推荐】用Python/OpenCV实现增强现实

【推荐】用Python/OpenCV实现增强现实

机器学习研究会

15+阅读 · 2017年11月16日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

【推荐】深度学习目标检测概览

【推荐】深度学习目标检测概览

机器学习研究会

10+阅读 · 2017年9月1日

单原子填充方钴矿热电材料微观力学行为的分子动力学模拟研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于连续循环平移理论的Shearlet域稀疏表示SAR图像去噪算法研究

国家自然科学基金

0+阅读 · 2013年12月31日

冷胁迫诱导柽柳ThCAP基因表达的分子机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

Riemann-Hilbert 方法和随机矩阵谱分析中的 Painleve 渐近

国家自然科学基金

0+阅读 · 2012年12月31日

函数域中的Vinogradov中值定理

国家自然科学基金

0+阅读 · 2012年12月31日

豆科植物早期根瘤形成和发育中蛋白质相互作用及其调控机制

国家自然科学基金

0+阅读 · 2011年12月31日

非线性软测量系统递推量子随机滤波方法研究

国家自然科学基金

0+阅读 · 2011年12月31日

相关于算子的Orlicz-型函数空间的实变理论

国家自然科学基金

0+阅读 · 2011年12月31日

Curcumin双向调控HO-1/HO-2协同抑制Aβeme复合物防治AD的分子机制

国家自然科学基金

0+阅读 · 2009年12月31日

潜水面与包气带水的动态耦合机制及其尺度效应

国家自然科学基金

0+阅读 · 2008年12月31日

Approximation Algorithms for Envy-Free Cake Division with Connected Pieces

Arxiv

0+阅读 · 2023年4月27日

Nonlinear approximation in bounded orthonormal product bases

Arxiv

0+阅读 · 2023年4月27日

Convergence of Adam Under Relaxed Assumptions

Arxiv

0+阅读 · 2023年4月27日

Improved Stabilizer Estimation via Bell Difference Sampling

Arxiv

0+阅读 · 2023年4月27日

A Deep Learning Synthetic Likelihood Approximation of a Non-stationary Spatial Model for Extreme Streamflow Forecasting

Arxiv

0+阅读 · 2023年4月26日

A Simplicity Bubble Problem in Formal-Theoretic Learning Systems

Arxiv

0+阅读 · 2023年4月25日

Non-agency interventions for causal mediation in the presence of intermediate confounding

Arxiv

0+阅读 · 2023年4月25日

Maximum Likelihood Estimation in Gaussian Process Regression is Ill-Posed

Arxiv

0+阅读 · 2023年4月25日

Real-time Safety Assessment of Dynamic Systems in Non-stationary Environments: A Review of Methods and Techniques

Arxiv

0+阅读 · 2023年4月25日

Instance-Optimality in Interactive Decision Making: Toward a Non-Asymptotic Theory

Arxiv

0+阅读 · 2023年4月24日

VIP会员

文章信息

相关主题

估计/估计量

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《物联网（IoT）中的无人机通信高效控制》135页

《在GNSS信号降级环境中利用共识实现无人机集群稳健协调》

中程单向攻击无人机的战略意义：俄乌战争启示

《面向无人机集群的避障动态传感器覆盖算法》最新38页

相关资讯

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【推荐】用Python/OpenCV实现增强现实

【推荐】用Python/OpenCV实现增强现实

机器学习研究会

15+阅读 · 2017年11月16日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

【推荐】深度学习目标检测概览

【推荐】深度学习目标检测概览

机器学习研究会

10+阅读 · 2017年9月1日

相关论文

Approximation Algorithms for Envy-Free Cake Division with Connected Pieces

Arxiv

0+阅读 · 2023年4月27日

Nonlinear approximation in bounded orthonormal product bases

Arxiv

0+阅读 · 2023年4月27日

Convergence of Adam Under Relaxed Assumptions

Arxiv

0+阅读 · 2023年4月27日

Improved Stabilizer Estimation via Bell Difference Sampling

Arxiv

0+阅读 · 2023年4月27日

A Deep Learning Synthetic Likelihood Approximation of a Non-stationary Spatial Model for Extreme Streamflow Forecasting

Arxiv

0+阅读 · 2023年4月26日

A Simplicity Bubble Problem in Formal-Theoretic Learning Systems

Arxiv

0+阅读 · 2023年4月25日

Non-agency interventions for causal mediation in the presence of intermediate confounding

Arxiv

0+阅读 · 2023年4月25日

Maximum Likelihood Estimation in Gaussian Process Regression is Ill-Posed

Arxiv

0+阅读 · 2023年4月25日

Real-time Safety Assessment of Dynamic Systems in Non-stationary Environments: A Review of Methods and Techniques

Arxiv

0+阅读 · 2023年4月25日

Instance-Optimality in Interactive Decision Making: Toward a Non-Asymptotic Theory

Arxiv

0+阅读 · 2023年4月24日

相关基金

单原子填充方钴矿热电材料微观力学行为的分子动力学模拟研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于连续循环平移理论的Shearlet域稀疏表示SAR图像去噪算法研究

国家自然科学基金

0+阅读 · 2013年12月31日

冷胁迫诱导柽柳ThCAP基因表达的分子机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

Riemann-Hilbert 方法和随机矩阵谱分析中的 Painleve 渐近

国家自然科学基金

0+阅读 · 2012年12月31日

函数域中的Vinogradov中值定理

国家自然科学基金

0+阅读 · 2012年12月31日

豆科植物早期根瘤形成和发育中蛋白质相互作用及其调控机制

国家自然科学基金

0+阅读 · 2011年12月31日

非线性软测量系统递推量子随机滤波方法研究

国家自然科学基金

0+阅读 · 2011年12月31日

相关于算子的Orlicz-型函数空间的实变理论

国家自然科学基金

0+阅读 · 2011年12月31日

Curcumin双向调控HO-1/HO-2协同抑制Aβeme复合物防治AD的分子机制

国家自然科学基金

0+阅读 · 2009年12月31日

潜水面与包气带水的动态耦合机制及其尺度效应

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员