Stochastic differential equations for limiting description of UCB rule for Gaussian multi-armed bandits - 专知论文

会员服务 ·

0

赌博机/老虎机 · 规范化的 · 控制器 · 上置信界限 · Performer ·

2023 年 5 月 11 日

Stochastic differential equations for limiting description of UCB rule for Gaussian multi-armed bandits

翻译：暂无翻译

from arxiv, 9 pages, 2 figures

We consider the upper confidence bound strategy for Gaussian multi-armed bandits with known control horizon sizes $N$ and build its limiting description with a system of stochastic differential equations and ordinary differential equations. Rewards for the arms are assumed to have unknown expected values and known variances. A set of Monte-Carlo simulations was performed for the case of close distributions of rewards, when mean rewards differ by the magnitude of order $N^{-1/2}$, as it yields the highest normalized regret, to verify the validity of the obtained description. The minimal size of the control horizon when the normalized regret is not noticeably larger than maximum possible was estimated.

翻译：暂无翻译

0

相关内容

赌博机/老虎机

赌博机/老虎机

CVPR 2023开会了！谷歌等最新《视觉上理解和解释注意力》教程，附152页ppt

CVPR 2023开会了！谷歌等最新《视觉上理解和解释注意力》教程，附152页ppt

专知会员服务

85+阅读 · 2023年6月19日

265页《数值线性代数基础》，密西西比大学Seongjai Kim教授最新讲义，Fundamentals of Numerical Linear Algebra

265页《数值线性代数基础》，密西西比大学Seongjai Kim教授最新讲义，Fundamentals of Numerical Linear Algebra

专知会员服务

45+阅读 · 2022年3月18日

最浅显的奇异值分解(SVD)介绍，《Singular Value Decomposition as Simply as Possible》

最浅显的奇异值分解(SVD)介绍，《Singular Value Decomposition as Simply as Possible》

专知会员服务

12+阅读 · 2022年3月14日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

54+阅读 · 2020年9月7日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文推荐】最新七篇知识图谱相关论文—嵌入式知识、Zero-shot识别、知识图谱嵌入、网络库、变分推理、解释、弱监督

【论文推荐】最新七篇知识图谱相关论文—嵌入式知识、Zero-shot识别、知识图谱嵌入、网络库、变分推理、解释、弱监督

专知

19+阅读 · 2018年3月26日

可解释的CNN

可解释的CNN

CreateAMind

17+阅读 · 2017年10月5日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

【推荐】GAN架构入门综述(资源汇总)

【推荐】GAN架构入门综述(资源汇总)

机器学习研究会

10+阅读 · 2017年9月3日

拓扑绝缘体与超导体耦合体系中交叉Andreev反射研究

国家自然科学基金

1+阅读 · 2014年12月31日

Resveratrol联合MSCs移植对阿尔茨海默鼠的干预效果及Sirt1分子信号的介导作用

国家自然科学基金

0+阅读 · 2014年12月31日

Anderson型多酸的不对称修饰及可控组装研究

国家自然科学基金

1+阅读 · 2014年12月31日

可压缩Navier-Stokes方程和Boltzmann方程解的渐近行为

国家自然科学基金

0+阅读 · 2013年12月31日

Vlasov-Poisson-Boltzmann方程研究

国家自然科学基金

0+阅读 · 2013年12月31日

快裂变颈部发射的同位旋效应与亚饱和密区对称能的约束

国家自然科学基金

0+阅读 · 2012年12月31日

基于碳化硅晶体色心的自旋单量子态构筑和测量的理论模拟

国家自然科学基金

0+阅读 · 2012年12月31日

HC-SCR反应中乙醇催化制氢与还原剂活化耦合研究

国家自然科学基金

0+阅读 · 2011年12月31日

益气活血中药（灯盏生脉方）干预缺血性中风二级预防的代谢组学研究

国家自然科学基金

0+阅读 · 2011年12月31日

双光子跃迁主导超冷原子系综的量子相干操控

国家自然科学基金

0+阅读 · 2011年12月31日

Finite-Sample Analysis of Learning High-Dimensional Single ReLU Neuron

Arxiv

0+阅读 · 2023年6月26日

Analysis of the Decoder Width for Parametric Partial Differential Equations

Arxiv

0+阅读 · 2023年6月26日

Numerical approximation of the invariant distribution for a class of stochastic damped wave equations

Arxiv

0+阅读 · 2023年6月24日

Computational multiscale methods for nondivergence-form elliptic partial differential equations

Arxiv

0+阅读 · 2023年6月24日

On Convex Data-Driven Inverse Optimal Control for Nonlinear, Non-stationary and Stochastic Systems

Arxiv

0+阅读 · 2023年6月24日

Mass, momentum and energy preserving FEEC and broken-FEEC schemes for the incompressible Navier-Stokes equations

Arxiv

0+阅读 · 2023年6月23日

Sharp analysis of EM for learning mixtures of pairwise differences

Arxiv

0+阅读 · 2023年6月22日

RANS-PINN based Simulation Surrogates for Predicting Turbulent Flows

RANS-PINN based Simulation Surrogates for Predicting Turbulent Flows

Arxiv

1+阅读 · 2023年6月22日

Faster Compression of Deterministic Finite Automata

Arxiv

0+阅读 · 2023年6月22日

The Cost of Informing Decision-Makers in Multi-Agent Maximum Coverage Problems with Random Resource Values

Arxiv

0+阅读 · 2023年6月21日

VIP会员

文章信息

相关主题

赌博机/老虎机

上置信界限

相关VIP内容

CVPR 2023开会了！谷歌等最新《视觉上理解和解释注意力》教程，附152页ppt

CVPR 2023开会了！谷歌等最新《视觉上理解和解释注意力》教程，附152页ppt

专知会员服务

85+阅读 · 2023年6月19日

265页《数值线性代数基础》，密西西比大学Seongjai Kim教授最新讲义，Fundamentals of Numerical Linear Algebra

265页《数值线性代数基础》，密西西比大学Seongjai Kim教授最新讲义，Fundamentals of Numerical Linear Algebra

专知会员服务

45+阅读 · 2022年3月18日

最浅显的奇异值分解(SVD)介绍，《Singular Value Decomposition as Simply as Possible》

最浅显的奇异值分解(SVD)介绍，《Singular Value Decomposition as Simply as Possible》

专知会员服务

12+阅读 · 2022年3月14日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

54+阅读 · 2020年9月7日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《为多域数字战场变革装甲力量》报告

《多域训练：利用开放标准将太空与网络域同陆、海、空域训练相整合》报告

面向城市战：欧美徒步作战新装备

《人工智能增强监视分析：利用跨网络、陆地、空中及海上领域的威胁向量实时建模》

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文推荐】最新七篇知识图谱相关论文—嵌入式知识、Zero-shot识别、知识图谱嵌入、网络库、变分推理、解释、弱监督

【论文推荐】最新七篇知识图谱相关论文—嵌入式知识、Zero-shot识别、知识图谱嵌入、网络库、变分推理、解释、弱监督

专知

19+阅读 · 2018年3月26日

可解释的CNN

可解释的CNN

CreateAMind

17+阅读 · 2017年10月5日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

【推荐】GAN架构入门综述(资源汇总)

【推荐】GAN架构入门综述(资源汇总)

机器学习研究会

10+阅读 · 2017年9月3日

相关论文

Finite-Sample Analysis of Learning High-Dimensional Single ReLU Neuron

Arxiv

0+阅读 · 2023年6月26日

Analysis of the Decoder Width for Parametric Partial Differential Equations

Arxiv

0+阅读 · 2023年6月26日

Numerical approximation of the invariant distribution for a class of stochastic damped wave equations

Arxiv

0+阅读 · 2023年6月24日

Computational multiscale methods for nondivergence-form elliptic partial differential equations

Arxiv

0+阅读 · 2023年6月24日

On Convex Data-Driven Inverse Optimal Control for Nonlinear, Non-stationary and Stochastic Systems

Arxiv

0+阅读 · 2023年6月24日

Mass, momentum and energy preserving FEEC and broken-FEEC schemes for the incompressible Navier-Stokes equations

Arxiv

0+阅读 · 2023年6月23日

Sharp analysis of EM for learning mixtures of pairwise differences

Arxiv

0+阅读 · 2023年6月22日

RANS-PINN based Simulation Surrogates for Predicting Turbulent Flows

RANS-PINN based Simulation Surrogates for Predicting Turbulent Flows

Arxiv

1+阅读 · 2023年6月22日

Faster Compression of Deterministic Finite Automata

Arxiv

0+阅读 · 2023年6月22日

The Cost of Informing Decision-Makers in Multi-Agent Maximum Coverage Problems with Random Resource Values

Arxiv

0+阅读 · 2023年6月21日

相关基金

拓扑绝缘体与超导体耦合体系中交叉Andreev反射研究

国家自然科学基金

1+阅读 · 2014年12月31日

Resveratrol联合MSCs移植对阿尔茨海默鼠的干预效果及Sirt1分子信号的介导作用

国家自然科学基金

0+阅读 · 2014年12月31日

Anderson型多酸的不对称修饰及可控组装研究

国家自然科学基金

1+阅读 · 2014年12月31日

可压缩Navier-Stokes方程和Boltzmann方程解的渐近行为

国家自然科学基金

0+阅读 · 2013年12月31日

Vlasov-Poisson-Boltzmann方程研究

国家自然科学基金

0+阅读 · 2013年12月31日

快裂变颈部发射的同位旋效应与亚饱和密区对称能的约束

国家自然科学基金

0+阅读 · 2012年12月31日

基于碳化硅晶体色心的自旋单量子态构筑和测量的理论模拟

国家自然科学基金

0+阅读 · 2012年12月31日

HC-SCR反应中乙醇催化制氢与还原剂活化耦合研究

国家自然科学基金

0+阅读 · 2011年12月31日

益气活血中药（灯盏生脉方）干预缺血性中风二级预防的代谢组学研究

国家自然科学基金

0+阅读 · 2011年12月31日

双光子跃迁主导超冷原子系综的量子相干操控

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员