A PDE-Based Analysis of the Symmetric Two-Armed Bernoulli Bandit - Zhuanzhi Papers


Bandits · Analysis · Linear · Optimizer · Scaling

September 9, 2022

A PDE-Based Analysis of the Symmetric Two-Armed Bernoulli Bandit


Vladimir A. Kobzar, Robert V. Kohn

From arXiv. Improved results and presentation.

This work addresses a version of the two-armed Bernoulli bandit problem where the sum of the means of the arms is one (the symmetric two-armed Bernoulli bandit). In a regime where the gap between these means goes to zero and the number of prediction periods approaches infinity, we obtain the leading order terms of the expected regret and pseudoregret for this problem by associating each of them with a solution of a linear parabolic partial differential equation. Our results improve upon the previously known results; specifically, we explicitly compute the leading order term of the optimal regret and pseudoregret in three different scaling regimes for the gap. Additionally, we obtain new non-asymptotic bounds for any given time horizon.
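To make the setting concrete, the following is a minimal simulation sketch, not the authors' PDE method. It assumes the arm means are parameterized as (1 + gap) / 2 and (1 - gap) / 2, so that they sum to one and differ by `gap`, and it uses a simple greedy player chosen here purely for illustration. The function name `simulate_pseudoregret` and the `evidence` statistic are hypothetical names introduced for this example. Pseudoregret is computed as the gap times the number of pulls of the worse arm.

```python
import random


def simulate_pseudoregret(gap, horizon, seed=0):
    """Simulate one run of a symmetric two-armed Bernoulli bandit.

    Assumed setup (illustrative): arm 0 has mean (1 + gap) / 2 and
    arm 1 has mean (1 - gap) / 2, so the means sum to one.
    The player does not know which arm is better.
    Returns the pseudoregret: gap * (number of pulls of the worse arm).
    """
    rng = random.Random(seed)
    means = [(1 + gap) / 2, (1 - gap) / 2]
    evidence = 0      # net evidence that arm 0 is the better arm
    worse_pulls = 0   # how often the worse arm (arm 1) was pulled

    for _ in range(horizon):
        # Greedy rule (an assumption of this sketch): follow the sign
        # of the evidence and break ties at random.
        if evidence > 0:
            arm = 0
        elif evidence < 0:
            arm = 1
        else:
            arm = rng.randrange(2)

        reward = 1 if rng.random() < means[arm] else 0

        # Because the means sum to one, a success on one arm is as
        # informative as a failure on the other arm.
        if (arm == 0) == (reward == 1):
            evidence += 1
        else:
            evidence -= 1

        if arm == 1:
            worse_pulls += 1

    return gap * worse_pulls


if __name__ == "__main__":
    # Average over independent runs to estimate the expected pseudoregret.
    runs = [simulate_pseudoregret(gap=0.05, horizon=2000, seed=s) for s in range(100)]
    print("average pseudoregret:", sum(runs) / len(runs))
```

Varying `gap` together with `horizon` in such a simulation gives a rough empirical view of how pseudoregret depends on the scaling of the gap; the paper's results are analytical and do not rely on simulation.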


