Towards Optimal Algorithms for Multi-Player Bandits without Collision Sensing Information - 专知 (Zhuanzhi) Papers

June 6, 2022

Towards Optimal Algorithms for Multi-Player Bandits without Collision Sensing Information

Wei Huang, Richard Combes, Cindy Trinh

From arXiv, 24 pages, COLT 2022

We propose a novel algorithm for multi-player multi-armed bandits without collision sensing information. Our algorithm circumvents two problems shared by all state-of-the-art algorithms: it does not require a lower bound on the minimal expected reward of an arm as input, and its performance does not scale in inverse proportion to the minimal expected reward. We prove a theoretical regret upper bound to justify these claims. We complement our theoretical results with numerical experiments, showing that the proposed algorithm also outperforms the state of the art in practice.
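To make the setting concrete, here is a minimal Python sketch of one round of a multi-player bandit without collision sensing. It is not the authors' algorithm; it only illustrates the feedback model as it is commonly stated. The function name `play_round`, the Bernoulli reward assumption, and the example means are all hypothetical choices made for illustration.

```python
import random


def play_round(means, choices, rng):
    """Simulate one round and return the reward each player observes.

    means:   assumed expected Bernoulli reward of each arm (hypothetical values)
    choices: index of the arm pulled by each player in this round
    rng:     a random.Random instance, passed in for reproducibility
    """
    # Count how many players pulled each arm.
    counts = {}
    for arm in choices:
        counts[arm] = counts.get(arm, 0) + 1

    observed = []
    for arm in choices:
        draw = 1 if rng.random() < means[arm] else 0
        collided = counts[arm] > 1
        # Without collision sensing, a player sees only this number
        # and never the `collided` flag itself.
        observed.append(0 if collided else draw)
    return observed


rng = random.Random(0)
means = [0.9, 0.5, 0.1]  # illustrative arm means, not taken from the paper
# Players 0 and 1 collide on arm 0; player 2 is alone on arm 2.
print(play_round(means, [0, 0, 2], rng))
```

In this sketch a reward of 0 can come either from a collision or from an unlucky draw on a low-mean arm, and the player cannot tell the two apart. This ambiguity is one plausible intuition for why the minimal expected reward plays a role in analyses of this setting.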
