We study the adversarial multi-armed bandit problem and construct a completely online algorithmic framework that is invariant under arbitrary translations and scalings of the arm losses. We analyze the expected performance of our algorithm against a generic competition class, which makes it applicable to a wide variety of problem scenarios. Our algorithm works from a universal prediction perspective, and the performance measure used is the expected regret against arbitrary arm-selection sequences, i.e., the difference between our cumulative loss and that of a competing loss sequence. The competition class can be designed to include fixed arm selections, switching bandits, contextual bandits, or any other competition of interest; the sequences in the competition class are generally determined by the specific application at hand and should be designed accordingly. Our algorithm neither uses nor needs any preliminary information about the loss sequences. Its performance bounds are second-order bounds in terms of the sum of the squared losses, and any affine transform of the losses has no effect on the normalized regret.
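To make the expected-regret setting concrete, the following is a minimal sketch of a standard EXP3-style adversarial bandit learner with importance-weighted loss estimates, compared against a competing arm-selection sequence. This is an illustrative baseline for the setting, not the paper's translation/scale-invariant algorithm; the function name `exp3` and the fixed learning rate `eta` are assumptions for this example.

```python
import math
import random

def exp3(losses, eta):
    """Run a basic EXP3-style bandit on a T x K loss matrix.

    losses[t][k] is the loss of arm k at round t (full matrix is only
    used by the simulator; the learner observes just the pulled arm).
    Returns (picks, total_loss): the chosen arms and cumulative loss.
    """
    T, K = len(losses), len(losses[0])
    weights = [1.0] * K
    total = 0.0
    picks = []
    for t in range(T):
        s = sum(weights)
        probs = [w / s for w in weights]
        # sample an arm according to the current distribution
        arm = random.choices(range(K), weights=probs)[0]
        loss = losses[t][arm]
        total += loss
        picks.append(arm)
        # importance-weighted estimate: only the pulled arm is observed,
        # so its loss is scaled by 1/probability to stay unbiased
        est = loss / probs[arm]
        weights[arm] *= math.exp(-eta * est)
    return picks, total

def regret_vs_sequence(losses, total, competitor):
    """Expected-regret measure: our cumulative loss minus the loss of
    an arbitrary competing arm-selection sequence."""
    return total - sum(losses[t][competitor[t]] for t in range(len(losses)))
```

For the fixed-arm competition class, `competitor` is a constant sequence and the regret reduces to the gap against the best fixed arm; richer classes (switching or contextual sequences) plug into `regret_vs_sequence` unchanged.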