Near Optimal Adversarial Attack on UCB Bandits - Zhuanzhi (专知) Papers

Bandits · Attacks · Adversarial attacks · Optimality · Lower bounds

March 24, 2023

Near Optimal Adversarial Attack on UCB Bandits

We consider a stochastic multi-armed bandit problem where rewards are subject to adversarial corruption. We propose a novel attack strategy that manipulates a learner following the UCB principle into pulling some non-optimal target arm $T - o(T)$ times with a cumulative cost that scales as $\sqrt{\log T}$, where $T$ is the number of rounds. We also prove the first lower bound on the cumulative attack cost. Our lower bound matches our upper bound up to $\log \log T$ factors, showing our attack to be near optimal.
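The abstract does not spell out the mechanics of the attack, so the following is only a minimal sketch of the setting it describes: a UCB1 learner whose observed rewards can be shifted by an attacker before the learner sees them. The attack here is a simple mean-capping heuristic chosen for illustration, not the paper's strategy, and it is not expected to reach the $\sqrt{\log T}$ cost rate. The function name `run_attacked_ucb`, the parameters `margin` and `sigma`, the Gaussian reward model, and the UCB1 bonus term are all assumptions made for this example.

```python
import math
import random


def run_attacked_ucb(means, target, horizon, margin=0.5, sigma=0.1,
                     attack=True, seed=0):
    """Simulate UCB1 with optional reward corruption (illustrative only).

    Returns (pull counts per arm, cumulative attack cost), where the cost
    is the sum of absolute corruptions applied to the rewards.
    """
    rng = random.Random(seed)
    k = len(means)
    counts = [0] * k    # number of pulls of each arm
    sums = [0.0] * k    # sum of the rewards the learner observed
    cost = 0.0

    for t in range(1, horizon + 1):
        if t <= k:
            arm = t - 1  # initialization: pull every arm once
        else:
            # UCB1 index: empirical mean plus exploration bonus
            arm = max(
                range(k),
                key=lambda a: sums[a] / counts[a]
                + math.sqrt(2.0 * math.log(t) / counts[a]),
            )

        reward = rng.gauss(means[arm], sigma)

        # Hypothetical heuristic (an assumption, not the paper's attack):
        # when a non-target arm is pulled, lower its reward so that its
        # empirical mean after this pull is at most the target arm's
        # current empirical mean minus `margin`.
        if attack and arm != target and counts[target] > 0:
            target_mean = sums[target] / counts[target]
            cap = (target_mean - margin) * (counts[arm] + 1) - sums[arm]
            if reward > cap:
                cost += reward - cap
                reward = cap

        counts[arm] += 1
        sums[arm] += reward

    return counts, cost


if __name__ == "__main__":
    arm_means = [0.9, 0.5, 0.2]  # arm 2 is the worst arm and the target
    for flag in (False, True):
        pulls, total_cost = run_attacked_ucb(arm_means, target=2,
                                             horizon=5000, attack=flag)
        print(f"attack={flag} pulls={pulls} cost={total_cost:.2f}")
```

Running the script compares the target arm's pull count with and without corruption; tracking `cost` as the horizon grows is one way to see how far this naive heuristic is from the rate stated in the abstract.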
