不同私人多武装强盗在洗牌模型中 (Differentially Private Multi-Armed Bandits in the Shuffle Model) - 专知论文

会员服务 ·

0

赌博机/老虎机 · MoDELS · ARM · 计算学习理论 ·

2021 年 6 月 8 日

Differentially Private Multi-Armed Bandits in the Shuffle Model

翻译：不同私人多武装强盗在洗牌模型中

Jay Tenenbaum,Haim Kaplan,Yishay Mansour,Uri Stemmer

We give an $(\varepsilon,\delta)$-differentially private algorithm for the multi-armed bandit (MAB) problem in the shuffle model with a distribution-dependent regret of $O\left(\left(\sum_{a\in [k]:\Delta_a>0}\frac{\log T}{\Delta_a}\right)+\frac{k\sqrt{\log\frac{1}{\delta}}\log T}{\varepsilon}\right)$, and a distribution-independent regret of $O\left(\sqrt{kT\log T}+\frac{k\sqrt{\log\frac{1}{\delta}}\log T}{\varepsilon}\right)$, where $T$ is the number of rounds, $\Delta_a$ is the suboptimality gap of the arm $a$, and $k$ is the total number of arms. Our upper bound almost matches the regret of the best known algorithms for the centralized model, and significantly outperforms the best known algorithm in the local model.

翻译：我们给出了美元( varepsilon,\ delta) 美元, 不同私人的算法, 并给出了在洗牌模型中多武装土匪问题( MAB) 的配发( MAB), 并附有基于分配的遗憾 $Oleft( left) (\\\ sum ⁇ a\ a\ in [ k]:\ Delta_ a> 0\\\\\\ frac\ log T\ k\ t\ k\ t\ log\ frac{ 1\\\\ delta ⁇ log Tunvarepsilon ⁇ right) $, 其中$T是弹数, $\ delta_ a 是手臂的亚最佳差距 $, $( $) 是武器的总数。我们的上层几乎匹配了最著名的中央模型的已知算法的遗憾, 并且大大超越了本地最已知的算法。

0

相关内容

赌博机/老虎机

赌博机/老虎机

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【经典书】应用随机微分方程，324页pdf，Applied Stochastic Differential Equations

【经典书】应用随机微分方程，324页pdf，Applied Stochastic Differential Equations

专知会员服务

60+阅读 · 2020年11月21日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

【经典书】C语言傻瓜式入门（第二版），411页pdf

【经典书】C语言傻瓜式入门（第二版），411页pdf

专知会员服务

54+阅读 · 2020年8月16日

【ICML2020】拉普拉斯正则化小样本学习，Laplacian Regularized Few-Shot Learning

【ICML2020】拉普拉斯正则化小样本学习，Laplacian Regularized Few-Shot Learning

专知会员服务

77+阅读 · 2020年6月28日

【北京大学】Locally Differentially Private (Contextual) Bandits Learning

【北京大学】Locally Differentially Private (Contextual) Bandits Learning

专知会员服务

13+阅读 · 2020年6月8日

康奈尔大学Jon Kleinberg经典书《算法设计Algorithm Design》课件PPT与电子书，864页pdf

康奈尔大学Jon Kleinberg经典书《算法设计Algorithm Design》课件PPT与电子书，864页pdf

专知会员服务

241+阅读 · 2020年1月21日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

LibRec 精选：AutoML for Contextual Bandits

LibRec 精选：AutoML for Contextual Bandits

LibRec智能推荐

7+阅读 · 2019年9月19日

样本贡献不均：Focal Loss和 Gradient Harmonizing Mechanism

样本贡献不均：Focal Loss和 Gradient Harmonizing Mechanism

极市平台

25+阅读 · 2019年4月25日

【TED】生命中的每一年的智慧

【TED】生命中的每一年的智慧

英语演讲视频每日一推

10+阅读 · 2019年1月29日

RL 真经

CreateAMind

5+阅读 · 2018年12月28日

Ray RLlib: Scalable 降龙十八掌

Ray RLlib: Scalable 降龙十八掌

CreateAMind

9+阅读 · 2018年12月28日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Reinforcement Learning: An Introduction 2018第二版 500页

Reinforcement Learning: An Introduction 2018第二版 500页

CreateAMind

14+阅读 · 2018年4月27日

已删除

将门创投

3+阅读 · 2017年9月12日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

Privacy-Aware Rejection Sampling

Arxiv

0+阅读 · 2021年8月2日

NeuralDP Differentially private neural networks by design

Arxiv

0+阅读 · 2021年8月2日

Faster Rates of Differentially Private Stochastic Convex Optimization

Arxiv

0+阅读 · 2021年7月31日

Privacy Enhancement via Dummy Points in the Shuffle Model

Arxiv

0+阅读 · 2021年7月31日

Pure Exploration and Regret Minimization in Matching Bandits

Arxiv

0+阅读 · 2021年7月31日

Towards General Function Approximation in Zero-Sum Markov Games

Arxiv

0+阅读 · 2021年7月30日

Distribution free optimality intervals for clustering

Arxiv

0+阅读 · 2021年7月30日

Digital Passport and Visa Asset Management Using Private and Permissioned Blockchain

Arxiv

0+阅读 · 2021年7月27日

Large Scale Private Learning via Low-rank Reparametrization

Arxiv

5+阅读 · 2021年6月17日

The Search Problem in Mixture Models

Arxiv

3+阅读 · 2018年2月24日

VIP会员

文章信息

相关主题

赌博机/老虎机

计算学习理论

相关VIP内容

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【经典书】应用随机微分方程，324页pdf，Applied Stochastic Differential Equations

【经典书】应用随机微分方程，324页pdf，Applied Stochastic Differential Equations

专知会员服务

60+阅读 · 2020年11月21日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

【经典书】C语言傻瓜式入门（第二版），411页pdf

【经典书】C语言傻瓜式入门（第二版），411页pdf

专知会员服务

54+阅读 · 2020年8月16日

【ICML2020】拉普拉斯正则化小样本学习，Laplacian Regularized Few-Shot Learning

【ICML2020】拉普拉斯正则化小样本学习，Laplacian Regularized Few-Shot Learning

专知会员服务

77+阅读 · 2020年6月28日

【北京大学】Locally Differentially Private (Contextual) Bandits Learning

【北京大学】Locally Differentially Private (Contextual) Bandits Learning

专知会员服务

13+阅读 · 2020年6月8日

康奈尔大学Jon Kleinberg经典书《算法设计Algorithm Design》课件PPT与电子书，864页pdf

康奈尔大学Jon Kleinberg经典书《算法设计Algorithm Design》课件PPT与电子书，864页pdf

专知会员服务

241+阅读 · 2020年1月21日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

热门VIP内容

开通专知VIP会员享更多权益服务

人机协同时代的军事指挥控制演进

《英国智库：瓦解俄罗斯防空系统生产，夺回制空权》最新报告

《通过仿真与开源数据提升战略决策：机遇与局限》最新报告

《战术突击工具包：军队的“边缘”操作系统》报告

相关资讯

LibRec 精选：AutoML for Contextual Bandits

LibRec 精选：AutoML for Contextual Bandits

LibRec智能推荐

7+阅读 · 2019年9月19日

样本贡献不均：Focal Loss和 Gradient Harmonizing Mechanism

样本贡献不均：Focal Loss和 Gradient Harmonizing Mechanism

极市平台

25+阅读 · 2019年4月25日

【TED】生命中的每一年的智慧

【TED】生命中的每一年的智慧

英语演讲视频每日一推

10+阅读 · 2019年1月29日

RL 真经

CreateAMind

5+阅读 · 2018年12月28日

Ray RLlib: Scalable 降龙十八掌

Ray RLlib: Scalable 降龙十八掌

CreateAMind

9+阅读 · 2018年12月28日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Reinforcement Learning: An Introduction 2018第二版 500页

Reinforcement Learning: An Introduction 2018第二版 500页

CreateAMind

14+阅读 · 2018年4月27日

已删除

将门创投

3+阅读 · 2017年9月12日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

相关论文

Privacy-Aware Rejection Sampling

Arxiv

0+阅读 · 2021年8月2日

NeuralDP Differentially private neural networks by design

Arxiv

0+阅读 · 2021年8月2日

Faster Rates of Differentially Private Stochastic Convex Optimization

Arxiv

0+阅读 · 2021年7月31日

Privacy Enhancement via Dummy Points in the Shuffle Model

Arxiv

0+阅读 · 2021年7月31日

Pure Exploration and Regret Minimization in Matching Bandits

Arxiv

0+阅读 · 2021年7月31日

Towards General Function Approximation in Zero-Sum Markov Games

Arxiv

0+阅读 · 2021年7月30日

Distribution free optimality intervals for clustering

Arxiv

0+阅读 · 2021年7月30日

Digital Passport and Visa Asset Management Using Private and Permissioned Blockchain

Arxiv

0+阅读 · 2021年7月27日

Large Scale Private Learning via Low-rank Reparametrization

Arxiv

5+阅读 · 2021年6月17日

The Search Problem in Mixture Models

Arxiv

3+阅读 · 2018年2月24日

微信扫码咨询专知VIP会员