变化环境中按顺序决策的当地差异隐私 (Local Differential Privacy for Sequential Decision Making in a Changing Environment) - 专知论文

会员服务 ·

0

回合 · 赌博机/老虎机 · 性能度量 · Performer · Bandits ·

2023 年 1 月 2 日

Local Differential Privacy for Sequential Decision Making in a Changing Environment

翻译：变化环境中按顺序决策的当地差异隐私

from arxiv, Accepted at AAAI Privacy Preserving Artificial Intelligence (PPAI), 2023. arXiv admin note: text overlap with arXiv:1708.05033

We study the problem of preserving privacy while still providing high utility in sequential decision making scenarios in a changing environment. We consider abruptly changing environment: the environment remains constant during periods and it changes at unknown time instants. To formulate this problem, we propose a variant of multi-armed bandits called non-stationary stochastic corrupt bandits. We construct an algorithm called SW-KLUCB-CF and prove an upper bound on its utility using the performance measure of regret. The proven regret upper bound for SW-KLUCB-CF is near-optimal in the number of time steps and matches the best known bound for analogous problems in terms of the number of time steps and the number of changes. Moreover, we present a provably optimal mechanism which can guarantee the desired level of local differential privacy while providing high utility.

翻译：我们研究保护隐私的问题,同时在不断变化的环境中在连续决策情景中仍然提供很高的效用。我们考虑突然变化的环境:环境在一段时间内保持不变,在未知的时间瞬间发生变化。为了解决这个问题,我们提出一个多武装匪徒的变种,称为非静止的随机腐败匪徒。我们建立了一个叫作SW-KLUCB-CF的算法,并用遗憾的性能衡量方法证明其效用的上限。事实证明,SW-KLUCB-CF的上限遗憾程度在时间步骤的数目上是接近最佳的,在时间步骤的数目和变化的数目上与已知的类似问题最接近的一致。此外,我们提出了一个可以保证当地差异隐私达到理想水平,同时提供高度效用的最佳机制。

0

相关内容

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【北京大学】Locally Differentially Private (Contextual) Bandits Learning

【北京大学】Locally Differentially Private (Contextual) Bandits Learning

专知会员服务

13+阅读 · 2020年6月8日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

RegIII信号通路与SOCS3甲基化协同调控胰腺炎症恶性转化的分子机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于景观格局演变的鄱阳湖典型流域水环境响应及其优化模型研究

国家自然科学基金

0+阅读 · 2012年12月31日

Navier-Stokes方程解的适定性和粘性消失问题

国家自然科学基金

0+阅读 · 2011年12月31日

我国中尺度对流涡旋及其与中尺度对流系统相互作用研究

国家自然科学基金

0+阅读 · 2008年12月31日

Contextual Linear Types for Differential Privacy

Arxiv

0+阅读 · 2023年3月1日

IQ-Flow: Mechanism Design for Inducing Cooperative Behavior to Self-Interested Agents in Sequential Social Dilemmas

Arxiv

0+阅读 · 2023年2月28日

On Bellman's principle of optimality and Reinforcement learning for safety-constrained Markov decision process

Arxiv

0+阅读 · 2023年2月28日

On Differentially Private Federated Linear Contextual Bandits

Arxiv

0+阅读 · 2023年2月27日

Active Membership Inference Attack under Local Differential Privacy in Federated Learning

Arxiv

0+阅读 · 2023年2月24日

VIP会员

文章信息

相关主题

赌博机/老虎机

相关VIP内容

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【北京大学】Locally Differentially Private (Contextual) Bandits Learning

【北京大学】Locally Differentially Private (Contextual) Bandits Learning

专知会员服务

13+阅读 · 2020年6月8日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【博士论文】面向真实世界音视联合语音识别的可扩展框架

《通过仿真与开源数据提升战略决策：机遇与局限》最新报告

【AAAI2026】善始则事半功倍：基于前缀优化的大语言模型推理强化学习

评估大语言模型在科学发现中的作用

相关资讯

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

相关论文

Contextual Linear Types for Differential Privacy

Arxiv

0+阅读 · 2023年3月1日

IQ-Flow: Mechanism Design for Inducing Cooperative Behavior to Self-Interested Agents in Sequential Social Dilemmas

Arxiv

0+阅读 · 2023年2月28日

On Bellman's principle of optimality and Reinforcement learning for safety-constrained Markov decision process

Arxiv

0+阅读 · 2023年2月28日

On Differentially Private Federated Linear Contextual Bandits

Arxiv

0+阅读 · 2023年2月27日

Active Membership Inference Attack under Local Differential Privacy in Federated Learning

Arxiv

0+阅读 · 2023年2月24日

相关基金

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

RegIII信号通路与SOCS3甲基化协同调控胰腺炎症恶性转化的分子机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于景观格局演变的鄱阳湖典型流域水环境响应及其优化模型研究

国家自然科学基金

0+阅读 · 2012年12月31日

Navier-Stokes方程解的适定性和粘性消失问题

国家自然科学基金

0+阅读 · 2011年12月31日

我国中尺度对流涡旋及其与中尺度对流系统相互作用研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员