Online Learning with Adversaries: A Differential Inclusion Analysis


April 4, 2023


Swetha Ganesh, Alexandre Reiffers-Masson, Gugan Thoppe

We consider the measurement model $Y = AX$, where $X$ and, hence, $Y$ are random variables and $A$ is an a priori known tall matrix. At each time instance, a sample of one of $Y$'s coordinates is available, and the goal is to estimate $\mu := \mathbb{E}[X]$ from these samples. The challenge is that a small but unknown subset of $Y$'s coordinates is controlled by adversaries with infinite power: they can return any real number each time they are queried for a sample. For this adversarial setting, we propose the first asynchronous online algorithm that converges to $\mu$ almost surely. We prove this result using a novel differential-inclusion-based two-timescale analysis. Two key highlights of our proof are: (a) the use of a novel Lyapunov function to show that $\mu$ is the unique global attractor for our algorithm's limiting dynamics, and (b) the use of martingale and stopping-time theory to show that our algorithm's iterates are almost surely bounded.
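To make the setting concrete, the following is a minimal simulation sketch of the measurement model, not the authors' algorithm. It assumes a scalar $X$ (so $A$ is a $5 \times 1$ column), a Gaussian $X$ with $\mu = 3$, and a single adversary that always returns 1000; all of these values, and the function name `simulate`, are illustrative assumptions. Each step samples one coordinate and updates only that coordinate's running average. The sketch then compares a naive least-squares estimate with a simple robust stand-in, the median of the per-coordinate estimates $\hat{y}_i / a_i$.

```python
import random
import statistics


def simulate(num_steps=20000, seed=0):
    """Simulate Y = A X with one adversarial coordinate (illustrative values)."""
    rng = random.Random(seed)
    mu, sigma = 3.0, 1.0              # assumed distribution of X: N(mu, sigma^2)
    a = [1.0, 2.0, -1.0, 0.5, 3.0]    # assumed tall 5x1 matrix A, known a priori
    adversarial = {0}                 # unknown to the estimator

    y_hat = [0.0] * len(a)            # running average of each coordinate of Y
    counts = [0] * len(a)

    for _ in range(num_steps):
        i = rng.randrange(len(a))     # only one coordinate is observed per step
        if i in adversarial:
            sample = 1000.0           # adversary may return any real number
        else:
            sample = a[i] * rng.gauss(mu, sigma)
        counts[i] += 1
        y_hat[i] += (sample - y_hat[i]) / counts[i]

    # Naive least squares on the averaged measurements: argmin_x ||a x - y_hat||_2.
    naive = sum(ai * yi for ai, yi in zip(a, y_hat)) / sum(ai * ai for ai in a)
    # Robust stand-in: median of the per-coordinate estimates y_hat_i / a_i.
    robust = statistics.median(yi / ai for ai, yi in zip(a, y_hat))
    return mu, naive, robust


if __name__ == "__main__":
    mu, naive, robust = simulate()
    print(f"true mean {mu}, least squares {naive:.3f}, median-based {robust:.3f}")
```

In this sketch a single corrupted coordinate pulls the least-squares estimate far from $\mu$, while the median stays close because fewer than half of the coordinates are adversarial. The median is only a stand-in for the scalar case; it says nothing about how the paper handles a general tall matrix $A$.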

