Refined approachability algorithms and application to regret minimization with global costs - 专知论文 (Zhuanzhi Papers)


September 7, 2021

Refined approachability algorithms and application to regret minimization with global costs


Blackwell's approachability is a framework in which two players, the Decision Maker and the Environment, play a repeated game with vector-valued payoffs. The goal of the Decision Maker is to make the average payoff converge to a given set called the target. When this is possible, simple algorithms that guarantee convergence are known. This abstract tool has been used successfully to construct optimal strategies in various repeated games, and has also found several applications in online learning. By extending an approach proposed by Abernethy et al. (2011), we construct and analyze a class of Follow the Regularized Leader (FTRL) algorithms for Blackwell's approachability that can minimize not only the Euclidean distance to the target set (as is often the case in the context of Blackwell's approachability) but also a wide range of distance-like quantities. This flexibility enables us to apply these algorithms to closely minimize the quantity of interest in various online learning problems. In particular, for regret minimization with $\ell_p$ global costs, we obtain the first bounds with explicit dependence on $p$ and the dimension $d$.
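To make the setting concrete, here is a minimal sketch of the classical Euclidean-distance case only, not the FTRL family the abstract describes. It uses regret matching, a well-known instance of Blackwell's approachability in which the vector payoff is the per-action regret and the target set is the nonpositive orthant. The function names, the loss range [0, 1], and the use of expected (rather than sampled) payoffs are assumptions made for illustration.

```python
import numpy as np


def regret_matching(loss_matrix):
    """Run regret matching on a (T, d) array of losses in [0, 1].

    Illustrative assumption: the Decision Maker plays a mixed action x_t and
    receives the expected vector payoff r_t[i] = <x_t, l_t> - l_t[i], i.e.
    the regret with respect to each of the d actions.
    Returns the average payoff vector after T rounds.
    """
    T, d = loss_matrix.shape
    cum_regret = np.zeros(d)  # running sum of the vector payoffs r_t
    for t in range(T):
        # Direction from the target (nonpositive orthant) to the current sum.
        positive = np.maximum(cum_regret, 0.0)
        total = positive.sum()
        # Play proportionally to positive regret; uniform if already in target.
        x = positive / total if total > 0 else np.full(d, 1.0 / d)
        losses = loss_matrix[t]
        cum_regret += x @ losses - losses
    return cum_regret / T


def distance_to_orthant(v):
    """Euclidean distance from v to the nonpositive orthant."""
    return float(np.linalg.norm(np.maximum(v, 0.0)))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    T, d = 2000, 4
    avg_payoff = regret_matching(rng.random((T, d)))
    # Blackwell's argument gives distance <= sqrt(d / T) in this setup.
    print(distance_to_orthant(avg_payoff), np.sqrt(d / T))
```

With this choice of mixed action, the payoff at each round is orthogonal to the positive part of the cumulative payoff, which is the standard reason the squared distance grows at most linearly and the average distance decays as $\sqrt{d/T}$. Replacing the Euclidean distance by other distance-like quantities, as the abstract discusses, is outside the scope of this sketch.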


