Conic Blackwell 算法: 参数- 自由集聚- 凝固- 聚聚点拼接- 点解 (Conic Blackwell Algorithm: Parameter-Free Convex-Concave Saddle-Point Solving) - 专知论文

会员服务 ·

0

单纯形 · SimPLe · Performer · state-of-the-art · 置信度 ·

2021 年 5 月 27 日

Conic Blackwell Algorithm: Parameter-Free Convex-Concave Saddle-Point Solving

翻译：Conic Blackwell 算法: 参数- 自由集聚- 凝固- 聚聚点拼接- 点解

Julien Grand-Clément,Christian Kroer

We develop new parameter and scale-free algorithms for solving convex-concave saddle-point problems. Our results are based on a new simple regret minimizer, the Conic Blackwell Algorithm$^+$ (CBA$^+$), which attains $O(1/\sqrt{T})$ average regret. Intuitively, our approach generalizes to other decision sets of interest ideas from the Counterfactual Regret minimization (CFR$^+$) algorithm, which has very strong practical performance for solving sequential games on simplexes. We show how to implement CBA$^+$ for the simplex, $\ell_{p}$ norm balls, and ellipsoidal confidence regions in the simplex, and we present numerical experiments for solving matrix games and distributionally robust optimization problems. Our empirical results show that CBA$^+$ is a simple algorithm that outperforms state-of-the-art methods on synthetic data and real data instances, without the need for any choice of step sizes or other algorithmic parameters.

翻译：我们开发了新的参数和无比例值算法来解决 convex- conculve ship- pold-point 问题。我们的结果基于一个新的简单的最小遗憾最小化器 — — Conic Blackwell Algorithm$ $ $( CBA$ $ $ ), 达到美元( $1 /\\ sqrt{T} $ ) 的平均遗憾。直观地说, 我们的方法将反事实最小化( CFR$ $ $ $ ) 算法中的其他决定性利益概念概括化为普通最小化( comfactal Regret 最小化( CFR$ $ $ $ $ ) 。我们展示了如何在简单x 、 $\\ ell\ } 标准球和线性信任区执行 CBBA$ $ 。我们展示了用于解决矩阵游戏和分布强度优化优化问题的数字实验。我们的经验结果表明, CBA$ 是一个简单的算法, 超越合成数据和真实数据中的最新方法,, 并不需要任何步骤大小的选择。

0

相关内容

单纯形

《算法凸几何》简明书，Algorithmic Convex Geometry，50页pdf

专知会员服务

42+阅读 · 2021年4月2日

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

【CVPR2021】自监督几何感知

【CVPR2021】自监督几何感知

专知会员服务

46+阅读 · 2021年3月6日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

2020数据工程师成长路线图

专知会员服务

41+阅读 · 2020年9月6日

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

专知会员服务

111+阅读 · 2020年5月15日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【北京智源大会2019】神经网络的优化Optimization for Overparametrized Deep Neural Networks，北京大学 | 王立威

【北京智源大会2019】神经网络的优化Optimization for Overparametrized Deep Neural Networks，北京大学 | 王立威

专知会员服务

23+阅读 · 2019年11月21日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

已删除

将门创投

4+阅读 · 2018年7月31日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

专知

17+阅读 · 2018年4月28日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

大数据的分布式算法

大数据的分布式算法

待字闺中

3+阅读 · 2017年6月13日

Solving for best linear approximates

Arxiv

0+阅读 · 2021年7月19日

Solving high-dimensional parabolic PDEs using the tensor train format

Arxiv

0+阅读 · 2021年7月17日

On Efficient Optimal Transport: An Analysis of Greedy and Accelerated Mirror Descent Algorithms

Arxiv

0+阅读 · 2021年7月17日

Optimal Approximation Rate of ReLU Networks in terms of Width and Depth

Arxiv

0+阅读 · 2021年7月17日

Clustering Data with Nonignorable Missingness using Semi-Parametric Mixture Models

Arxiv

0+阅读 · 2021年7月16日

A Riemannian Block Coordinate Descent Method for Computing the Projection Robust Wasserstein Distance

Arxiv

0+阅读 · 2021年7月16日

USCO-Solver: Solving Undetermined Stochastic Combinatorial Optimization Problems

Arxiv

0+阅读 · 2021年7月15日

Quantum Speedup for Graph Sparsification, Cut Approximation and Laplacian Solving

Arxiv

0+阅读 · 2021年7月15日

Variational Bayesian Reinforcement Learning with Regret Bounds

Arxiv

3+阅读 · 2018年7月25日

Reinforcement Learning for Solving the Vehicle Routing Problem

Arxiv

3+阅读 · 2018年5月21日

VIP会员

文章信息

相关主题

state-of-the-art

相关VIP内容

《算法凸几何》简明书，Algorithmic Convex Geometry，50页pdf

专知会员服务

42+阅读 · 2021年4月2日

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

【CVPR2021】自监督几何感知

【CVPR2021】自监督几何感知

专知会员服务

46+阅读 · 2021年3月6日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

2020数据工程师成长路线图

专知会员服务

41+阅读 · 2020年9月6日

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

专知会员服务

111+阅读 · 2020年5月15日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【北京智源大会2019】神经网络的优化Optimization for Overparametrized Deep Neural Networks，北京大学 | 王立威

【北京智源大会2019】神经网络的优化Optimization for Overparametrized Deep Neural Networks，北京大学 | 王立威

专知会员服务

23+阅读 · 2019年11月21日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

《基于大型语言模型的软件工程自动化研究》最新264页

《基于大型语言模型的信号处理管线研究：推进军事电子情报工作流程》最新76页

中文版 | 战争算法：生成式人工智能在战场的崛起

中文版《美国陆军：战术行为性远程医疗实施观察与建议》

相关资讯

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

已删除

将门创投

4+阅读 · 2018年7月31日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

专知

17+阅读 · 2018年4月28日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

大数据的分布式算法

大数据的分布式算法

待字闺中

3+阅读 · 2017年6月13日

相关论文

Solving for best linear approximates

Arxiv

0+阅读 · 2021年7月19日

Solving high-dimensional parabolic PDEs using the tensor train format

Arxiv

0+阅读 · 2021年7月17日

On Efficient Optimal Transport: An Analysis of Greedy and Accelerated Mirror Descent Algorithms

Arxiv

0+阅读 · 2021年7月17日

Optimal Approximation Rate of ReLU Networks in terms of Width and Depth

Arxiv

0+阅读 · 2021年7月17日

Clustering Data with Nonignorable Missingness using Semi-Parametric Mixture Models

Arxiv

0+阅读 · 2021年7月16日

A Riemannian Block Coordinate Descent Method for Computing the Projection Robust Wasserstein Distance

Arxiv

0+阅读 · 2021年7月16日

USCO-Solver: Solving Undetermined Stochastic Combinatorial Optimization Problems

Arxiv

0+阅读 · 2021年7月15日

Quantum Speedup for Graph Sparsification, Cut Approximation and Laplacian Solving

Arxiv

0+阅读 · 2021年7月15日

Variational Bayesian Reinforcement Learning with Regret Bounds

Arxiv

3+阅读 · 2018年7月25日

Reinforcement Learning for Solving the Vehicle Routing Problem

Arxiv

3+阅读 · 2018年5月21日

微信扫码咨询专知VIP会员