重新思考Sinkhorn算法的初始化 (Rethinking Initialization of the Sinkhorn Algorithm) - 专知论文

会员服务 ·

0

初始化 · 正则化 · 算法 · 雅克比 · 输运 ·

2023 年 4 月 5 日

Rethinking Initialization of the Sinkhorn Algorithm

翻译：重新思考Sinkhorn算法的初始化

James Thornton,Marco Cuturi

While the optimal transport (OT) problem was originally formulated as a linear program, the addition of entropic regularization has proven beneficial both computationally and statistically, for many applications. The Sinkhorn fixed-point algorithm is the most popular approach to solve this regularized problem, and, as a result, multiple attempts have been made to reduce its runtime using, e.g., annealing in the regularization parameter, momentum or acceleration. The premise of this work is that initialization of the Sinkhorn algorithm has received comparatively little attention, possibly due to two preconceptions: since the regularized OT problem is convex, it may not be worth crafting a good initialization, since any is guaranteed to work; secondly, because the outputs of the Sinkhorn algorithm are often unrolled in end-to-end pipelines, a data-dependent initialization would bias Jacobian computations. We challenge this conventional wisdom, and show that data-dependent initializers result in dramatic speed-ups, with no effect on differentiability as long as implicit differentiation is used. Our initializations rely on closed-forms for exact or approximate OT solutions that are known in the 1D, Gaussian or GMM settings. They can be used with minimal tuning, and result in consistent speed-ups for a wide variety of OT problems.

翻译：---- 虽然最优输运问题最初被制定为线性规划，但加入熵正则化已被证明在许多应用中具有计算和统计上的优势。Sinkhorn不动点算法是解决这个正则化问题的最流行方法。因此，已经尝试多种方法通过在正则化参数中使用退火、动量或加速来减少其运行时间。这项工作的前提是Sinkhorn算法的初始化相对较少被关注，可能由于两种偏见：因为正则化的OT问题是凸的，所以可能不值得制定一个好的初始化，因为任何一个都是有保证的。其次，因为Sinkhorn算法的输出通常在端到端的管道中展开，因此数据相关的初始化会导致Jacobian计算的偏见。我们挑战这种传统智慧，并显示出数据相关初始化会导致显著加速，在使用隐式微分时对可微性没有影响。我们的初始化依赖于一维、高斯或GMM设置中已知的精确或近似OT解的解析形式。它们可以使用最小的调整，并在各种OT问题中获得一致的加速。

0

相关内容

初始化

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

125+阅读 · 2022年4月21日

【ICML2021】低秩Sinkhorn 分解

专知会员服务

39+阅读 · 2021年8月20日

WWW21最新「比较学习」教程，135页PPT阐述从排名数据中学习

专知会员服务

37+阅读 · 2021年4月27日

机器学习组合优化

机器学习组合优化

专知会员服务

110+阅读 · 2021年2月16日

【ETH】最新《几何数据分析》2020课程，附PPT下载

专知会员服务

44+阅读 · 2020年12月18日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

【UMD开放书】机器学习课程书册，19章227页pdf，带你学习ML

【UMD开放书】机器学习课程书册，19章227页pdf，带你学习ML

专知会员服务

102+阅读 · 2019年12月9日

开源书：PyTorch深度学习起步

开源书：PyTorch深度学习起步

专知会员服务

51+阅读 · 2019年10月11日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

浅聊对比学习（Contrastive Learning）

浅聊对比学习（Contrastive Learning）

极市平台

2+阅读 · 2022年7月26日

浅聊对比学习（Contrastive Learning）第一弹

浅聊对比学习（Contrastive Learning）第一弹

PaperWeekly

0+阅读 · 2022年6月10日

Multi-Task Learning的几篇综述文章

Multi-Task Learning的几篇综述文章

深度学习自然语言处理

15+阅读 · 2020年6月15日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【推荐】用Python/OpenCV实现增强现实

【推荐】用Python/OpenCV实现增强现实

机器学习研究会

15+阅读 · 2017年11月16日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【推荐】MXNet深度情感分析实战

【推荐】MXNet深度情感分析实战

机器学习研究会

16+阅读 · 2017年10月4日

基于双基系统的椭圆曲线标量乘算法研究

国家自然科学基金

0+阅读 · 2015年12月31日

非局部总变差正则化图像恢复模型的快速子空间校正算法

国家自然科学基金

0+阅读 · 2014年12月31日

带稀疏约束不适定问题的算法研究

国家自然科学基金

0+阅读 · 2014年12月31日

适定的多元样条逼近方法研究

国家自然科学基金

0+阅读 · 2014年12月31日

多任务学习的理论分析与应用

国家自然科学基金

6+阅读 · 2013年12月31日

基于似然估计的梯度优化在变量带误差模型辨识中的收敛性分析

国家自然科学基金

0+阅读 · 2013年12月31日

几类无线通信中的非凸矩阵优化问题及算法研究

国家自然科学基金

1+阅读 · 2013年12月31日

改进Max-SAT算法的关键技术研究

国家自然科学基金

0+阅读 · 2009年12月31日

基于小波变换的仿射不变形状表示算法研究

国家自然科学基金

0+阅读 · 2009年12月31日

随机微分方程的逼近

国家自然科学基金

0+阅读 · 2009年12月31日

The First Proven Performance Guarantees for the Non-Dominated Sorting Genetic Algorithm II (NSGA-II) on a Combinatorial Optimization Problem

Arxiv

0+阅读 · 2023年5月22日

Error estimates of a theta-scheme for second-order mean field games

Arxiv

0+阅读 · 2023年5月22日

A two-way heterogeneity model for dynamic networks

Arxiv

0+阅读 · 2023年5月22日

A Novel Framework for Improving the Breakdown Point of Robust Regression Algorithms

Arxiv

0+阅读 · 2023年5月20日

On the approximability and energy-flow modeling of the electric vehicle sharing problem

Arxiv

0+阅读 · 2023年5月20日

On the Relationship between Markov Switching Models and Fuzzy Clustering: a Nonparametric Method to Detect the Number of States

Arxiv

0+阅读 · 2023年5月20日

Rankability and Linear Ordering Problem: New Probabilistic Insight and Algorithms

Arxiv

0+阅读 · 2023年5月19日

The Mori-Zwanzig formulation of deep learning

Arxiv

0+阅读 · 2023年5月19日

The Principles of Deep Learning Theory

Arxiv

65+阅读 · 2021年6月18日

Train Large, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers

Arxiv

12+阅读 · 2020年6月23日

VIP会员

文章信息

相关主题

相关VIP内容

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

125+阅读 · 2022年4月21日

【ICML2021】低秩Sinkhorn 分解

专知会员服务

39+阅读 · 2021年8月20日

WWW21最新「比较学习」教程，135页PPT阐述从排名数据中学习

专知会员服务

37+阅读 · 2021年4月27日

机器学习组合优化

机器学习组合优化

专知会员服务

110+阅读 · 2021年2月16日

【ETH】最新《几何数据分析》2020课程，附PPT下载

专知会员服务

44+阅读 · 2020年12月18日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

【UMD开放书】机器学习课程书册，19章227页pdf，带你学习ML

【UMD开放书】机器学习课程书册，19章227页pdf，带你学习ML

专知会员服务

102+阅读 · 2019年12月9日

开源书：PyTorch深度学习起步

开源书：PyTorch深度学习起步

专知会员服务

51+阅读 · 2019年10月11日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

【博士论文】扩展可扩展会话推荐的边界

别想太多：高效 R1 风格大型推理模型综述

【ACMMM2025】EvoVLMA: 进化式视觉-语言模型自适应

智能体网络：用AI智能体编织下一代网络

相关资讯

浅聊对比学习（Contrastive Learning）

浅聊对比学习（Contrastive Learning）

极市平台

2+阅读 · 2022年7月26日

浅聊对比学习（Contrastive Learning）第一弹

浅聊对比学习（Contrastive Learning）第一弹

PaperWeekly

0+阅读 · 2022年6月10日

Multi-Task Learning的几篇综述文章

Multi-Task Learning的几篇综述文章

深度学习自然语言处理

15+阅读 · 2020年6月15日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【推荐】用Python/OpenCV实现增强现实

【推荐】用Python/OpenCV实现增强现实

机器学习研究会

15+阅读 · 2017年11月16日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【推荐】MXNet深度情感分析实战

【推荐】MXNet深度情感分析实战

机器学习研究会

16+阅读 · 2017年10月4日

相关论文

The First Proven Performance Guarantees for the Non-Dominated Sorting Genetic Algorithm II (NSGA-II) on a Combinatorial Optimization Problem

Arxiv

0+阅读 · 2023年5月22日

Error estimates of a theta-scheme for second-order mean field games

Arxiv

0+阅读 · 2023年5月22日

A two-way heterogeneity model for dynamic networks

Arxiv

0+阅读 · 2023年5月22日

A Novel Framework for Improving the Breakdown Point of Robust Regression Algorithms

Arxiv

0+阅读 · 2023年5月20日

On the approximability and energy-flow modeling of the electric vehicle sharing problem

Arxiv

0+阅读 · 2023年5月20日

On the Relationship between Markov Switching Models and Fuzzy Clustering: a Nonparametric Method to Detect the Number of States

Arxiv

0+阅读 · 2023年5月20日

Rankability and Linear Ordering Problem: New Probabilistic Insight and Algorithms

Arxiv

0+阅读 · 2023年5月19日

The Mori-Zwanzig formulation of deep learning

Arxiv

0+阅读 · 2023年5月19日

The Principles of Deep Learning Theory

Arxiv

65+阅读 · 2021年6月18日

Train Large, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers

Arxiv

12+阅读 · 2020年6月23日

相关基金

基于双基系统的椭圆曲线标量乘算法研究

国家自然科学基金

0+阅读 · 2015年12月31日

非局部总变差正则化图像恢复模型的快速子空间校正算法

国家自然科学基金

0+阅读 · 2014年12月31日

带稀疏约束不适定问题的算法研究

国家自然科学基金

0+阅读 · 2014年12月31日

适定的多元样条逼近方法研究

国家自然科学基金

0+阅读 · 2014年12月31日

多任务学习的理论分析与应用

国家自然科学基金

6+阅读 · 2013年12月31日

基于似然估计的梯度优化在变量带误差模型辨识中的收敛性分析

国家自然科学基金

0+阅读 · 2013年12月31日

几类无线通信中的非凸矩阵优化问题及算法研究

国家自然科学基金

1+阅读 · 2013年12月31日

改进Max-SAT算法的关键技术研究

国家自然科学基金

0+阅读 · 2009年12月31日

基于小波变换的仿射不变形状表示算法研究

国家自然科学基金

0+阅读 · 2009年12月31日

随机微分方程的逼近

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员