We propose a new globally convergent stochastic second-order method. Our starting point is the development of a new Sketched Newton-Raphson (SNR) method for solving large-scale nonlinear equations of the form $F(x)=0$ with $F:\mathbb{R}^d \rightarrow \mathbb{R}^n$. We then show how to design several stochastic second-order optimization methods by rewriting the optimization problem of interest as a system of nonlinear equations and applying SNR. For instance, by applying SNR to find a stationary point of a generalized linear model (GLM), we derive completely new and scalable stochastic second-order methods. We show that the resulting method is highly competitive with state-of-the-art variance-reduced methods. Furthermore, using a variable splitting trick, we show that the Stochastic Newton method (SNM) is a special case of SNR, and use this connection to establish the first global convergence theory of SNM. We establish the global convergence of SNR by showing that it is a variant of the stochastic gradient descent (SGD) method, and then leveraging proof techniques developed for SGD. As a special case, our theory also provides a new global convergence theory for the original Newton-Raphson method under strictly weaker assumptions than those of the classic monotone convergence theory.
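For reference, one iteration of SNR can be sketched as the display below. This is our reading of the sketch-and-project construction the abstract describes, not a verbatim statement from the paper body: here $DF(x) \in \mathbb{R}^{n \times d}$ denotes the Jacobian of $F$, $\mathbf{S}_k \in \mathbb{R}^{n \times \tau}$ is a sketching matrix drawn i.i.d. from a fixed distribution, $\gamma > 0$ is a step size, and $\dagger$ is the Moore-Penrose pseudoinverse:
\[
x^{k+1} \;=\; x^k \;-\; \gamma\, DF(x^k)^\top \mathbf{S}_k \left( \mathbf{S}_k^\top DF(x^k)\, DF(x^k)^\top \mathbf{S}_k \right)^{\dagger} \mathbf{S}_k^\top F(x^k).
\]
Under this form, each step solves the sketched linearized system $\mathbf{S}_k^\top \big( F(x^k) + DF(x^k)(x - x^k) \big) = 0$ in a least-norm sense, and with $\gamma = 1$, $\mathbf{S}_k = \mathbf{I}$, and an invertible Jacobian it reduces to the classic Newton-Raphson step $x^{k+1} = x^k - DF(x^k)^{-1} F(x^k)$.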