ISAAC Newton: Input-based Approximate Curvature for Newton's Method


May 1, 2023


Felix Petersen, Tobias Sutter, Christian Borgelt, Dongsung Huh, Hilde Kuehne, Yuekai Sun, Oliver Deussen

From arXiv. Published at ICLR 2023. Code: https://github.com/Felix-Petersen/isaac. Video: https://youtu.be/7RKRX-MdwqM

We present ISAAC (Input-baSed ApproximAte Curvature), a novel method that conditions the gradient using selected second-order information and has an asymptotically vanishing computational overhead, assuming a batch size smaller than the number of neurons. We show that it is possible to compute a good conditioner based only on the input to the respective layer, without substantial computational overhead. The proposed method allows effective training even in small-batch stochastic regimes, which makes it competitive with first-order as well as second-order methods.
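To make the idea of an input-only conditioner concrete, here is a minimal NumPy sketch. It is an illustration under stated assumptions, not the authors' implementation (see the linked repository for that). It assumes a fully connected layer with inputs `x` of shape `(b, n)`, a weight gradient `grad_w` of shape `(m, n)`, and a conditioner of the form `(I + xᵀx / (b·lam))⁻¹`, where `lam` is a hypothetical damping parameter. The abstract does not state this form. The function name and arguments are likewise made up for the example. When the batch size `b` is smaller than the layer width `n`, the Woodbury identity lets the `n × n` inverse be replaced by a `b × b` linear solve, which is one way the overhead can stay small in that regime.

```python
import numpy as np


def input_conditioned_gradient(grad_w, x, lam):
    """Condition a weight gradient using only the layer inputs (illustrative).

    grad_w: (m, n) gradient of the loss w.r.t. the layer weights.
    x:      (b, n) inputs to the layer for the current batch.
    lam:    positive damping value (hypothetical hyperparameter).

    Computes grad_w @ inv(I_n + x.T @ x / (b * lam)) without forming the
    n x n inverse, using the Woodbury identity:
        inv(I_n + x.T x / c) = I_n - x.T inv(I_b + x x.T / c) x / c,
    with c = b * lam.
    """
    b, n = x.shape
    c = b * lam
    # Small b x b system instead of an n x n one.
    k = np.eye(b) + x @ x.T / c            # (b, b), symmetric positive definite
    gx = grad_w @ x.T                      # (m, b)
    # k is symmetric, so solving against gx.T and transposing gives gx @ inv(k).
    correction = np.linalg.solve(k, gx.T).T @ x / c   # (m, n)
    return grad_w - correction


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.normal(size=(4, 10))       # batch of 4, layer width 10
    grad_w = rng.normal(size=(3, 10))  # 3 output neurons
    step = input_conditioned_gradient(grad_w, x, lam=0.1)
    print(step.shape)
```

In this sketch a large `lam` leaves the gradient nearly unchanged, while a small `lam` shrinks the gradient components that lie in the span of the batch inputs.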
