Preconditioning for Scalable Gaussian Process Hyperparameter Optimization - 专知 (Zhuanzhi) Papers


Optimizer · Hyperparameter · Estimation/Estimator · Processing (programming language) · Reducible

June 12, 2022

Preconditioning for Scalable Gaussian Process Hyperparameter Optimization


Jonathan Wenger, Geoff Pleiss, Philipp Hennig, John P. Cunningham, Jacob R. Gardner

From arXiv, International Conference on Machine Learning (ICML)

Gaussian process hyperparameter optimization requires linear solves with, and log-determinants of, large kernel matrices. Iterative numerical techniques are becoming popular to scale to larger datasets, relying on the conjugate gradient method (CG) for the linear solves and stochastic trace estimation for the log-determinant. This work introduces new algorithmic and theoretical insights for preconditioning these computations. While preconditioning is well understood in the context of CG, we demonstrate that it can also accelerate convergence and reduce variance of the estimates for the log-determinant and its derivative. We prove general probabilistic error bounds for the preconditioned computation of the log-determinant, log-marginal likelihood and its derivatives. Additionally, we derive specific rates for a range of kernel-preconditioner combinations, showing that up to exponential convergence can be achieved. Our theoretical results enable provably efficient optimization of kernel hyperparameters, which we validate empirically on large-scale benchmark problems. There our approach accelerates training by up to an order of magnitude.
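The role of the preconditioner described in the abstract can be illustrated with a small NumPy sketch. It is not the authors' implementation: the RBF kernel, the partial pivoted Cholesky preconditioner, the problem size, and all function names below are assumptions chosen for illustration, and dense matrices are used only so the results can be checked directly. The sketch compares conjugate gradient iteration counts with and without a preconditioner P, and computes the gap log det(K) - log det(P), which is the remaining quantity a stochastic trace estimator would have to approximate once log det(P) is known in closed form.

```python
import numpy as np


def rbf_kernel_matrix(x, lengthscale, noise):
    """Dense kernel matrix K_hat = K + noise * I for a 1-D RBF kernel."""
    sq_dist = (x[:, None] - x[None, :]) ** 2
    return np.exp(-0.5 * sq_dist / lengthscale**2) + noise * np.eye(len(x))


def partial_pivoted_cholesky(K, rank, tol=1e-10):
    """Greedy low-rank factor L (n x k, k <= rank) with K ~= L @ L.T."""
    n = K.shape[0]
    L = np.zeros((n, rank))
    diag = np.diag(K).copy()  # diagonal of the current residual K - L L^T
    for j in range(rank):
        pivot = int(np.argmax(diag))
        if diag[pivot] < tol:  # residual is numerically zero: stop early
            return L[:, :j]
        L[:, j] = (K[:, pivot] - L[:, :j] @ L[pivot, :j]) / np.sqrt(diag[pivot])
        diag = np.maximum(diag - L[:, j] ** 2, 0.0)
    return L


def make_preconditioner(L, noise):
    """Return (solve, logdet) for P = L @ L.T + noise * I."""
    n, k = L.shape
    small = noise * np.eye(k) + L.T @ L  # only a k x k system is factorized

    def solve(v):
        # Woodbury identity: P^{-1} v = (v - L small^{-1} L^T v) / noise
        return (v - L @ np.linalg.solve(small, L.T @ v)) / noise

    # Matrix determinant lemma: det(P) = noise^(n - k) * det(small)
    logdet = np.linalg.slogdet(small)[1] + (n - k) * np.log(noise)
    return solve, logdet


def pcg(matvec, b, precond_solve, tol=1e-8, max_iter=1000):
    """Preconditioned conjugate gradients; returns (solution, iterations)."""
    x = np.zeros_like(b)
    r = b.copy()
    z = precond_solve(r)
    p = z.copy()
    rz = r @ z
    b_norm = np.linalg.norm(b)
    for k in range(1, max_iter + 1):
        Ap = matvec(p)
        alpha = rz / (p @ Ap)
        x += alpha * p
        r -= alpha * Ap
        if np.linalg.norm(r) <= tol * b_norm:
            return x, k
        z = precond_solve(r)
        rz_new = r @ z
        p = z + (rz_new / rz) * p
        rz = rz_new
    return x, max_iter


# Hypothetical toy problem: 200 equispaced inputs, random targets.
rng = np.random.default_rng(0)
n, noise = 200, 1e-2
x_train = np.linspace(0.0, 1.0, n)
K_hat = rbf_kernel_matrix(x_train, lengthscale=0.2, noise=noise)
y = rng.standard_normal(n)

L = partial_pivoted_cholesky(K_hat - noise * np.eye(n), rank=15)
precond_solve, logdet_P = make_preconditioner(L, noise)

sol_plain, iters_plain = pcg(lambda v: K_hat @ v, y, lambda r: r)
sol_pre, iters_pre = pcg(lambda v: K_hat @ v, y, precond_solve)

# Dense reference value, affordable only because n is small here.
logdet_K = np.linalg.slogdet(K_hat)[1]
logdet_gap = logdet_K - logdet_P

print("CG iterations without preconditioner:", iters_plain)
print("CG iterations with preconditioner:   ", iters_pre)
print("log det(K) - log det(P):             ", logdet_gap)
```

In this sketch the preconditioner is applied through a small k x k solve, so its cost per iteration stays low relative to a matrix-vector product with the full kernel matrix. A practical implementation would replace the dense reference log-determinant with a stochastic estimate of the gap term.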


