高高斯进程快速和可缩放的斯派克和斯拉布变量选择 (Fast and Scalable Spike and Slab Variable Selection in High-Dimensional Gaussian Processes) - 专知论文

会员服务 ·

0

尖峰和平板 · FAST · Processing（编程语言） · 模型平均 · 易处理的 ·

2021 年 11 月 8 日

Fast and Scalable Spike and Slab Variable Selection in High-Dimensional Gaussian Processes

翻译：高高斯进程快速和可缩放的斯派克和斯拉布变量选择

Hugh Dance,Brooks Paige

Variable selection in Gaussian processes (GPs) is typically undertaken by thresholding the inverse lengthscales of `automatic relevance determination' kernels, but in high-dimensional datasets this approach can be unreliable. A more probabilistically principled alternative is to use spike and slab priors and infer a posterior probability of variable inclusion. However, existing implementations in GPs are extremely costly to run in both high-dimensional and large-$n$ datasets, or are intractable for most kernels. As such, we develop a fast and scalable variational inference algorithm for the spike and slab GP that is tractable with arbitrary differentiable kernels. We improve our algorithm's ability to adapt to the sparsity of relevant variables by Bayesian model averaging over hyperparameters, and achieve substantial speed ups using zero temperature posterior restrictions, dropout pruning and nearest neighbour minibatching. In experiments our method consistently outperforms vanilla and sparse variational GPs whilst retaining similar runtimes (even when $n=10^6$) and performs competitively with a spike and slab GP using MCMC but runs up to $1000$ times faster.

翻译：Gausian 进程( GPs) 的变量选择通常采用“ 自动关联度确定” 内核的反长尺度, 而在高维数据集中, 这种方法可能是不可靠的。更概率原则性的替代办法是使用钉钉和板的前缀, 并推出一个包含变量的外在概率。但是, Gosian 中的现有实施对于在高维和大一美元数据集运行,或者对于大多数内核来说都是难处理的。因此, 我们开发了一个快速和可扩缩的“ 自动关联度确定” 内核螺旋内核的变异性算法, 这个方法可以随任意的不同内核而可移动。我们提高我们的算法能力, 以平均高于超光度光度计的方式适应Bayesian 模型的相关变量的宽度, 并且使用零温度后、抛出和近邻的微型燃烧。在实验中, 我们的方法始终优于 Vanilla 和稀薄的变异性GPs, 同时保留类似的运行时间( 即使当 $M= 10) 和快速运行到和 10 MS 。

0

相关内容

尖峰和平板

尖峰和平板

不可错过！「强化学习导论」多伦多大学2021课程，附SLIDES与140页pdf

不可错过！「强化学习导论」多伦多大学2021课程，附SLIDES与140页pdf

专知会员服务

67+阅读 · 2021年3月24日

【ETH】最新《几何数据分析》2020课程，附PPT下载

专知会员服务

44+阅读 · 2020年12月18日

【香港中文大学-VLDB2020】Dash:可扩展的持久内存哈希，Scalable Hashing

【香港中文大学-VLDB2020】Dash:可扩展的持久内存哈希，Scalable Hashing

专知会员服务

25+阅读 · 2020年3月17日

在线变分推断，76页ppt，A Regret Bound for Online Variational Inference

在线变分推断，76页ppt，A Regret Bound for Online Variational Inference

专知会员服务

21+阅读 · 2019年12月2日

【ECML-PKDD 2019】基于bagged-trees学习的可解释生存梯度提升模型（Interpretable survival gradient boosting models with bagged trees base learners）

【ECML-PKDD 2019】基于bagged-trees学习的可解释生存梯度提升模型（Interpretable survival gradient boosting models with bagged trees base learners）

专知会员服务

6+阅读 · 2019年12月1日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

专知会员服务

244+阅读 · 2019年10月21日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

分布式TensorFlow入门指南

分布式TensorFlow入门指南

机器学习研究会

4+阅读 · 2017年11月28日

【推荐】xLearn：一款专门针对大规模稀疏数据的机器学习库

【推荐】xLearn：一款专门针对大规模稀疏数据的机器学习库

机器学习研究会

3+阅读 · 2017年11月25日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

语音识别之--扑朔迷“离”

语音识别之--扑朔迷“离”

微信AI

6+阅读 · 2017年8月9日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Escaping the curse of dimensionality in Bayesian model based clustering

Arxiv

0+阅读 · 2022年1月11日

Quantum Unsupervised and Supervised Learning on Superconducting Processors

Arxiv

0+阅读 · 2022年1月6日

On the optimization of hyperparameters in Gaussian process regression with the help of low-order high-dimensional model representation

Arxiv

0+阅读 · 2022年1月6日

Functional-Input Gaussian Processes with Applications to Inverse Scattering Problems

Arxiv

0+阅读 · 2022年1月5日

An approximate Bayes factor based high dimensional MANOVA using Random Projections

Arxiv

0+阅读 · 2022年1月5日

High-dimensional variable selection with heterogeneous signals: A precise asymptotic perspective

Arxiv

0+阅读 · 2022年1月5日

Partial Separability and Functional Graphical Models for Multivariate Gaussian Processes

Arxiv

0+阅读 · 2022年1月4日

Modeling Human Driver Interactions Using an Infinite Policy Space Through Gaussian Processes

Arxiv

0+阅读 · 2022年1月3日

Large-Scale Stochastic Sampling from the Probability Simplex

Arxiv

3+阅读 · 2018年6月19日

Being Robust (in High Dimensions) Can Be Practical

Arxiv

3+阅读 · 2017年12月14日

VIP会员

文章信息

相关主题

尖峰和平板

Processing（编程语言）

相关VIP内容

不可错过！「强化学习导论」多伦多大学2021课程，附SLIDES与140页pdf

不可错过！「强化学习导论」多伦多大学2021课程，附SLIDES与140页pdf

专知会员服务

67+阅读 · 2021年3月24日

【ETH】最新《几何数据分析》2020课程，附PPT下载

专知会员服务

44+阅读 · 2020年12月18日

【香港中文大学-VLDB2020】Dash:可扩展的持久内存哈希，Scalable Hashing

【香港中文大学-VLDB2020】Dash:可扩展的持久内存哈希，Scalable Hashing

专知会员服务

25+阅读 · 2020年3月17日

在线变分推断，76页ppt，A Regret Bound for Online Variational Inference

在线变分推断，76页ppt，A Regret Bound for Online Variational Inference

专知会员服务

21+阅读 · 2019年12月2日

【ECML-PKDD 2019】基于bagged-trees学习的可解释生存梯度提升模型（Interpretable survival gradient boosting models with bagged trees base learners）

【ECML-PKDD 2019】基于bagged-trees学习的可解释生存梯度提升模型（Interpretable survival gradient boosting models with bagged trees base learners）

专知会员服务

6+阅读 · 2019年12月1日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

专知会员服务

244+阅读 · 2019年10月21日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

操作系统智能体：基于多模态大模型（MLLM）的通用计算设备智能体综述

《美国太空军系统全生命周期建模、仿真与分析效能提升方案》最新84页报告

【博士论文】推进数据高效的深度学习：非参数 Transformer、主动测试与上下文学习

自主人工智能：未来战争是否将是自主化的？

相关资讯

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

分布式TensorFlow入门指南

分布式TensorFlow入门指南

机器学习研究会

4+阅读 · 2017年11月28日

【推荐】xLearn：一款专门针对大规模稀疏数据的机器学习库

【推荐】xLearn：一款专门针对大规模稀疏数据的机器学习库

机器学习研究会

3+阅读 · 2017年11月25日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

语音识别之--扑朔迷“离”

语音识别之--扑朔迷“离”

微信AI

6+阅读 · 2017年8月9日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

相关论文

Escaping the curse of dimensionality in Bayesian model based clustering

Arxiv

0+阅读 · 2022年1月11日

Quantum Unsupervised and Supervised Learning on Superconducting Processors

Arxiv

0+阅读 · 2022年1月6日

On the optimization of hyperparameters in Gaussian process regression with the help of low-order high-dimensional model representation

Arxiv

0+阅读 · 2022年1月6日

Functional-Input Gaussian Processes with Applications to Inverse Scattering Problems

Arxiv

0+阅读 · 2022年1月5日

An approximate Bayes factor based high dimensional MANOVA using Random Projections

Arxiv

0+阅读 · 2022年1月5日

High-dimensional variable selection with heterogeneous signals: A precise asymptotic perspective

Arxiv

0+阅读 · 2022年1月5日

Partial Separability and Functional Graphical Models for Multivariate Gaussian Processes

Arxiv

0+阅读 · 2022年1月4日

Modeling Human Driver Interactions Using an Infinite Policy Space Through Gaussian Processes

Arxiv

0+阅读 · 2022年1月3日

Large-Scale Stochastic Sampling from the Probability Simplex

Arxiv

3+阅读 · 2018年6月19日

Being Robust (in High Dimensions) Can Be Practical

Arxiv

3+阅读 · 2017年12月14日

微信扫码咨询专知VIP会员