Distributed Bootstrap for Simultaneous Inference Under High Dimensionality - Zhuanzhi (专知) Papers


February 19, 2021

Distributed Bootstrap for Simultaneous Inference Under High Dimensionality


Yang Yu, Shih-Kang Chao, Guang Cheng

From arXiv. arXiv admin note: text overlap with arXiv:2002.08443

We propose a distributed bootstrap method for simultaneous inference on high-dimensional massive data that are stored and processed with many machines. The method produces an $\ell_\infty$-norm confidence region based on a communication-efficient de-biased lasso, and we propose an efficient cross-validation approach to tune the method at every iteration. We theoretically prove a lower bound on the number of communication rounds $\tau_{\min}$ that warrants the statistical accuracy and efficiency. Furthermore, $\tau_{\min}$ increases only logarithmically with the number of workers and the intrinsic dimensionality, while remaining nearly invariant to the nominal dimensionality. We test our theory with extensive simulation studies and a variable screening task on a semi-synthetic dataset based on the US Airline On-time Performance dataset. The code to reproduce the numerical results is available on GitHub: https://github.com/skchao74/Distributed-bootstrap.
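To make the idea of an $\ell_\infty$-norm confidence region built from worker-level summaries more concrete, the following is a minimal sketch under simplifying assumptions. It is not the authors' procedure: it targets a plain mean vector rather than a de-biased lasso estimate, uses a single communication round, and assumes equal-sized shards. The function name `linf_confidence_region` and its parameters are hypothetical and chosen only for illustration.

```python
import numpy as np


def linf_confidence_region(worker_data, alpha=0.05, n_boot=2000, seed=0):
    """Simultaneous (max-norm) confidence region for a mean vector.

    Illustrative stand-in only: each worker sends its local mean to the
    master, and the master runs a Gaussian multiplier bootstrap on those
    worker-level summaries. Assumes every shard has the same number of rows.
    """
    sizes = {x.shape[0] for x in worker_data}
    if len(sizes) != 1:
        raise ValueError("this sketch assumes equal-sized shards")
    n = sizes.pop()            # rows per worker
    k = len(worker_data)       # number of workers
    rng = np.random.default_rng(seed)

    # One round of communication: each worker contributes a d-vector.
    local_means = np.stack([x.mean(axis=0) for x in worker_data])  # (k, d)
    center = local_means.mean(axis=0)                              # (d,)

    # Centered, rescaled worker summaries; each row has roughly the
    # covariance of a single observation.
    scaled = np.sqrt(n) * (local_means - center)                   # (k, d)

    # Multiplier bootstrap: perturb the worker summaries with i.i.d.
    # standard normal weights and record the max-norm of each draw.
    weights = rng.standard_normal((n_boot, k))
    boot_stats = np.abs(weights @ scaled / np.sqrt(k)).max(axis=1)

    # The (1 - alpha) quantile, rescaled by sqrt(total sample size),
    # gives the half-width shared by all coordinates.
    radius = np.quantile(boot_stats, 1 - alpha) / np.sqrt(n * k)
    return center, radius
```

The region is then `center[j] - radius <= theta[j] <= center[j] + radius` for all coordinates `j` at once. Because only `k` worker summaries feed the bootstrap, the quantile estimate is noisy when the number of workers is small, which is one reason a multi-round scheme such as the one described in the abstract may be preferable in practice.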

